An image description network and technology based on attribute enhancement attention model
An attention model and image description technology, applied in the field of neural networks, can solve the problems of redundant regional features, scattered attention model weights, not considering whether there is key information, etc., to achieve the effect of improving the ability.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0033] An image description network based on attribute-enhanced attention model, including:
[0034] Attribute prediction model: used to use attention technology to use image features as input to obtain prediction results of attribute words; the image features are grid area features of images extracted using convolutional neural networks;
[0035] Sentence generation model: used to generate sentences by using the prediction results of the attribute prediction model as input through attention technology; the prediction results of the attribute prediction model include attribute word distribution information and visual features corresponding to each attribute word.
[0036] In the image description network provided in this embodiment, before constructing and training the image description network, it is first necessary to obtain a data set that can be used for image description and perform data preprocessing on the data set. The dataset consists of images and sentences describin...
Embodiment 2
[0049] An image captioning technique based on an attribute-enhanced attention model, including:
[0050] S1. Obtain a data set for image description and perform data preprocessing on the data set; the data set includes images and sentences used to describe the images; data preprocessing includes extracting attribute words from images and preprocessing sentences;
[0051] S2. Utilize the convolutional neural network and the recurrent neural network to construct the image description network based on the attribute-enhanced attention model as described in embodiment 1, and the image description network includes an attribute prediction model and a sentence generation model;
[0052] S3. First, use attention technology to input image information and attribute word information into the attribute prediction model, and train the attribute prediction model through the cross-entropy loss function as shown below,
[0053]
[0054] In the above formula, V is the grid area feature extra...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com