Image description method of bidirectional multi-mode recursive network
An image description and multi-modal technology, applied in the direction of still image data retrieval, still image data index, biological neural network model, etc., can solve the problems of keeping unchanged, changing, and losing visual information, and achieve performance and accuracy improvement , rich visual information, easy to train the effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0018] combine figure 1 , an image description method for a bidirectional multimodal recurrent network, comprising the following steps:
[0019] Step 1, download the image description dataset, and obtain the images in the dataset and their corresponding description sentences;
[0020] Step 2, process the sentences in the training set, extract the words that appear in the sentences and build a vocabulary;
[0021] Step 3, using the pre-trained convolutional neural network to extract the features of the images in the data set;
[0022] Step 4, build a bidirectional multimodal recursive network, and fuse the extracted image features with the corresponding text features;
[0023] Step 5, the network model considers the historical and future text information, combines the fused image features, uses the training set to train the model and makes it converge;
[0024] Step 6: Input a picture into the pre-trained bidirectional multimodal recurrent network model to obtain the corresp...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com