Image description generation method and system based on unsupervised uniqueness optimization
An image description and generation system technology, applied in neural learning methods, biological neural network models, special data processing applications, etc., can solve the problems of lack of diversity and vividness of sentence descriptions, large differences in descriptions, and unstable training. Achieve the effect of avoiding training instability and loss monitoring difficulties, good diversity, and improving the quality of descriptions
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0062] combined with figure 1 , this embodiment proposes an image description generation method based on unsupervised uniqueness optimization, and the implementation process of the method includes:
[0063] S1. Obtain paired images and real sentence descriptions generated by humans, and store them in the training set;
[0064] S2. Using the paired data included in the training set to train the image description retrieval model 10 in the SentEval tool.
[0065] S3. Construct the image description generation model 4 using the encoder-decoder framework.
[0066] In this embodiment, the encoder uses ResNet-101 pre-trained on ImageNet;
[0067] The decoder uses a two-layer LTSM with an attention mechanism, the first layer LSTM focuses on visual information, and the second layer LSTM focuses on language information.
[0068] S4. Acquire the images of the training set and input them into the image description generation model 4. The image description generation model 4 generates s...
Embodiment 2
[0085] combined with figure 2 , this embodiment proposes an image description generation system based on unsupervised uniqueness optimization, which includes:
[0086] Obtain storage module 1, which is used to obtain paired images and real sentence descriptions generated by humans, and store them in the training set;
[0087] The training module 2 is used to train the image description retrieval model 10 of the SentEval tool using the paired data contained in the training set;
[0088] Building block 3 for building an image description generation model 4;
[0089] The split processing module 5 is used to obtain the images of the training set and divide them into multiple batches, and is also used to sequentially and circularly input the images contained in the multiple batches into the image description generation model 4;
[0090] The image description generation model 4 is used to obtain the images of the training set and generate sentence descriptions corresponding to th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com