Image description method based on space-time memory attention
An image description method based on space-time memory attention, applied in fields such as neural learning methods, neural architectures, and biological neural network models. It addresses problems such as ignoring the temporal order of language expression, inaccurate acquisition of attention features, and the resulting obstacles to improving image description quality.
Embodiment Construction
[0025] The following takes the MS COCO image description data set as an example to illustrate the specific implementation steps of the present invention:
[0026] Step (1) Obtain and preprocess the MS COCO image description dataset:
[0027] Step (1.1) Obtain the MS COCO image description dataset, which contains image data I and its corresponding standard description data. The download address of the MS COCO dataset is http://cocodataset.org/#download. The dataset contains a total of 164,062 pictures; the training, validation, and test sets contain 82,783, 40,504, and 40,775 pictures, respectively. Except for the test set, each picture also has at least 5 corresponding standard descriptions as labels. Some samples are shown in Figure 1.
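As an illustration of pairing each picture with its standard descriptions, the following is a minimal sketch. It assumes the publicly documented COCO caption annotation layout, in which a top-level "annotations" list holds records with "image_id" and "caption" fields. The function name `group_captions` and the sample records are hypothetical and are not part of the patent text.

```python
from collections import defaultdict


def group_captions(coco):
    """Group caption strings by image id.

    `coco` is assumed to be a dict shaped like a COCO caption
    annotation file that has already been loaded from JSON.
    """
    grouped = defaultdict(list)
    for ann in coco["annotations"]:
        # Each annotation record ties one caption to one image.
        grouped[ann["image_id"]].append(ann["caption"].strip())
    return dict(grouped)


# Tiny hand-made sample (hypothetical data, not taken from MS COCO).
sample = {
    "annotations": [
        {"image_id": 1, "id": 10, "caption": "A dog runs on grass. "},
        {"image_id": 1, "id": 11, "caption": "A brown dog outside."},
        {"image_id": 2, "id": 12, "caption": "Two people ride bikes."},
    ]
}
captions_by_image = group_captions(sample)
```

With real data, the dict would typically come from `json.load` on a downloaded caption annotation file, and the training and validation images would each map to at least 5 captions.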
[0028] Step (1.2) Preprocess the description data in MS COCO. Set the maximum length of an image description to 16, and replace words with a word frequency less than 6 with "UNK" to reduce the interference of a few noise wor...
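The preprocessing in Step (1.2) can be sketched as follows. The length limit of 16 and the frequency threshold of 6 come from the text above. Lowercasing, whitespace tokenization, and truncating (rather than discarding) over-long descriptions are assumptions made for illustration, and all function names are hypothetical.

```python
from collections import Counter

MAX_LEN = 16   # maximum description length, from Step (1.2)
MIN_FREQ = 6   # words seen fewer than 6 times become "UNK"
UNK = "UNK"


def tokenize(caption):
    # Assumption: lowercase and split on whitespace.
    return caption.lower().strip().split()


def build_vocab(captions, min_freq=MIN_FREQ):
    """Return the set of words whose corpus frequency is >= min_freq."""
    counts = Counter(tok for c in captions for tok in tokenize(c))
    return {word for word, n in counts.items() if n >= min_freq}


def preprocess(caption, vocab, max_len=MAX_LEN):
    """Truncate to max_len tokens and replace rare words with UNK."""
    tokens = tokenize(caption)[:max_len]
    return [t if t in vocab else UNK for t in tokens]


# Toy corpus: "a", "dog", "runs" are frequent; "zebra", "sleeps" are rare.
corpus = ["a dog runs"] * 6 + ["a zebra sleeps"]
vocab = build_vocab(corpus)
short = preprocess("A zebra runs", vocab)
long_caption = preprocess(" ".join(["a"] * 20), vocab)
```

Here "zebra" occurs only once in the toy corpus, so it is mapped to "UNK", while the 20-token caption is cut to 16 tokens.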