Cross-modal image-text mutual indexing method based on self-attention reasoning
An attentional, cross-modal technology applied at the intersection of vision and language to achieve improved accuracy and stability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0028] The present invention will be further described in conjunction with the accompanying drawings and specific embodiments. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. In addition, it should be understood that after reading the content taught by the present invention, those skilled in the art may make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined in the present application.
[0029] See attached figure 1 , 2 , a self-attention reasoning-based cross-modal graphic-text mutual search method designed by the present invention, the specific implementation process is as follows:
[0030] Step 1: Get the data set, get the paired original image data and text annotation data, and divide it into training data set, verification data set and test data set.
[0031] The multimodal datasets used fo...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


