Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Image text matching method and device, electronic equipment and storage medium

A text and image technology, applied in character and pattern recognition, instruments, biological neural network models, etc., can solve problems such as reducing image-text matching effect, reducing retrieval effect, and interference.

Pending Publication Date: 2021-06-08
BEIJING UNISOUND INFORMATION TECH +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above matching process is easily disturbed by polysemous words or locally similar images, which reduces the matching effect of image text; and when disturbed by text or image noise, the retrieval effect will be greatly reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image text matching method and device, electronic equipment and storage medium
  • Image text matching method and device, electronic equipment and storage medium
  • Image text matching method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0040] figure 1 An image-text matching method provided by an embodiment of the present invention.

[0041] In the first aspect, the embodiment of the present invention provides a method for image-text matching, including the following four steps:

[0042] S101: Obtain image features and text features.

[0043] Specifically, in the embodiment of the present application, a convolutional neural network is used to obtain image features, and a word vector model and a cyclic neural network are used to obtain text features.

[0044] S102: Perform image enhancement processing on the image features to generate two noise-containing image features.

[0045] Perform text enhancement processing on the text features to generat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an image text matching method and device, electronic equipment and a storage medium. The method comprises the steps of acquiring image features and text features; performing image enhancement processing on the image features to generate two image features containing noise; performing text enhancement processing on the text features to generate two text features containing noise; processing the two noise-containing image features by using an image memory unit added with noise; processing the two noisy text features by using a noisy text memory unit; and respectively constraining the image memory unit and the text memory unit. According to the embodiment of the invention, through the key memory constraint, the semantic consistency between the image with the noise and the text memory is higher than other memories generated by different images and texts, so that the matching effect is improved; and through addition of two different noises and comparative learning, a good matching effect can still be obtained when image text input is interfered, and the matching robustness is improved.

Description

technical field [0001] The present invention relates to the technical field of image-text matching, in particular to an image-text matching method, device, electronic equipment and storage medium. Background technique [0002] At present, the image-text matching model generally consists of two main modules: feature extraction and feature retrieval. Feature extraction models usually use convolutional neural networks to extract image features, word vector models and cyclic neural networks to extract text features, and map the two features into the same multimodal space; feature retrieval models usually use distance metric functions to measure the distance between the query vector and other feature vectors in the multimodal space, resulting in a retrieval ranking table based on the distance scores. However, the above matching process is easily disturbed by polysemous words or locally similar images, which reduces the matching effect of image text; and when disturbed by text or...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06K9/36G06K9/46G06F40/30G06N3/04
CPCG06F40/30G06V10/20G06V10/40G06N3/044G06F18/22Y02D10/00
Inventor 宁学成刘青松梁家恩
Owner BEIJING UNISOUND INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products