Text and image-oriented cross-media retrieval method and electronic device
A text- and image-oriented cross-media retrieval method and electronic device, in the field of cross-media retrieval. The method addresses insufficient mining of association relationships, unequal information between modalities, and noise, and thereby improves image-text matching.
Embodiment Construction
[0045] To make the objects, principles, technical solutions, and advantages of the present invention clearer, the invention is described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
[0046] The present invention first defines a symbolic representation of images and texts. Let T be the number of words in a text; each text is expressed as S = {s_1, ..., s_T}, where s_t is the feature vector of the t-th word. An image I is denoted V = {v_1, ..., v_N}, where v_n is the feature vector of the n-th region and N is the number of targets extracted from the image. The speech is denoted P = {p_1, ..., p_M}, where p_m is the feature vector of the m-th frame and M is the number of frames extracted from the speech.
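As an illustration of the notation in [0046], the following minimal sketch stores one text, one image, and one speech clip as the sets S, V, and P of feature vectors. The names `Sample`, `random_features`, and `make_sample`, the feature dimension `dim`, and the random placeholder values are assumptions made only for this example; the description does not specify the feature extractors or the vector dimensions.

```python
import random
from dataclasses import dataclass
from typing import List

Vector = List[float]


@dataclass
class Sample:
    """One text/image/speech triple in the notation of paragraph [0046]."""
    S: List[Vector]  # text:   T word feature vectors   s_1 .. s_T
    V: List[Vector]  # image:  N region feature vectors v_1 .. v_N
    P: List[Vector]  # speech: M frame feature vectors  p_1 .. p_M


def random_features(count: int, dim: int, rng: random.Random) -> List[Vector]:
    # Placeholder values only (assumption): a real system would obtain
    # these vectors from word, region, and audio-frame encoders.
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(count)]


def make_sample(T: int, N: int, M: int, dim: int = 8, seed: int = 0) -> Sample:
    # T words, N image regions, M speech frames; `dim` is hypothetical.
    rng = random.Random(seed)
    return Sample(
        S=random_features(T, dim, rng),
        V=random_features(N, dim, rng),
        P=random_features(M, dim, rng),
    )


sample = make_sample(T=5, N=3, M=20)
print(len(sample.S), len(sample.V), len(sample.P))  # prints: 5 3 20
```

In this sketch T, N, and M may differ between samples, which matches the description: each modality contributes a variable-length set of feature vectors rather than a single fixed-size vector.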
[0047] The general framework of the model of the present invention includes three parts, which are text feature representation fused with speech, region feature represe...


