Data retrieval method across text and image modalities

A cross-modal data technology applied to data retrieval across text and image modalities. It addresses problems in existing methods: constraints that make the optimization harder to solve, relaxed solutions whose optimality cannot be guaranteed, and computational cost that makes practical application inconvenient.

Active Publication Date: 2015-12-30
天津中科智能识别有限公司
Cites: 3 | Cited by: 26

AI Technical Summary

Problems solved by technology

On the one hand, these constraints make the optimization problem harder to solve; on the other hand, the solution obtained after relaxation (scaling) is not guaranteed to be optimal.
Taking the low-rank constraint as an example, the alternating direction method of multipliers (ADMM) is usually used to handle it. The Frobenius norm and the nuclear norm then inevitably appear together in each iteration, and the resulting subproblem is solved by eigenvalue decomposition. However, as the number of samples grows, the time and space complexity of the matrix eigenvalue decomposition becomes too large for practical application.
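
To illustrate why each iteration is expensive, the subproblem that combines the two norms commonly takes the form below. This is a standard textbook formulation, not taken from the patent text; the symbols $X$, $Y$, $\tau$ and the matrix size $m \times n$ are assumptions introduced here.

```latex
\min_{X}\ \tau\,\|X\|_{*} + \tfrac{1}{2}\,\|X - Y\|_{F}^{2},
\qquad
X^{\star} = U\,\operatorname{diag}\!\big(\max(\sigma_i - \tau,\ 0)\big)\,V^{\top},
\quad\text{where}\quad
Y = U\,\operatorname{diag}(\sigma_i)\,V^{\top}.
```

The singular values $\sigma_i$ are the square roots of the eigenvalues of $Y^{\top}Y$, so the step amounts to a full matrix decomposition. For a dense $m \times n$ matrix this costs on the order of $O(\min(m^{2}n,\ mn^{2}))$ operations in every iteration, which is why the approach scales poorly with the number of samples.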

Examples

Detailed Description of the Embodiments

[0041] In order to enable those skilled in the art to better understand the solution of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0042] The invention provides a data retrieval method across text modalities and image modalities. It targets the problem that, in practical settings, the optimization algorithms of cross-modal retrieval methods based on subspace learning have high time and space complexity and give unsatisfactory results. Through a cross-modal model based on principal affinity, the invention makes reasonable use of nonlinear feature representation and uses semantic annotation (label) information as the shared subspace, which avoids the optimization problem faced by traditional subspace learning. The resulting retrieval method adapts better and achieved the best results in the experiments. On the one hand, the invention can reduce t...

Abstract

The invention discloses a data retrieval method across text and image modalities. The method comprises the following steps: dividing all text-modality and image-modality data into multiple samples and co-clustering them; computing the principal affinity of all data with respect to the co-clustering centers; taking the semantic vectors of the text-modality and image-modality data as the output representation of a logistic regression classifier, centering the principal affinity nonlinear representation, and training with it as the input representation to obtain multiple classification functions; when a user needs to retrieve a text-modality or image-modality data sample, computing its principal affinity, feeding it into the classification functions to obtain the semantic-layer representation of the sample, and normalizing that to produce the final representation; and computing the retrieval result with an inner product formula. The method realizes cross-modal retrieval quickly and effectively and markedly shortens cross-modal retrieval time.
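
The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not define the affinity function, so a Gaussian similarity to the cluster centers is assumed; the co-clustering step is replaced by hand-picked centers; and every function name, parameter, and data value (`affinity`, `fit_modality`, `embed`, `gamma`, the toy features) is hypothetical.

```python
import math

def affinity(x, centers, gamma=1.0):
    # ASSUMPTION: Gaussian similarity between x and each cluster center.
    return [math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, c)))
            for c in centers]

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def predict(W, x):
    # Each row of W holds the weights of one class; the last entry is the bias.
    return softmax([sum(w * v for w, v in zip(row[:-1], x)) + row[-1]
                    for row in W])

def train_logistic(X, labels, n_classes, lr=0.5, epochs=300):
    # Multinomial logistic regression fitted by batch gradient descent.
    d = len(X[0])
    W = [[0.0] * (d + 1) for _ in range(n_classes)]
    for _ in range(epochs):
        grad = [[0.0] * (d + 1) for _ in range(n_classes)]
        for x, y in zip(X, labels):
            p = predict(W, x)
            for k in range(n_classes):
                err = p[k] - (1.0 if k == y else 0.0)
                for j in range(d):
                    grad[k][j] += err * x[j]
                grad[k][d] += err
        for k in range(n_classes):
            for j in range(d + 1):
                W[k][j] -= lr * grad[k][j] / len(X)
    return W

def fit_modality(samples, labels, centers, n_classes):
    # Affinity features, then centering, then classifier training (one modality).
    A = [affinity(x, centers) for x in samples]
    mean = [sum(col) / len(A) for col in zip(*A)]
    A_centered = [[v - m for v, m in zip(row, mean)] for row in A]
    return mean, train_logistic(A_centered, labels, n_classes)

def embed(x, centers, mean, W):
    # Semantic-layer representation of one sample, L2-normalized.
    a = [v - m for v, m in zip(affinity(x, centers), mean)]
    p = predict(W, a)
    norm = math.sqrt(sum(v * v for v in p))
    return [v / norm for v in p]

# Hypothetical toy data: paired text/image samples with two semantic classes.
labels = [0, 0, 1, 1]
text_train = [[0.0, 0.1], [0.1, 0.0], [1.0, 0.9], [0.9, 1.0]]
image_train = [[0.0, 0.0, 0.2], [0.1, 0.1, 0.0], [1.0, 1.0, 0.8], [0.9, 1.0, 1.0]]
# ASSUMPTION: centers are hand-picked here instead of obtained by co-clustering.
text_centers = [[0.05, 0.05], [0.95, 0.95]]
image_centers = [[0.05, 0.05, 0.1], [0.95, 1.0, 0.9]]

text_mean, text_W = fit_modality(text_train, labels, text_centers, 2)
image_mean, image_W = fit_modality(image_train, labels, image_centers, 2)

# Text query against an image gallery, ranked by inner product.
query = embed([0.95, 0.95], text_centers, text_mean, text_W)
gallery = [embed(x, image_centers, image_mean, image_W) for x in image_train]
scores = [sum(q * g for q, g in zip(query, vec)) for vec in gallery]
ranked = sorted(range(len(gallery)), key=lambda i: -scores[i])
print(ranked)
```

Because both modalities are mapped into the same label space, a text query and an image can be compared directly with an inner product, with no jointly optimized projection between the raw feature spaces. On this toy data the two images that share the query's class rank ahead of the other two.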

Description

Technical field

[0001] The invention relates to data retrieval in technical fields such as computer vision, pattern recognition, and multimedia retrieval, and in particular to a data retrieval method across text modalities and image modalities.

Background

[0002] In the current mobile Internet era, the amount of data grows daily, and the vast majority of it carries multimodal information. A web page, for example, contains both text and images; how to make reasonable use of multimodal information to design a more user-friendly search engine has attracted attention. It is worth noting, however, that the text and image modalities are not symmetrical at the level of feature representation. In addition, the length and discriminative power of the features of the two modalities differ considerably, which poses great challenges for cross-modal retrieval. At present,...

Application Information

IPC(8): G06F17/30
CPC: G06F16/285, G06F16/35, G06F16/951, G06F16/9535
Inventor: 赫然, 谭铁牛, 孙哲南, 梁坚
Owner: 天津中科智能识别有限公司