A Multimodal Search Method Based on Neighbor Graph

A search method and the technology of the nearest neighbor graph, applied in the search field, can solve problems such as inflexibility, inability to freely control the importance of modes, low recall rate, etc., and achieve high search efficiency

Active Publication Date: 2022-03-01
HANGZHOU DIANZI UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The current multimodal search mainly adopts two methods: the first is to perform single-modal search on each modality in the multimodal data, and then merge the search results. It is obvious that the efficiency of this method will increase. It decreases significantly with the increase of the number of modalities and the amount of data; the second is to map the data of each modality to a unified multimodal vector space through the learning method and then search. The disadvantage of this method is After the mapping model is trained, the importance of a modality cannot be freely controlled in the search phase, which is not flexible enough and the recall rate is very low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Multimodal Search Method Based on Neighbor Graph
  • A Multimodal Search Method Based on Neighbor Graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0039] A multi-modal search method based on the neighbor graph provided in this embodiment, its flow chart is as follows figure 1 As shown, in this embodiment, a multimodal search for recipes based on the neighbor graph is used as an example, and the flow chart for further illustration is as follows figure 2 shown.

[0040] First, step S1 is performed to obtain a reference data set, which contains multiple reference objects, and each reference object contains multiple modal data; the modal data types of each reference object are the same.

[0041] Then proceed to step S2 to generate a corresponding feature vector for each modality data, and all feature vectors of each reference object form a reference object vector. The feature vectors are generated using a pre-model. For example, for text data such as raw material data and cooking process data, the existing BERT model can be used to train and extract features and transform them into raw material feature vectors and cooking ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a multi-modal search method based on a neighbor graph. Firstly, each modal data of each reference object in a reference data set is generated into a eigenvector, and then independently calculated according to each eigenvector, and then calculated by an aggregation function fusion to obtain each eigenvector. Query the fusion distance between objects to construct a neighbor graph of reference objects. Then, a query vector containing multiple feature vectors is generated according to the query content, and the query vector is used to perform a multimodal search on the neighbor graph to obtain the most similar query target. The method of the present invention simultaneously queries multiple modalities of the object by querying the fusion distance, and can change the influence weight of different modalities on the fusion distance by adjusting the aggregation function, thereby realizing the importance of the modalities in the search process Flexible manipulation, and improve the efficiency and accuracy of the search.

Description

technical field [0001] The invention belongs to the technical field of search, and in particular relates to a multi-modal search method based on a neighbor graph. Background technique [0002] With the continuous development of the Internet, various applications continue to generate and eventually gather massive amounts of text, pictures, audio, and video data. The multimodality and massiveness of data pose a huge challenge to information retrieval. With the research and progress of intelligent technology, various modal data can be extracted by artificial intelligence and converted into feature vectors for various similarity calculations and various extended applications. Therefore, the multimodal search method Research is very important. [0003] The current multimodal search mainly adopts two methods: the first is to perform single-modal search on each modality in the multimodal data, and then merge the search results. Obviously, the efficiency of this method will increas...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/953
CPCG06F16/953
Inventor 徐小良吕凌威王梦召
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products