Unlock instant, AI-driven research and patent intelligence for your innovation.

Index construction method and device, vector search method and retrieval system

A construction method and index technology, applied in the field of data retrieval, can solve the problem of low accuracy of retrieval results, and achieve the effect of improving search accuracy

Pending Publication Date: 2022-07-26
上海徐毓智能科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the problem of low accuracy of the retrieval results of the vector retrieval method, the embodiment of the present application provides an index construction method, device, vector retrieval method and retrieval system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Index construction method and device, vector search method and retrieval system
  • Index construction method and device, vector search method and retrieval system
  • Index construction method and device, vector search method and retrieval system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0094] Illustrative embodiments of the present application include, but are not limited to, an index construction method, apparatus, data system, and search method.

[0095] As mentioned above, the accuracy of retrieval results obtained by constructing an inverted index in the current database will be low.

[0096] E.g, Figure 1a In is a schematic diagram of a picture database S in a retrieval system, and the database S includes each feature vector corresponding to each picture. Figure 1b In is a schematic diagram of constructing an inverted index for the image database S. like Figure 1b As shown, each feature vector in the database S is firstly assigned to four cluster sets through clustering processing, which are respectively cluster set S1, cluster set S2, cluster set S3 and cluster set S4, among which, cluster set S1, cluster set S2, cluster set S3 and cluster set S4 The set S1 has a corresponding representative point C1, the cluster set S2 has a corresponding represent...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of data retrieval, and discloses an index construction method and device, a vector search method and a retrieval system. The index construction method comprises the steps that a target vector and a first clustering set where the target vector is located are determined, and the first clustering set has a corresponding first representative point; determining at least one second representative point meeting a preset condition for the first clustering set; index association between the target vector and the first representative point and the index association between the target vector and the second representative point are established respectively. Based on the above scheme, in the subsequent vector retrieval process, the distances between the first representative point and the query vector and between the second representative point and the query vector can be calculated respectively to query the corresponding target cluster set, so that the target vector corresponding to the query vector is obtained more accurately, and the query efficiency is improved. The problem that the search precision is low due to the fact that the center point of the clustering set cannot completely represent all the feature vectors in the clustering set can be effectively avoided, and the search precision is effectively improved.

Description

technical field [0001] The present application relates to the technical field of data retrieval, and in particular, to an index construction method, device, vector search method and retrieval system. Background technique [0002] With the rapid growth of data, data retrieval is widely used in image, video, speech, protein molecular structure retrieval and other fields. Since all kinds of data, such as picture data, can be abstracted into high-dimensional feature vectors, the difference between data The similarity can be quantified as the distance between feature vectors in the vector space. For example, the closer the distance between two feature vectors, the higher the similarity of the original data corresponding to the two feature vectors. Therefore, data retrieval can be transformed into vector search in the vector space, that is, the process of searching the database for several data similar to the data to be queried is transformed into the process of searching the dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/901G06F16/906G06F16/903G06F16/51G06F16/55G06F16/53G06F16/31G06F16/35G06F16/33G06F16/61G06F16/65G06F16/63G06F16/71G06F16/75G06F16/73G16B50/30
CPCG06F16/901G06F16/906G06F16/90335G06F16/51G06F16/55G06F16/53G06F16/319G06F16/353G06F16/3347G06F16/61G06F16/65G06F16/63G06F16/71G06F16/75G06F16/73G16B50/30
Inventor 谢超许维芷程倩雅易小萌
Owner 上海徐毓智能科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More