High-dimension vector rapid searching algorithm based on block distance

A block distance and retrieval algorithm technology, applied in the field of data processing
CN102306202AInactive Publication Date: 2012-01-04COMMUNICATION UNIVERSITY OF CHINA

Patent Information

Authority / Receiving Office
CN Β· China
Current Assignee / Owner
COMMUNICATION UNIVERSITY OF CHINA
Publication Date
2012-01-04
Estimated Expiration
Not applicable Β· inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a high-dimension vector rapid searching algorithm based on a block distance and belongs to the field of data processing such as multimedia information searching, intelligent information processing, data mining, and the like. In the invention, an index structure Block B-tree which is converted from high dimension to one dimension and is based on the block distance is provided; a high-dimension vector is mapped into one-dimensional key values by adopting the block distance of the high-dimension vector to a reference point; and the index structure B+-tree is used for managing the key values, and each key value of a leaf node layer is bound with a pointer pointing to a corresponding high-dimension vector. During searching, the same mapping method is used for mapping a query vector into one-dimension query key values, and then similarity calculation is only performed on the high-dimension characteristics of the key values close to the query key values, thereby reducing the calculated quantity and greatly increasing the searching speed. In a similarity matching algorithm of the high-dimension vector, the block distance is a frequently-used measurement way, the operation of the algorithm is simple, and the searching efficiency is higher, while most of the current index structures are provided based on Euclidean distance matching measurement. The index structureprovided by the invention not only supports searching based on the Euclidean distance matching way but also directly supports searching based on the block distance measurement way.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention belongs to the fields of data processing such as multimedia information retrieval, intelligent information processing, and data mining, and specifically relates to a high-dimensional vector fast retrieval algorithm based on block distance. Background technique

[0002] With the development of computer and information technology, a large amount of multimedia data has been produced. How to quickly find the required information in the massive multimedia database is a key issue in the field of multimedia database research. The traditional method is to manually mark the multimedia data, and then realize the multimedia information retrieval through text retrieval. However, manual labeling has the disadvantages of heavy workload and strong subjectivity. For the explosive growth of multimedia data, manual labeling is impossible. Therefore, it is necessary to study content-based multimedia information retrieval technology.

[0003] The technical ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More