Neighbor searching method and neighbor searching system

A neighbor search method and system that address the slow retrieval of massive high-dimensional data and the drop in retrieval speed as dimensionality grows, aiming for both high accuracy and high search speed.

Status: Inactive
Publication Date: 2013-04-03
GUANGDONG TUTUSOU NETWORK TECH

AI Technical Summary

Problems solved by technology

[0003] Comparing the query data point with every point in the database one by one guarantees retrieval accuracy, but becomes very slow for massive high-dimensional data.
Traditional tree-based neighbor search can reach a relatively high accuracy rate only at the cost of retrieval speed, which drops rapidly as the data dimension increases.
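
The exhaustive comparison described in [0003] can be sketched as a linear scan. This is a minimal illustration rather than anything taken from the patent: the function names `euclidean` and `linear_scan` and the sample data are assumptions. Each query costs one distance computation per database point, which is why the approach slows down on massive high-dimensional data.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def linear_scan(query, database, k=1):
    """Return the indices of the k database points closest to query.

    Every point is compared with the query, so the result is exact,
    but the cost grows with both database size and dimension.
    """
    order = sorted(range(len(database)),
                   key=lambda i: euclidean(query, database[i]))
    return order[:k]

# Hypothetical sample data for illustration only.
database = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0), (0.9, 1.2)]
nearest = linear_scan((1.0, 1.0), database, k=2)
```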



Examples


Embodiment Construction

[0029] The present invention is a neighbor search method and system based on a hash algorithm. It converts data into short binary strings and then builds a hash table over them to make search efficient. Because data of any dimension is ultimately converted into a short binary string (for example, 0110 is a 4-bit binary string), hash-based neighbor search is insensitive to dimension and can retrieve high-dimensional data quickly. The present invention is explained in detail below with reference to the accompanying drawings and embodiments.
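
The idea in [0029] can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the hyperplanes here are drawn at random, whereas the invention learns them, and all names (`make_hyperplanes`, `hash_code`, `build_table`, `candidates`) are hypothetical. Each hyperplane contributes one bit, so a point of any dimension maps to a code of `num_bits` characters, and a query only inspects the bucket that shares its code.

```python
import random
from collections import defaultdict

def make_hyperplanes(num_bits, dim, seed=0):
    """Assumption: random hyperplanes (w, b) stand in for learned ones."""
    rng = random.Random(seed)
    return [([rng.gauss(0, 1) for _ in range(dim)], 0.0)
            for _ in range(num_bits)]

def hash_code(point, hyperplanes):
    """One bit per hyperplane: '1' if the point lies on the non-negative side."""
    bits = []
    for w, b in hyperplanes:
        side = sum(wi * xi for wi, xi in zip(w, point)) + b
        bits.append('1' if side >= 0 else '0')
    return ''.join(bits)

def build_table(points, hyperplanes):
    """Hash table mapping each binary code to the indices of points in that bucket."""
    table = defaultdict(list)
    for idx, p in enumerate(points):
        table[hash_code(p, hyperplanes)].append(idx)
    return table

def candidates(query, table, hyperplanes):
    """Indices of the points that share the query's bucket."""
    return table.get(hash_code(query, hyperplanes), [])

# Hypothetical 8-dimensional data hashed to 4-bit codes.
rng = random.Random(1)
points = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(50)]
planes = make_hyperplanes(num_bits=4, dim=8)
table = build_table(points, planes)
```

The candidates returned from a bucket would normally be re-ranked by true distance; how the patent handles that step is not stated in this excerpt.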

[0030] The neighbor search method of the present invention, as shown in Figure 1, comprises two stages: offline learning and online search.

[0031] The offline learning process, shown in Figure 1, proceeds as follows:

[0032] Step S101: randomly and uniformly select several data points from the training data set as anchor points, obtain a k...


Abstract

The invention discloses a neighbor search method and a neighbor search system. The method comprises offline learning and online search. A learning function that counts the data points falling in small regions on either side of a hash hyperplane is minimized, so that the hyperplane passes through a sparse region of the data; this supports high accuracy in neighbor search. An approximately balanced bucket regularization term, derived from the approximately balanced bucket condition, is added to the learning function so that the hash hyperplane partitions the data points more evenly; this supports high search speed. The method and system achieve high-accuracy, high-speed neighbor search for both small and massive data sets.
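
The two terms named in the abstract can be illustrated with a toy cost for a single hyperplane. The exact form of the learning function is not given in this excerpt, so everything here is an assumption: the name `hyperplane_cost`, the `margin` that defines the "small regions", the weight `lam`, and the imbalance measure. The sketch only shows why a hyperplane through a sparse, central region scores better than one that cuts through a cluster.

```python
import math

def hyperplane_cost(points, w, b, margin, lam):
    """Toy cost: points near the hyperplane plus a weighted imbalance penalty.

    Assumed form, not the patent's learning function.
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    # Signed distance of each point to the hyperplane w.x + b = 0.
    signed = [(sum(wi * xi for wi, xi in zip(w, p)) + b) / norm
              for p in points]
    near = sum(1 for s in signed if abs(s) < margin)       # density term
    positive = sum(1 for s in signed if s >= 0)
    imbalance = abs(2 * positive - len(points)) / len(points)  # 0 when split evenly
    return near + lam * imbalance

# Two hypothetical clusters along the x-axis.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 0.0), (1.1, 0.0)]
cost_sparse = hyperplane_cost(points, (1.0, 0.0), -0.55, margin=0.2, lam=1.0)
cost_dense = hyperplane_cost(points, (1.0, 0.0), -0.05, margin=0.2, lam=1.0)
```

Here the plane at x = 0.55 lies between the clusters and splits them two and two, while the plane at x = 0.05 cuts through the left cluster and splits the data one and three, so it is penalized by both terms.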

Description

Technical field

[0001] The invention relates to the technical field of information retrieval, and in particular to a neighbor search method and system.

Background

[0002] With the rapid development of information technology and the improvement of data collection capabilities, the volume and dimension of data in various fields have grown exponentially. This rapid growth makes searching extremely difficult. For example, given an input picture, finding the same or similar pictures in a massive picture library requires the search to be both accurate and fast. In this example, each image can be abstracted as a data point, and the similarity between data points can usually be measured by Euclidean distance. Nearest neighbor search means finding the data points that are similar to the query data point under this similarity measure.

[0003] If the query data points are compared with all po...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 钟海兰
Owner: GUANGDONG TUTUSOU NETWORK TECH