Unlock instant, AI-driven research and patent intelligence for your innovation.

An Online Hash Nearest Neighbor Query Method Based on Data Block Learning

A query method and data block technology, applied in the field of online nearest neighbor query, can solve problems such as time-consuming, inefficient dynamic data, and inability to read data into memory, and achieve the goal of improving learning efficiency, reducing storage space, and improving query efficiency. Effect

Active Publication Date: 2022-04-01
恩施安贝森科技服务股份有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, for large-scale data, because the hard disk space is much larger than the memory, it is impossible to read all the data into the memory, and it is also very time-consuming to process and calculate a large amount of data at one time.
However, most of the current methods are based on the hash learning method of batch processing technology, which is equivalent to considering all the data and retraining the hash function when new data arrives, which is inefficient for streaming dynamic data
[0003] At present, online hash learning nearest neighbor search based on a pair of data, although the update frequency and stability of the learned hash has been improved, but the actual processing mechanism is still updated based on a pair of data points, in order to speed up the hash function Update efficiency, propose a nearest neighbor query method based on online hash learning on block data
[0004] The goal of online hash learning is to process streaming data sequentially, but it relies on current training data. There have been a lot of related algorithm research, but there is less research on online hash functions based on data block learning.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Online Hash Nearest Neighbor Query Method Based on Data Block Learning
  • An Online Hash Nearest Neighbor Query Method Based on Data Block Learning
  • An Online Hash Nearest Neighbor Query Method Based on Data Block Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0029] An online hash nearest neighbor query method based on data block learning, comprising the following steps:

[0030] ① Image data acquisition and preprocessing: acquire the original two-dimensional image data set containing the original two-dimensional image, convert the original two-dimensional image data set equivalently into a numerical matrix that retains the original features according to the image pixel information, and perform data processing on the numerical matrix Cleaning and dimensionality reduction are two-step operations, and the specific operation process is as follows:

[0031] ①-1 Use binning, clustering and regression methods to manually process the outlier values ​​in the original two-dimensional image data set, replace the outlier values ​​with the mean value, and complete the normalization operation on the original tw...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an online hash nearest neighbor query method based on data block learning, which is characterized in that it includes the following steps: acquiring and preprocessing image data, defining a hash model for processing data, establishing and judging whether the updated hash vector is reasonable The Hamming distance predicts the loss function, obtains the objective function, optimizes the objective function, and performs online hash nearest neighbor query on the given data to be queried in the test database; the advantage is that it is mainly based on the idea of ​​​​data blocks, and each time processing stream small data block, and design an optimization algorithm in a smaller data space to improve learning efficiency. In the design method, the Hamming space loss between data samples inside the data block is guaranteed to be minimal, and at the same time, the incremental change of the overall online learning is also constrained. , the noisy data is effectively controlled, the query efficiency of the given data to be queried is greatly improved, and the overall storage space is reduced.

Description

technical field [0001] The invention relates to an online nearest neighbor query method, in particular to an online hash nearest neighbor query method based on data block learning. Background technique [0002] Nearest Neighbor Search is an important research direction in the field of information retrieval, and it is widely used in image retrieval and data mining. In the hash-based nearest neighbor algorithm, the original data is mapped to the binary code in the Hamming space through the hash function, and the Hamming distance coding is used to be as close as possible to the original input space data, and the XOR operation using the speed of the computer close to the hardware has high efficiency. It has the advantages of data processing and search efficiency, so it is widely used. In practical applications, there are various forms of data, most of which are dynamically generated, such as the amount of new web pages, flight and railway traffic, and weather condition informat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22G06F16/23G06F16/2455
Inventor 胡伟钱江波任艳多孙瑶
Owner 恩施安贝森科技服务股份有限公司