Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Random structure conformal Hash information retrieval method

A technology of information retrieval and shape-conserving hashing, which is applied in special data processing applications, instruments, electrical digital data processing, etc., and can solve the problem that hash algorithms cannot obtain effective results, linear hash functions cannot map data point relationships, Dimensional data cannot inherit high-dimensional data to the greatest extent

Inactive Publication Date: 2015-02-25
NANJING UNIV OF INFORMATION SCI & TECH
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the effect of sparse coding, in order to ensure the sparsity of the obtained representation, the non-negative local coordinate factorization (NLCF) adds local coordinate constraints
[0013] In summary, the deficiencies of the existing technology can be summarized as follows: First, the existing NMF algorithm cannot solve the problem of protecting the local and overall structure of the original high-dimensional data, so the obtained low-dimensional data cannot be maximized. Inherit the characteristics of high-dimensional data; second, the existing hash algorithm based on random projection has to generate a lot of hash tables to obtain a certain retrieval effect, and the simple linear hash function cannot map out the potential between data points. Third, when the codeword is very long, the learning-based hash algorithm cannot achieve effective results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Random structure conformal Hash information retrieval method
  • Random structure conformal Hash information retrieval method
  • Random structure conformal Hash information retrieval method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0090] Embodiment 1, the random structure conformal hash information retrieval method of the present invention is used to solve the problem in similarity search. There are two large-scale databases: one is SIFT1M based on the SIFT operator, and the other is GIST1M based on the GIST operator; among them, the SIFT database has 1 million data points with a dimension of 128, and the GIST database has 1 million data points with a dimension of 960 data points; the basic parameters of the two large databases in the similarity search are detailed in Table 1.

[0091] Table 1: Table of basic parameters for two large databases in similarity search

[0092] database

SIFT dim = 128

GIST dim = 960

database size

1,000,000

1,000,000

The size of the test sample

10,000

10,000

The size of the training sample

990,000

990,000

[0093] In order to protect as many important structures of high-dimensional data as possible, the pre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a random structure conformal Hash information retrieval method. The random structure conformal Hash information retrieval method is characterized by including the steps of (1) protecting important structures of high dimensional data, conducting dimensionality reduction on original high dimensional data through a provided objective function, and accordingly obtaining low dimensional data; (2) calculating a basic dimensionality matrix and a low dimensionality matrix of the original high dimensional data through obtained updating rules of basic operators U and the low dimensional data V; (3) setting a threshold value, converting low dimensionality real number expressions in a training set into binary codes, and calculating Hash codes of a testing sample through probability statistics disaggregated model logistic regression; (4) calculating the Hamming distance, namely XOR operation, between the training data and the testing sample, and obtaining final results. By means of the random structure conformal Hash information retrieval method, on the basis that distribution of random data and the local and overall structures of the high dimensional data are protected, a Hash function is successfully obtained through the multivariable logistic regression, surpassing sample expansion can be achieved, and the random structure conformal Hash information retrieval method is suitable for computer visions, data mining, machine learning or similar searching fields.

Description

technical field [0001] The invention belongs to the technical field of computer information data processing, and in particular relates to a random structure shape-preserving hash information retrieval method for computer vision, data mining, machine learning or similar search. Background technique [0002] Similarity search is a problem to be solved in information retrieval, machine learning, pattern recognition and data mining. In general, effective similarity search methods build index structures in the metric space, and early research on similarity search can be traced back to the 1970s. Specifically, when the dimensionality is lower than or equal to 20, some methods based on data structures such as KD-tree, VP-tree and R+tree can solve the problem of similarity search. However, with the increase of data dimensions, the difficulty of how to effectively implement similarity search in the field of information data processing continues to increase. The existing methods ado...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/325G06F16/2255G06F16/9014
Inventor 邵岭蔡子贇刘力余孟洋
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products