Unlock instant, AI-driven research and patent intelligence for your innovation.

A fast approximate k-nearest neighbor method based on tree strategy and balanced k-means clustering

A technology of K-means and K-nearest neighbors, which is applied in the directions of instruments, calculations, character and pattern recognition, etc., can solve the problems of low algorithm efficiency, achieve the effects of eliminating uncertainty, improving search accuracy, and reducing search time

Active Publication Date: 2021-08-10
NORTHWESTERN POLYTECHNICAL UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, K-nearest neighbor retrieval in large-scale high-dimensional data has always been one of the hot issues that are difficult to overcome. In the tree structure method, KD tree, KD random forest, etc. have good results, but in the KD tree algorithm, The retrieval process needs to go back to the previous node continuously. The higher the dimension, the more backtracking times are required, and the efficiency of the algorithm will be lower. In KD random forest, although the backtracking problem can be alleviated to a certain extent, due to KD Random forest uses multiple KD trees to search together. How to balance memory usage and algorithm efficiency has become a new problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fast approximate k-nearest neighbor method based on tree strategy and balanced k-means clustering
  • A fast approximate k-nearest neighbor method based on tree strategy and balanced k-means clustering
  • A fast approximate k-nearest neighbor method based on tree strategy and balanced k-means clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012] The present invention will be further described below in conjunction with the accompanying drawings and embodiments, and the present invention includes but not limited to the following embodiments.

[0013] Such as figure 1 As shown, the present invention provides a fast approximate K-nearest neighbor method based on the tree strategy and balanced K-means clustering, which mainly consists of two main steps of building a balanced tree and finding K-nearest neighbors. The basic implementation process is as follows:

[0014] 1. Building a Balanced Tree

[0015] First, the balanced K-means clustering algorithm is used to cluster the input image data set, and the cluster centers of two types of image samples with equal sample numbers are obtained. Specifically:

[0016] The two types of balanced K-means clustering algorithm models are as follows:

[0017]

[0018] Among them, C is the center of the cluster, G is the index matrix, and X is the input image data set, wher...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a fast approximating K nearest neighbor method based on tree strategy and balanced K-means clustering, so as to improve the performance and speed of approximate K neighbor retrieval. First of all, the balanced K-means tree is constructed by the balanced K-means clustering method, so that the data can be efficiently and orderly organized in a tree structure, and the rapid positioning of any new sample data can be realized; then, using the anchor location method and the idea of ​​adjacent clusters, Quickly find multiple approximate neighbors of new data samples through the balanced tree, that is, K nearest neighbors. The method of the invention takes into account the advantages of the tree-based K-nearest neighbor algorithm and the balanced K-means algorithm, and can be applied to multiple fields such as image recognition, data compression, pattern recognition and classification, machine learning, document retrieval systems, statistics and data analysis, and the like.

Description

technical field [0001] The invention belongs to the technical field of machine learning and data mining, and in particular relates to a fast approximate K nearest neighbor method based on tree strategy and balanced K-means clustering. Background technique [0002] In the era of mobile Internet, people's daily life is faced with the impact of massive data every day, such as personal information, video records, image collection, geographic information, log files, etc., in the face of such a large and growing data information, how to process all The effective storage, indexing and querying of the required information is a hot research topic both at home and abroad. [0003] Approximate K-nearest neighbor retrieval was initially applied to document retrieval systems as a method for finding similar document information, and then in geographic information systems, K-nearest neighbor retrieval was also widely used in location information, query, analysis and statistics of spatial d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/23213G06F18/24147G06F18/24323
Inventor 聂飞平车昊轩王宏王榕于为中李学龙
Owner NORTHWESTERN POLYTECHNICAL UNIV