Data dimension reduction method based on neighborhood similarity

A technology of data dimensionality reduction and similarity, applied in the field of data processing, to achieve the effect of improving performance, improving recognition performance, and avoiding information redundancy

Active Publication Date: 2015-06-03
XIDIAN UNIV
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to address the shortcomings of the above-mentioned prior art, and propose a data dimensionality reduction method based on neighborhood similarity, so as to effectively realize the feature extraction and reduction of data in the case of unbalanced data structure distribution. Dimensions, improve classification recognition effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data dimension reduction method based on neighborhood similarity
  • Data dimension reduction method based on neighborhood similarity
  • Data dimension reduction method based on neighborhood similarity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The specific implementation steps and effects of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0037] refer to figure 1 , the implementation steps of the present invention are as follows:

[0038] Step 1. Input the original image.

[0039] Input n=F×P original images, calibrate and align these images, and crop them to the same size, where F is the number of original image categories, and P is the number of images of each category.

[0040] Step 2. Use the original image to obtain the original matrix X.

[0041] The grayscale feature values ​​of each original image pixel are taken out by column, and arranged in sequence to form an m-dimensional column vector, forming an m×n matrix X', normalizing each column of the matrix X', and normalizing The transformation is to make the sum of the elements of each column of the matrix X' equal to 1, that is:

[0042] Among them, v' j is the jth column vector o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data dimension reduction method based on neighborhood similarity and mainly solves the problem that an existing method only uses the Euclidean distance to measure a sample neighborhood structure, so that recognition results are non-ideal when a data structure is not balanced. The data dimension reduction method comprises the following realization steps: (1) inputting and normalizing data and randomly initializing a basis matrix and a coefficient matrix; (2) calculating a diagonal covariance matrix of a sample; (3) calculating KL (Kullback-Leibler) divergence through the diagonal covariance matrix; (4) calculating neighborhood sample similarity through the KL dispersion degree; (5) calculating a neighborhood class label distribution matrix of the sample; (6) calculating neighborhood class label similarity through the neighborhood class label distribution matrix; (7) calculating neighborhood similarity through the neighborhood sample similarity and the neighborhood class label similarity; (8) applying to iterative criterions according to the neighborhood similarity to obtain the basis matrix and the coefficient matrix after dimension reduction. The data dimensionality reduction method is high in accuracy rate and can effectively perform feature extraction and dimensionality reduction on data and be used for data and image processing.

Description

technical field [0001] The invention belongs to the technical field of data processing, in particular to a data dimensionality reduction method, which can be used for data and computer image recognition. Background technique [0002] The rapid development of science and technology in recent years has resulted in an explosion in the amount and availability of raw data. With the development of sensor and computer technology, more and more raw data are available, how to extract useful information from such massive data has become the focus of people's attention. Data dimensionality reduction is an important research field in machine learning. Obtaining an efficient representation through appropriate dimensionality reduction techniques has become an important, necessary and challenging problem in multivariate data analysis. Dimensionality reduction should satisfy two basic properties: first, the size of the original data should be reduced; second, find and retain the principal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
Inventor 王磊姬红兵范笑宇王家俊张文博
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products