Data dimension reduction method for improved neighborhood preserving embedding algorithm

A technology for data dimensionality reduction and algorithm improvement. It is applied in computing, computer parts, instruments, etc. It can solve the problems that the test sample cannot achieve the dimensionality reduction effect, cannot extract the feature of the sample set, and the dimensionality reduction result fluctuates greatly. Data dimensionality reduction, improving the ability of manifold structure, reducing the effect of reconstruction error

Inactive Publication Date: 2018-04-03
XI AN JIAOTONG UNIV
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In order to effectively explore the nonlinear structure contained in the data set, people have developed many effective nonlinear dimensionality reduction methods, such as artificial neural network, genetic algorithm, manifold learning, etc. for nonlinear dimensionality reduction algorithms. Linear algorithms perform well on training samples, but cannot achieve dimensionality reduction for test samples because they lack projection matrices and cannot perform feature extraction on newly added sample sets. In order to solve this problem, the linearized typical manifold learning algorithm is Proposed, such as the manifold learning (NPE) of the neighborhood-preserving embedding algorithm, using local representations to obtain projection matrices, and projecting high-dimensional manifold data to low-dimensional manifold spaces
However, such local representations usually assume that the local manifold space is linear, which will lead to large fluctuations in dimensionality reduction results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data dimension reduction method for improved neighborhood preserving embedding algorithm
  • Data dimension reduction method for improved neighborhood preserving embedding algorithm
  • Data dimension reduction method for improved neighborhood preserving embedding algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The present invention will be further explained below in conjunction with specific embodiments and accompanying drawings.

[0033] Since the NPE algorithm assumes that the manifold space is locally linear, the processing effect on the manifold space with a large curvature is not very good. The present invention uses the geodesic distance to replace the Euclidean distance. By selecting the real neighbor points in the manifold, Mining its inner real space well preserves the local structure information and improves the ability of this method to deal with high-dimensional data.

[0034] see Figure 4 , the present invention comprises the following steps:

[0035] Step 01: Construct an adjacency graph, and use geodesics to calculate the consistent adjacent points of each sample point to form an adjacency matrix;

[0036] Step 02: Calculate the reconstruction weight, and represent each sampling point as an adjacent point;

[0037] Step 03: Calculate the projection matrix, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data dimension reduction method for an improved neighborhood preserving embedding algorithm. The method comprises the steps of firstly, constructing an adjacency graph, and calculating out a near point of each sample point by using a geodesic line, thereby forming an adjacency matrix; secondly, calculating a reconstruction weight value, and representing each sampling point with the near point; and finally, calculating a projection matrix, and performing calculation by utilizing a reconstruction weight value matrix to obtain a transform projection matrix. According tothe method, a Euclidean distance is replaced with a geodesic line distance; local structure information of the NPE algorithm is better kept; and the manifold structure processing capability of the algorithm is improved.

Description

technical field [0001] The invention belongs to the field of big data processing, and relates to a data dimensionality reduction method, in particular to a data dimensionality reduction method with a neighborhood-preserving embedded improved algorithm. Background technique [0002] In the era of big data, the continuous expansion of data volume has led to an information explosion. These data often exhibit high-dimensional characteristics. Because of the complexity of its structure, high-dimensional data is usually difficult to directly process in the real world. For example, the main purpose of data mining is to use efficient algorithms to explore the information hidden behind the data, and finally transform it into knowledge to guide people to make reasonable decisions. In order to properly deal with these high-dimensional data, data dimensionality reduction technology was born. Data dimensionality reduction is the process of projecting data from a high-dimensional feature...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/21322G06F18/21324
Inventor 董渭清李玥郭桑董文鑫陈建友仓剑袁泉
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products