Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Formulae neighborhood based data dimensionality reduction method

A data dimensionality reduction and neighborhood technology, applied in the field of information processing, can solve the problems of being too sensitive to parameters and external noise, and the failure of dimensionality reduction performance, so as to achieve the effect of widening the applicable neighborhood, maintaining consistency, and improving the aggregation effect.

Inactive Publication Date: 2008-12-31
ZHEJIANG UNIV
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method also has some application limitations, such as: it is too sensitive to parameters and external noise, and the dimensionality reduction performance fails when dealing with sparsely distributed data sets.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Formulae neighborhood based data dimensionality reduction method
  • Formulae neighborhood based data dimensionality reduction method
  • Formulae neighborhood based data dimensionality reduction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] As shown in the attached figure, a data dimensionality reduction method based on rule neighborhood includes the following contents:

[0016] 1. Formal description of data dimensionality reduction

[0017] Establish a quintuple model: FO=(X, D, δ, d, Y),

[0018] Among them: D is the dimension of the high-dimensional space; d (d X = { x → 1 , x → 2 , . . . , x → N } , is the high-dimensional space R D N D-dimensional real vectors in ( x → i = ( x → i 1 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data dimension reduction method based on neighborhood rule. The method includes the following steps: firstly, a spherical neighborhood of present sample points is established by using the geometric spherical-modelling theory and all the sample points contained in the spherical neighborhood are adopted as candidate neighbor points, thus not only preserving the effectivity of the dimension reduction capability when data sets are sparse but also getting the advantages of low-sensitivity to isolated points and good stability of the preserved topological structure; then a data relevance matrix more matching semantics can be obtained by relevance measurement based on route clusters to update the candidate neighbor points in the spherical neighborhood and optimize the regular neighborhood space of the present sample points, thus improving the phenomenon that the dimension reduction of sample sets provided with folded curved faces is apt to suffer the integrated-structure distortion in case of heterogeneous data distribution. The experiments on different sample sets demonstrate that the method provided by the invention is available and effective.

Description

technical field [0001] The invention relates to the field of information processing, in particular to a data dimension reduction method based on rule neighborhood. Background technique [0002] Data dimensionality reduction refers to reducing the data in a high-dimensional space to a lower-dimensional space to eliminate the redundancy of the original data and improve the subsequent processing capability of the data. Traditional data dimensionality reduction is mainly based on linearity. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) are the two most widely used linear dimensionality reduction methods. Although these two methods are theoretically mature and computationally fast, they can only reduce the dimensionality of data with a linear structure, and are difficult to directly deal with large-scale, high-dimensional and nonlinear data. [0003] Seung and Lee pointed out in the article "The Manifold Ways of Perception" in the internationally ren...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 姚敏朱蓉
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products