Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Formulae neighborhood based data dimensionality reduction method

A data dimension reduction and neighborhood technology, applied in the field of information processing, can solve problems such as parameters and external noise are too sensitive, dimension reduction performance failure, etc., to achieve the effect of broadening the applicable neighborhood, maintaining consistency, and good aggregation effect

Inactive Publication Date: 2010-11-17
ZHEJIANG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method also has some application limitations, such as: it is too sensitive to parameters and external noise, and the dimensionality reduction performance fails when dealing with sparsely distributed data sets.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Formulae neighborhood based data dimensionality reduction method
  • Formulae neighborhood based data dimensionality reduction method
  • Formulae neighborhood based data dimensionality reduction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] As shown in the figure, a data dimensionality reduction method based on rule neighborhoods includes the following:

[0016] 1. Formal description of data dimensionality reduction

[0017] Establish a five-tuple model: FO = (X, D, δ, d, Y),

[0018] Among them: D is the dimension of the high-dimensional space; d (d X = { x → 1 , x → 2 , . . . , x → N } , Is a high-dimensional space R D N D-dimensional real number vectors in ( x → i = ( x → i 1 , x → i 2 , . . . , x → i D ) T , i = 1,2 . . . , N ) ; Y is the output sample set of the model FO, expressed as: Y = { y → 1 , y → 2 , . . . , y → N } , Is a low-dimensional space R d N d-dimensional real number vectors in ( y → i = ( y → i 1 , y → i 2 , . . . , y → i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data dimension reduction method based on neighborhood rule. The method includes the following steps: firstly, a spherical neighborhood of present sample points is established by using the geometric spherical-modelling theory and all the sample points contained in the spherical neighborhood are adopted as candidate neighbor points, thus not only preserving the effectivityof the dimension reduction capability when data sets are sparse but also getting the advantages of low-sensitivity to isolated points and good stability of the preserved topological structure; then adata relevance matrix more matching semantics can be obtained by relevance measurement based on route clusters to update the candidate neighbor points in the spherical neighborhood and optimize the regular neighborhood space of the present sample points, thus improving the phenomenon that the dimension reduction of sample sets provided with folded curved faces is apt to suffer the integrated-structure distortion in case of heterogeneous data distribution. The experiments on different sample sets demonstrate that the method provided by the invention is available and effective.

Description

Technical field [0001] The present invention relates to the field of information processing, in particular to a data dimensionality reduction method based on rule neighborhoods. Background technique [0002] Data dimensionality reduction refers to reducing the data in a high-dimensional space to a space with a lower dimensionality to eliminate the redundancy of the original data and improve the subsequent processing ability of the data. Traditional data dimensionality reduction is mainly based on linear. Principle Component Analysis (PCA) and Linear Discriminant Analysis (LDA) are currently the two most widely used linear dimensionality reduction methods. Although these two methods are mature in theory and fast in calculation, they can only reduce the dimensionality of data with linear structure, and it is difficult to directly process large-scale, high-dimensional and nonlinear data. [0003] Seung and Lee pointed out in the article "The Manifold Ways of Perception" in the inter...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
Inventor 姚敏朱蓉
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products