Unlock instant, AI-driven research and patent intelligence for your innovation.

Data characteristic selection method with structure maintenance characteristic

A data feature and feature selection technology, applied in the field of information processing, can solve problems such as overfitting, reduce feature selection results, and difficult solutions, and achieve the effects of avoiding noise, ensuring convergence, and improving robustness

Active Publication Date: 2018-08-10
XI'AN INST OF OPTICS & FINE MECHANICS - CHINESE ACAD OF SCI
View PDF7 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Although filtering-based methods are relatively easy to implement, such methods evaluate features separately and often ignore global information, so they cannot achieve better feature selection results on some tasks.
Encapsulation-based methods often have a large time complexity and are prone to overfitting, thereby reducing the result of feature selection
Embedding-based methods usually need to learn a class label matrix, but the real class label matrix is ​​composed of discrete values, which is difficult to solve. Therefore, current methods use continuous values ​​to approximate discrete values, which will introduce noise and get unstable Feature Selection Results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data characteristic selection method with structure maintenance characteristic
  • Data characteristic selection method with structure maintenance characteristic
  • Data characteristic selection method with structure maintenance characteristic

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] Below in conjunction with accompanying drawing, the step that the present invention realizes is described in further detail:

[0068] refer to figure 1 , the steps that the present invention realizes are as follows:

[0069] Step 1, determine the original data set X, and construct a self-expression model of the original data set X;

[0070] X=N×d, wherein, N is the number of data, and d is the dimension of data features; both N and d are positive integers;

[0071] The specific construction method is:

[0072] For the i-th feature of the original dataset X, build a self-expressive model:

[0073]

[0074] where w ji is the expression coefficient, f i Represents the i features of the original data set X, |·| p is the p-norm of the original data set X, f j Represents the j features of the original data set X;

[0075] The self-expressive model of the original dataset X is:

[0076] min||W|| p ,X=XW, (2)

[0077] Among them, W ∈ R d×d , W is the reconstructe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data characteristic selection method with a structure maintenance characteristic. A more effective monitoring-free characteristic selection algorithm can be obtained, according to the algorithm, a self-expression model is used to model a characteristic selection problem,. noise problem caused by learning pseudo label data can be avoided, the robustness of the algorithm isimproved by adding the structure maintenance characteristic, and a clustering result of higher precision is obtained. The method is realized by the steps that (1) an original data set X is determined, and the self-expression model of the original data set X is constructed; (2) a local manifold structure maintenance constraint is added to the self-expression model; (3) a reconstruction coefficientmatrix W added with the local manifold structure maintenance constraint is constrained to obtain a target function expression; (4) the target function expression is solved in an optimized way; and (5) characteristic selection is carried out on a characteristic selection matrix obtained by solution.

Description

technical field [0001] The invention belongs to the technical field of information processing, and in particular relates to a data feature selection method with structure-preserving characteristics. Background technique [0002] Feature selection is a very effective data analysis technique. It has attracted the attention and research of many scholars and has achieved good results in many practical tasks. It has been applied to image processing and computer vision, such as face clustering. , handwritten character recognition and object classification. From the perspective of whether to use labeled data, feature selection can be divided into three categories: supervised feature selection, semi-supervised feature selection, and unsupervised feature selection. [0003] Supervised feature selection is trained on labeled data, and then experimentally verified on test samples. Because the supervised feature selection method uses a large amount of prior information, it can obtain h...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/213
Inventor 李学龙鲁全茂董永生
Owner XI'AN INST OF OPTICS & FINE MECHANICS - CHINESE ACAD OF SCI