Markov carpet embedded type feature selection method based on packaging

A feature selection method and embedded technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as the inability to quickly identify redundant features

Active Publication Date: 2015-12-30
HEFEI UNIV OF TECH
View PDF6 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (2) Redundant features in the candidate feature set cannot be quickly identified, and these redundant features are kept in the candidate feature set until the end of the feature selection method, resulting in repeated evaluation of these redundant features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Markov carpet embedded type feature selection method based on packaging
  • Markov carpet embedded type feature selection method based on packaging
  • Markov carpet embedded type feature selection method based on packaging

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In this embodiment, it is assumed that the researched object is a data set Data composed of m instances, which is recorded as Data={inst 1 ,inst 2 ,...,inst i ,...,inst m}, for example, the data set Data can be microarray gene expression data; inst i Indicates the i-th instance; 1≤i≤m; the i-th instance inst i by n features i.e. the genes in the microarray data, and a categorical variable C i Composition, that is, the category corresponding to the microarray sample, such as cancer / normal; Indicates the i-th instance inst i The j-th feature in , 1≤j≤n; the j-th feature vector is composed of the j-th features of m instances, denoted as Thus, n feature vectors of m instances are obtained, denoted as f={f 1 , f 2 ,..., f j ,..., f n}; The category vector is composed of category variables of m instances, denoted as C={C 1 ,C 2 ,...,C i ,...,C m}; The attribute vector D of the data set Data is composed of n feature vectors f and category vector C var ={f 1...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Markov carpet embedded type feature selection method based on packaging. The Markov carpet embedded type feature selection method is characterized by being carried out according to the following steps: 1, acquiring an optimal feature by using a five-fold cross validation method; 2, judging whether the optimal feature is a null set not; if so, finishing feature selection; otherwise, updating a feature subset; 3, deleting redundancy features by using a Markov carpet method so as to update a feature vector; and 4, judging whether the feature vector is a null set not; if so, finishing the feature selection; otherwise, repeating the step 2. By virtue of adopting the Markov carpet embedded type feature selection method, the high-quality feature subset can be obtained; and meanwhile, the time complexity of the feature selection method based on the packaging is reduced so that relatively good classification performance and time performance can be obtained.

Description

technical field [0001] The invention belongs to the field of data mining, in particular to a Markov blanket-embedded feature selection method based on encapsulation. Background technique [0002] Feature selection, as a data preprocessing technique, is widely used in machine learning and data mining tasks, such as classification, regression, and clustering. When the original feature space of the data includes irrelevant or redundant features with the target task, classifiers built on the entire feature space often have poor performance, for example, Naive Bayesian classifiers are sensitive to redundant features. The purpose of feature selection is to apply an effective feature selection method to select a set of discriminative features from the original feature space. An effective feature selection method can not only reduce the dimensionality of the original feature space, but also reduce the training time of the classifier and improve its generalization ability. More impo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/20G06F19/24
Inventor 杨静王爱国安宁
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products