Feature selection method and traditional Chinese medicine main-symptom selection method based on feature groups

A feature selection method and feature group technology, which is applied in the field of pattern recognition and machine learning research, and can solve problems such as difficulties in the study of TCM syndrome differentiation.

Active Publication Date: 2017-10-24
EAST CHINA UNIV OF SCI & TECH +1
View PDF4 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There may be dozens or even hundreds of possible symptoms corresponding to a certain syndrome type in TCM syndrome differentiation, which will bring difficulties to the research of TCM syndrome differentiation. The purpose of the invention is to provide a feature-based The selection method of the main symptoms of traditional Chinese medicine, embodying the syndrome elements in the syndrome differentiation model of traditional Chinese medicine can make the syndrome differentiation model of traditional Chinese medicine more consistent with the theory of traditional Chinese medicine

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature selection method and traditional Chinese medicine main-symptom selection method based on feature groups
  • Feature selection method and traditional Chinese medicine main-symptom selection method based on feature groups
  • Feature selection method and traditional Chinese medicine main-symptom selection method based on feature groups

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] like figure 1 As shown, this embodiment discloses a feature selection method based on feature groups, comprising the following steps:

[0027] Step (1), remove features irrelevant to the label content and features with too small frequency from the original feature set to obtain the feature set to be selected.

[0028] Step (2), using a feature clustering algorithm to cluster each feature in the feature set to be selected to obtain a corresponding feature group.

[0029] For the feature set to be selected D(X 1 ,X 2 ...X n ,Y), where X 1 ,X 2 ...X n is the n-dimensional input feature space, Y is the label, and the feature clustering algorithm is used to classify the input feature space X 1 ,X 2 ...X n Clustering is performed to obtain the corresponding feature groups.

[0030] The key to the Bayesian network with hidden variable learning is the discovery of hidden variables, that is, to determine the number of hidden variables in the network and their positions...

Embodiment 2

[0054] This embodiment is to apply Embodiment 1 to the method of selecting the main symptoms of traditional Chinese medicine, understand the syndrome elements that are the most basic elements of syndrome differentiation in TCM syndrome differentiation and cannot be observed as corresponding hidden variables, use symptoms as characteristics, and syndrome types As a label, embodying syndrome elements in the TCM syndrome differentiation model can make the TCM syndrome differentiation model more consistent with TCM theory.

[0055] like Figure 4 As shown, the feature selection method is used to select the main symptoms of damp-turbidity syndrome. When the feature subset contains the first four feature groups with the highest mutual information with the damp-turbidity syndrome type, the classification accuracy of the classifier reaches the highest, so the features contained in the first four feature groups are selected as the damp-turbidity syndrome type main symptom.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a feature selection method and a traditional Chinese medicine main-symptom selection method based on feature groups. The feature selection method based on the feature groups comprises the following steps: 1, screening an original feature set; 2, utilizing a feature clustering algorithm to cluster the screened feature set to obtain the corresponding feature groups; 3, introducing a latent variable to each feature group to obtain a corresponding latent class model (LCM), and calculating correlation between the latent variables and labels; 4, sorting the feature groups sequentially with decreasing according to the correlation between the latent variables and the labels; 5, adding the sorted feature groups into selected feature subsets in turn, and establishing a Bayesian network with the latent variables; and 6, calculating classification accuracy rates of the Bayesian network, then obtaining a curve of the numbers of the added feature groups and the classification accuracy rates, and obtaining the corresponding optimal feature subset through judging the convergence of the curve or the highest accuracy rate. According to the method, the feature groups are used as selected targets, and the feature groups composed of a plurality of features have better representation capacity for original data.

Description

technical field [0001] The invention relates to the field of pattern recognition and machine learning research, and the feature relates to a feature selection method based on feature groups. Background technique [0002] The main purpose of Feature Selection is to select the optimal feature subset that meets the specified evaluation criteria from the original features, so that the classification model or regression model constructed by the selected optimal feature subset achieves better performance than before feature selection. Better performance, through feature selection, not only improves the generalization ability of the model, so that the model can be better explained to improve computational efficiency, but also reduces the occurrence of dimensionality disaster. [0003] In the traditional feature selection algorithm, the selection of features is mainly for a single feature, considering the correlation between each feature and the label, and choosing each feature with...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/00G06K9/62
CPCG06F18/24155
Inventor 颜建军刘国萍顾巍杰郭睿燕海霞王忆勤王灼龙
Owner EAST CHINA UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products