Check patentability & draft patents in minutes with Patsnap Eureka AI!

High-dimensional feature screening method based on hybrid ant colony optimization algorithm

An ant colony algorithm and feature screening technology, applied in the field of data processing, can solve problems such as low performance of feature subsets and increased algorithm running time, and achieve the effects of improving classification performance, accelerating search, and good classification performance

Inactive Publication Date: 2020-02-14
XIDIAN UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the existing feature screening methods based on ant colony algorithm are applied to low-dimensional feature screening. If they are directly applied to high-dimensional feature screening, the running time of the algorithm will increase, and the performance of the obtained feature subset is also lower than that of traditional feature screening methods.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-dimensional feature screening method based on hybrid ant colony optimization algorithm
  • High-dimensional feature screening method based on hybrid ant colony optimization algorithm
  • High-dimensional feature screening method based on hybrid ant colony optimization algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0031] refer to figure 1 , the specific implementation steps of the present invention are as follows:

[0032] Step 1, data preprocessing.

[0033] Download public gene expression datasets from publicly available websites. Then the data set is preprocessed. Since the value ranges of each feature of these data in the data set are different, in order to unify the weight of each feature, normalization processing is performed here. At present, the normalization uses the maximum and minimum min-max or Gaussian z-score, because the Gaussian z-score method destroys the distribution of the original data, which is not conducive to the screening of high-dimensional features, so this example uses the maximum and minimum conversion function to the original data Each dimension is linearly normalized so that the resulting value is mapped between 0 and 1. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a high-dimensional feature screening method based on a hybrid ant colony optimization algorithm, mainly solving the problems of poor performance of a feature subset screened out in the prior art and long time consumption in the screening process. The technical scheme of the high-dimensional feature screening method includes the steps: preprocessing input high-dimensional data, and calculating the correlation and the symmetry uncertainty of the preprocessed data; initializing an ant colony algorithm, and constructing a feature subset through each ant; calculating the fitness of each feature subset, sorting the feature subsets, and selecting the contemporary optimal feature subset and the fitness thereof; iteratively updating the optimal feature subset of each generation and the fitness of the optimal feature subset; and comparing the fitness of the optimal feature subset of each generation, and taking the feature subset with the highest fitness as a final screened feature. According to the high-dimensional feature screening method, searching of the optimal feature subset is accelerated, and the screened feature subset has good classification performance, andthe overall operation time of screening is shortened, and the high-dimensional feature screening method can be used for channel selection in cancer gene analysis and hyperspectral image classification.

Description

technical field [0001] The invention belongs to the technical field of data processing, and in particular relates to a feature screening method, which can be used for channel selection in cancer gene analysis and hyperspectral image classification. Background technique [0002] Pattern recognition is a fundamental technique of artificial intelligence aimed at classifying objects into classes or categories. Recently, with the ease of data acquisition and the accumulation of large amounts of data, the main challenge in pattern recognition, namely the curse of dimensionality, has become more prominent. If the features are not screened and the data is used directly such as classification, the redundant features will increase the learning and testing time, and the irrelevant features will cause the performance of the classifier to decline. Especially in high-dimensional data sets, these shortcomings will be more obvious. Therefore, feature screening is necessary in data preproc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/00
CPCG06N3/006G06F18/211G06F18/241
Inventor 马文萍周晓波朱浩武越李龙伟
Owner XIDIAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More