Effective hybrid feature selection method based on improved binary krill swarm algorithm and information gain algorithm

An information gain, mixed feature technology, applied in the field of bioinformatics

Active Publication Date: 2020-02-25
HENAN UNIVERSITY
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] For the existing feature selection algorithm, it is impossible to cover the two aspects of "selecting the most valuable feature subset composed of relevant features from the original input data" and "improving the classification accuracy as much as possible". To solve the target problem, the present invention provides an effective mixed feature selection method based on the improved binary krill swarm algorithm and information gain algorithm, which can further improve the classification accuracy of features while selecting the optimal feature subset

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Effective hybrid feature selection method based on improved binary krill swarm algorithm and information gain algorithm
  • Effective hybrid feature selection method based on improved binary krill swarm algorithm and information gain algorithm
  • Effective hybrid feature selection method based on improved binary krill swarm algorithm and information gain algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the present invention Examples, not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0079] The effective mixed feature selection method based on the improved binary krill swarm algorithm and the information gain algorithm proposed by the present invention is called IG-MBKH algorithm for short. combine figure 1 and figure 2 Shown, the IG-MBKH algorithm provided by the present invention comprises the following steps:

[0080] S101. Randomly initialize N indivi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an effective hybrid feature selection method based on an improved binary krill swarm algorithm and an information gain algorithm. The algorithm comprises the following steps: step 1, randomly initializing N individuals in a population by adopting a feature sorting strategy based on an information gain algorithm; 2, calculating a fitness value of each individual in the population by adopting a set fitness function, and taking a solution expressed by the individual with the maximum fitness value in the population as a global optimal solution in the population; 3, updatingthe population by using an improved binary krill swarm algorithm, updating the fitness value of each individual in the population, and updating a globally optimal solution in the population; and 4, taking the step 3 as one iteration, and repeating the step 3 until the current number of iterations reaches the set number of iterations. Through test verification of 10-fold crossing on nine public biomedical data sets, the number of gene expression levels can be effectively reduced, and compared with other feature selection methods, high classification accuracy is obtained.

Description

technical field [0001] The invention relates to the technical field of bioinformatics, in particular to an effective mixed feature selection method based on an improved binary krill swarm algorithm and an information gain algorithm. Background technique [0002] With the development of DNA microarray technology in biomedicine, large-scale high-dimensional small-sample microarray data have been accumulated, such as the lung cancer microarray data set, including 181 samples, each containing 12533 ​​features. The high-dimensional and small-sample characteristics of the data have brought great challenges to gene analysis and disease diagnosis. High-dimensional small-sample data also brings great challenges to existing mining and learning algorithms. With the sharp increase of data dimensions, a large amount of redundant information and irrelevant information will usually be generated, which may greatly reduce the performance of machine learning algorithms, increase computationa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/00G16B25/00
CPCG06N3/006G16B25/00Y02A90/10
Inventor 张戈王建林阎朝坤侯金翠罗慧敏
Owner HENAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products