Multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm

A feature selection method and genetic algorithm technology, applied in the field of multi-dimensional data feature selection, can solve the problems of premature convergence of genetic algorithm and suboptimal solution of feature combination.

Active Publication Date: 2021-01-12
JILIN UNIV
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The researchers applied the genetic algorithm to the feature selection process to obtain a better feature combination,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm
  • Multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm
  • Multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0099] refer to figure 1 As shown, the steps of a multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm are as follows:

[0100] 1) Simple cleaning of traffic accident data;

[0101] 2) the data feature screening result in step 1) is used as the input of a multidimensional data feature selection method combining genetic algorithm and dragonfly algorithm, and the output result is the data feature selected by the method of the present invention;

[0102] refer to figure 2 As shown, the traffic accident data is simply cleaned, and according to the characteristics of each dimension of the data, the features with only a single value in this dimension feature and more than half of the data missing are screened out and the information entropy values ​​are ranked from large to small. Feature screening, that is, this feature is not selected for all data for model training, and the rest of the features are retained. The data screening st...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-dimensional data feature selection method combining a genetic algorithm and a dragonfly algorithm. The method comprises the following specific steps that traffic accident data is simply cleaned; the dragonfly algorithm is embedded into the genetic algorithm to intervene the crossover operation, and the optimal intersection position is found out through the dragonflyalgorithm to improve the optimization speed of the genetic algorithm. A dragonfly algorithm is embedded into a genetic algorithm to intervene mutation operation, and different gene position mutationprobabilities are set by judging whether food and natural enemy gene positions calculated by the dragonfly algorithm are selected or not, so that the convergence rate of the algorithm is increased. The data feature screening result is taken as the input of a multi-dimensional data feature selection method combining a genetic algorithm and a dragonfly algorithm, and the output result is the data feature selected by the algorithm. Experiments prove that the method has good performance for different classifiers, and it is verified that the feature selection method is effective and has robustness.

Description

technical field [0001] The invention relates to a method belonging to machine learning, more precisely, the invention designs a multi-dimensional data feature selection method combining genetic algorithm and dragonfly algorithm. The present invention can not only be applied to the field of machine learning, but also can be extended to other fields, and other fields also belong to the protection scope of this patent. Background technique [0002] Machine learning is an emerging discipline in recent years, involving many fields. Machine learning specializes in the study of how computers simulate or implement human learning behaviors to acquire new knowledge or skills and continuously improve their own performance. In machine learning, the selection of data sets is particularly important. For the selection of data sets, the first is to select a data set with enough features and samples, and the second is to perform feature selection in the data set. Since optimization in the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N3/00G06N3/12
CPCG06N3/006G06N3/126G06F18/2415G06F18/214
Inventor 杨晓萍王星乔柳莹于树友李娟
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products