Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-data-set feature selection method and system based on multi-task evolutionary algorithm

A feature selection method and evolutionary algorithm technology, applied in the field of feature selection, can solve the problems of high computational cost, multi-dataset feature selection without sharing, etc., to improve efficiency, reduce the risk of falling into local optimum, reduce the overall computational cost and effect of time

Pending Publication Date: 2021-04-16
HANGZHOU INNOVATION RES INST OF BEIJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to overcome the problem that the existing feature selection technology based on evolutionary algorithm only supports the processing of a single data set, the computational cost of feature selection for multiple data sets and multi-models is high, and the feature selection of multiple data sets is not shared, the present invention proposes a A multi-dataset feature selection method based on a multi-task evolutionary algorithm. A core multi-task evolutionary module is designed so that the evolutionary algorithm can perform feature selection on multiple data sets at the same time. It not only supports the transfer of shared feature selection schemes between different data sets, It also supports independent evolution of each data set for each scenario

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-data-set feature selection method and system based on multi-task evolutionary algorithm
  • Multi-data-set feature selection method and system based on multi-task evolutionary algorithm
  • Multi-data-set feature selection method and system based on multi-task evolutionary algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further elaborated and described below in combination with specific embodiments. The technical features of the various implementations in the present invention can be combined accordingly on the premise that there is no conflict with each other.

[0042] Such as figure 1 As shown, the multi-dataset feature selection method based on the multi-task evolutionary algorithm of the present invention mainly includes the following steps:

[0043]Step 1: For the multi-regression model integration task in different scenarios, divide the dataset under the task into datasets according to different scenarios, and each scenario corresponds to a sub-dataset;

[0044] Step 2: Initialize the environmental parameters and constraints of the evolutionary algorithm, and encode the features in the sub-dataset;

[0045] Step 3: For the sub-datasets in each scenario, select excellent individuals from the parent population to form a sub-population, and generate s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-data-set feature selection method and system based on a multi-task evolutionary algorithm, and belongs to the technical field of feature selection. The method comprises the following steps: (1) cleaning and splitting a data set; (2) initializing an evolutionary algorithm; (3) selecting excellent individuals to form sub-populations, and generating filial generation individuals; (4) evaluating the sub-population model; (5) carrying out independent evolution on each sub-data set; (6) stopping checking by the algorithm; and (7) outputting a feature selection result and a regression model of each sub-data set. According to the feature selection method, feature selection can be carried out on the multiple data sets at the same time, a shared feature selection scheme transmitted among the different data sets is supported, collaborative feature selection is carried out on the multiple data sets, and independent evolution of the data sets for respective scenes is also supported. Therefore, the calculation overhead consumed in repeated operation is remarkably reduced, and the calculation efficiency of multi-data-set feature selection is remarkably improved.

Description

technical field [0001] The invention belongs to the technical field of feature selection, and in particular relates to a multi-data set feature selection method and system based on a multi-task evolutionary algorithm. Background technique [0002] With the rapid development of big data and artificial intelligence technology, the importance of mining useful information from massive data has become increasingly prominent. With the exponential growth of data in dimension and depth, the demand for computing resources of various machine learning algorithms has also increased significantly. Therefore, data dimensionality reduction has become one of the effective means to reduce the computational cost of machine learning and improve model accuracy. As one of the common dimensionality reduction methods, feature selection is an effective means to solve the curse of dimensionality, and has important theoretical and application value for classification and regression problems. Feature...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/27G06K9/62G06N3/12G06F111/04
Inventor 罗喜伶金晨张泊宇
Owner HANGZHOU INNOVATION RES INST OF BEIJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products