Multivariate discrete feature selection method, device, apparatu and storage medium

A feature selection method and feature selection technology, applied in the field of machine learning, can solve the problems of poor elimination of irrelevant features and redundant features, inaccurate classification results, etc.

Inactive Publication Date: 2019-01-29
SHENZHEN UNIV
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to provide a multivariate discrete feature selection method, device, equipment and storage medium, aiming to solve the problem of poor elimination of irrelevant features and redundant features due to the inability of the prior art to provide an effective feature selection method , the problem of inaccurate classification results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multivariate discrete feature selection method, device, apparatu and storage medium
  • Multivariate discrete feature selection method, device, apparatu and storage medium
  • Multivariate discrete feature selection method, device, apparatu and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] figure 1 The implementation flow of the multivariate discrete feature selection method provided by the first embodiment of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, and the details are as follows:

[0029] In step S101, when a request for feature selection of the target data set input by the user is received, a cut point corresponding to each feature in the target data set is found through the minimum description length algorithm.

[0030] Embodiments of the present invention are applicable to computing devices, such as personal computers, servers, and the like. When receiving a request for feature selection on the target data set input by the user, find the cut point corresponding to each feature in the target data set through the minimum description length algorithm. The target data set is composed of high-dimensional data, such as genetic data. A cut point is any e...

Embodiment 2

[0052] image 3 The structure of the multivariate discrete feature selection device provided by the second embodiment of the present invention is shown. For the convenience of description, only the parts related to the embodiment of the present invention are shown, including:

[0053] The point-cut finding unit 31 is configured to find the point-cut corresponding to each feature in the target data set through the minimum description length algorithm when receiving a request for feature selection of the target data set input by the user.

[0054] Embodiments of the present invention are applicable to computing devices, such as personal computers, servers, and the like. When receiving a request for feature selection on the target data set input by the user, find the cut point corresponding to each feature in the target data set through the minimum description length algorithm. The target data set is composed of high-dimensional data, such as genetic data. A cut point is any eig...

Embodiment 3

[0075] Figure 4 The structure of the computing device provided by the third embodiment of the present invention is shown, and for the convenience of description, only the parts related to the embodiment of the present invention are shown.

[0076] The computing device 4 of the embodiment of the present invention includes a processor 40 , a memory 41 and a computer program 42 stored in the memory 41 and operable on the processor 40 . When the processor 40 executes the computer program 42, it realizes the steps in the embodiment of the multivariate discrete feature selection method, for example figure 1 Steps S101 to S106 are shown. Alternatively, when the processor 40 executes the computer program 42, the functions of the units in the above-mentioned device embodiments are realized, for example image 3 Function of units 31 to 34 shown.

[0077] In the embodiment of the present invention, the tangent point corresponding to each feature in the target data set input by the us...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is applicable to the technical field of machine learning, and provides a multivariate discrete feature selection method, device, apparatus, and storage medium. The method comprises: initializing a particle swarm according to a tangent point corresponding to each feature in a target data set, obtaining the particle position of each particle, the target data set is discretized according to the particle position, the corresponding discrete data set is obtained, in accordance with discrete data set, the fitness of each particle is calculated by the fitness formula, so as to find outthe optimal position of the population of the particle swarm and the optimal position of the individual through which each particle passes, when the stop condition is met, optimal position of output population, otherwise, the particle position of each particle is updated according to the optimal position of the population and the optimal position of the individual, and the operation of data discretization and optimization is continued, so that fewer features are selected, the effect of eliminating redundant features and irrelevant features is improved, and the accuracy of classification learning algorithm is improved.

Description

technical field [0001] The invention belongs to the technical field of machine learning, and in particular relates to a multiple discrete feature selection method, device, equipment and storage medium. Background technique [0002] With the advent of the big data era, the importance of data has become increasingly prominent. Massive data promotes the development of the information society. However, with the continuous growth of data dimensions, the "dimension disaster" will be inevitable. In recent years, machine learning has been applied to various big data scenarios, such as DNA microarray analysis, image classification, text classification, etc., because these data have high data dimensions and there are some irrelevant data features in the data and redundant features, directly using the original data will affect the efficiency and performance of the learning algorithm. Therefore, in the process of machine learning, a series of preprocessing operations such as feature sel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/285G06F18/2111
Inventor 亢俊皓周宇郭海男林继平
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products