Feature selection method and device for multi-tag data

A feature selection method and multi-label technology, applied in the field of data classification, can solve problems such as poor classification accuracy and complex calculation, and achieve the effect of improving classification accuracy

Active Publication Date: 2020-02-11
HENAN NORMAL UNIV
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a feature selection method and device for multi-label data to solve the problems of complex calculation and poor classification accuracy in the current multi-label feature selection process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature selection method and device for multi-tag data
  • Feature selection method and device for multi-tag data
  • Feature selection method and device for multi-tag data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The specific embodiments of the present invention will be further described below in conjunction with the accompanying drawings.

[0027] method embodiment

[0028] The present invention first uses the prior probability of the label as the weight of the label to calculate the correlation between the feature and the label, so that there is a greater correlation between the pre-screened feature and the label; then use the correlation between the sample label sets Finally, the feature weights are calculated according to the weight update formula, and the optimal feature subset is selected according to the order of the feature weights. The feature selection method for multi-label data of the present invention can be applied to various fields, including but not limited to text classification, gene function classification, image annotation, video automatic annotation, etc. Taking the field of text classification as an example below, the specific implementation process of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a feature selection method and device for multi-label data, and belongs to the technical field of data classification. According to the method, firstly, the prior probabilityof a mark is used as the weight of the mark, the correlation between features and the mark is calculated, the features are pre-screened according to the correlation, and the correlation between the features and the mark is as large as possible while the subsequent calculation amount is reduced; the method comprises the following steps: firstly, marking samples, then dividing the same type and different types of samples by utilizing correlation link values among marking sets of the samples, finally, calculating feature weight values according to a weight updating formula, sorting the feature weight values, and selecting an optimal feature subset. Through the process, the optimal feature subset can be effectively selected, and the classification precision of the multi-label feature selectionalgorithm is improved.

Description

technical field [0001] The invention relates to a feature selection method and device for multi-label data, belonging to the technical field of data classification. Background technique [0002] In traditional supervised learning, each instance corresponds to only one class label. However, in the real world, an object often has multiple concept labels at the same time. For example, an image may have labels such as "desert", "sun", and "cactus" at the same time, so the multi-label learning problem arises at the historic moment. At present, multi-label learning has received extensive attention and has been applied in many fields such as text classification, gene function classification, image annotation, and video automatic annotation. In practical applications such as text classification, the existence of a large amount of irrelevant and redundant information in high-dimensional data greatly reduces the performance of learning algorithms. Therefore, dimensionality reduction...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06K9/62
CPCG06F16/35G06F18/24143
Inventor 孙林施恩惠秦铮谭淑月曾祥师殷腾宇黄金旭王天翔王欣雅张玖肖
Owner HENAN NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products