Unlock instant, AI-driven research and patent intelligence for your innovation.

Unbalanced sample classification method and device

A classification method and classification algorithm technology, applied in the field of data processing, can solve the problems of low classification accuracy and complex calculation methods, and achieve the effect of reducing processing, ensuring classification quality, and efficient classification

Pending Publication Date: 2020-10-30
SHENZHEN ACAD OF INSPECTION & QUARANTINE +3
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] When traditional algorithms are applied to the classification problem of unbalanced samples, there are problems such as complex calculation methods and low classification accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unbalanced sample classification method and device
  • Unbalanced sample classification method and device
  • Unbalanced sample classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0042] One of the core concepts of the embodiments of the present invention is to provide a method and device for classifying unbalanced samples, wherein, a method for classifying unbalanced samples includes: acquiring unbalanced sample data, the unbalanced sample data including sample data and feature data; use the sample data and the feature data to calculate the sample contribution rate; filter out the sample data within the preset sample contribution threshold according to the sample contribution rate, and determine it as the target sample data; the target sample data Input to the sample classification model and use the optimized classification algorithm to calculate the sample classification results. By using the two variable...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides an unbalanced sample classification method and device, and the method comprises the steps: obtaining unbalanced sample data which comprises sample data and feature data; calculating a sample contribution rate by utilizing the sample data and the feature data; screening out sample data within a preset sample contribution threshold according to the sample contribution rate, and determining the sample data as target sample data; and inputting the target sample data into a sample classification model, and performing calculation by using an optimization classification algorithm to obtain a sample classification result. Two variables of a feature value contribution rate and a feature contribution degree are used to eliminate the characteristics and samples with low classification contribution degree are eliminated, the processing of unbalanced sample data is effectively reduced, the efficient classification is realized by using a machine learning algorithm and effective characteristics or samples on the basis, and the classification efficiency is improved on the premise of ensuring the classification quality.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for classifying unbalanced samples and a device for classifying unbalanced samples. Background technique [0002] Various classification problems are often encountered in real life, such as identifying high-quality customers among many loan applicants, insurance companies judging car insurance levels based on vehicles and car owners, and food grading based on food information samples. When the various samples of the classification problem are relatively balanced, it is easy to get very accurate results. However, in the case of very large differences in the proportions of various samples, the sample ratio reaches 1:100, which is called sample imbalance. It is a big challenge to get a more ideal classification effect. [0003] When dealing with the problem of unbalanced samples, it is currently mainly solved through data sampling processing and algorithm adjustme...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/241G06N20/00G06N3/08G06F18/214G06F18/2115
Inventor 包先雨蔡伊娜阮周曦郭云吴绍精卢体康陈枝楠
Owner SHENZHEN ACAD OF INSPECTION & QUARANTINE