Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data classification method and device, equipment and computer readable storage medium

A data classification and majority class technology, applied in the field of information processing, can solve problems such as unsatisfactory prediction results, inaccurate prediction results, and too many new samples

Inactive Publication Date: 2018-08-10
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] At present, in the process of data modeling and classification of data, especially in the case of multi-classification, there are often problems of class imbalance in various samples. When the number of training samples of various types is quite different, the unbalanced samples are directly used. If the classification model is obtained by training, the result of model training may be very unsatisfactory due to the imbalance of the number of samples of various types. Then the prediction result obtained by using the trained model for prediction is not ideal, or even the prediction result is opposite.
[0003] At present, the more common practice is to increase the number of samples by generating new samples for those samples with a small number, so as to reach the level of equilibrium with the number of samples with a large number, but because the new samples are not real samples, and the generated There should not be too many new samples, so most of the more samples need to be discarded, but because most of the samples are discarded, the prediction results of the established data model may be inaccurate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data classification method and device, equipment and computer readable storage medium
  • Data classification method and device, equipment and computer readable storage medium
  • Data classification method and device, equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, rather than all of them. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0028] It should be understood that when used in this specification and the appended claims, the terms "including" and "including" indicate the existence of the described features, wholes, steps, operations, elements and / or components, but do not exclude one or The existence or addition of multiple other features, wholes, steps, operations, elements, components, and / or collections thereof.

[0029] It should also be understood that the term "and / or" used...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a data classification method and device, equipment and a computer readable storage medium. According to the data classification method and device, the equipment and the computerreadable storage medium provided by the invention, all minority sample sets are mixed to form a new sample set which serves as training samples to perform machine learning together with majority sample sets so as to obtain a first classification model, so that a problem of imbalance in sample quantity of the minority sample sets and the majority sample sets is solved; and all of the minority sample sets serve as new samples to perform training again so as to obtain a second classification model, the type of the data is predicted through combining the first classification model and the secondclassification model, and a problem of inaccurate prediction for the minority data is solved.

Description

Technical field [0001] The present invention relates to the field of information processing technology, in particular to a data classification method, device, equipment and computer-readable storage medium. Background technique [0002] At present, in the process of data modeling and classification of data, especially in the case of multi-classification, there are often problems of imbalance in various types of samples. When the number of training samples of various types differs greatly, unbalanced samples are directly used If the classification model is obtained by training, the result of model training may be very unsatisfactory due to the imbalance of the number of various samples. Then the prediction result obtained by using the trained model to predict is not ideal, or even the opposite. [0003] The current common practice is to increase the number of samples with a smaller number of samples by generating new samples to achieve a level that is balanced with the number of sam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/24G06F18/217G06F18/24317G06F18/214
Inventor 伍文岳
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products