Unbalanced data classification model training method, device and equipment and storage medium

A technology for model training and data classification, applied in the field of information processing, can solve problems such as inaccurate classification, and achieve the effect of reducing storage space, reducing false positive rate, and enhancing impact.

Pending Publication Date: 2019-08-23
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a training method, device, equipment, and storage medium for an unbalanced data classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unbalanced data classification model training method, device and equipment and storage medium
  • Unbalanced data classification model training method, device and equipment and storage medium
  • Unbalanced data classification model training method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0028] The unbalanced data classification model training method provided by the present invention can be applied in such as figure 1 In the application environment, the application environment includes a server and a preset sample library, wherein the preset sample library is a database storing unbalanced data; the server is a computer device for training unbalanced data, and the server can be a server or A server cluster; the server and the preset sam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unbalanced data classification model training method and device, computer equipment and a storage medium. The method comprises the steps of obtaining unbalanced data from apreset sample library; performing dimension reduction processing on the unbalanced data according to a preset dimension reduction method to obtain low-dimensional data after dimension reduction; sampling the low-dimensional data according to a preset sampling mode to obtain balance data; and taking the balance data as a training sample, and training the training sample by using a preset machine learning algorithm to obtain a classification model. According to the technical scheme, the classification model obtained through training is used for classification, the misjudgment rate of few types of data in the unbalanced data can be reduced, and therefore the classification accuracy is improved.

Description

technical field [0001] The invention relates to the field of information processing, in particular to a training method, device, equipment and storage medium for an unbalanced data classification model. Background technique [0002] In the practical application of classifying data using machine learning methods, dealing with imbalanced data has always been a thorny problem. Imbalanced data refers to the unbalanced proportion of samples from different categories during training or classification. For example, in user fraud detection, the proportion of fraudulent behavior is much smaller than that of non-fraudulent behavior. Imbalanced data widely exists in practical applications such as fault detection, defect detection, network intrusion detection, and medical diagnosis. [0003] In unbalanced data, although the number of samples is small, it will also have an important impact on the results of training or classification, so it cannot be ignored as noise. However, if trad...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/213G06F18/2411G06F18/214
Inventor 金戈徐亮
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products