Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Classification model training method and device and data classification method and device

A classification model and data technology, applied in the field of data classification, can solve the problems of increasing the training cost of the classification model, large differences in the proportion of categories, and difficulty of the classification model, so as to improve the accuracy of classification and ensure the effect of training

Active Publication Date: 2019-11-12
BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD +1
View PDF11 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the training method of the existing classification model, the sample data is generally directly selected from the sample data set and input into the classification model to train the classification model, but the number of samples of one category in the sample data set is much larger than the number of samples of other categories In some cases, the category imbalance of the number of samples in the sample set used for training, that is, the category imbalance of the number of samples in the sample data set, the trained classification model is only good for the classification of samples of different categories in the training sample set, but When classifying the original data set, due to the large difference in the proportion of categories between the original data set and the training sample set, the error rate of the classification result predicted by the trained classification model is high, and the existing trained classification model is difficult to carry out practical application
[0004] In the training of classification models, if you want to establish a training set with balanced sample categories, you need to spend a lot of manpower and material resources to find and process materials to obtain a training set with balanced sample categories, which will greatly increase the cost of classification model training.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification model training method and device and data classification method and device
  • Classification model training method and device and data classification method and device
  • Classification model training method and device and data classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the application. However, the present application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0035] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a classification model training method and device and a data classification method and device. The classification model training method comprises: acquiring a sample data set, the sample data set comprising at least three category labels and feature data corresponding to the category labels, and counting the proportion of the number of each category label in the sample dataset; according to the proportion of the number of each category label in the sample data set, dividing the category labels in the sample data set into at least two sample groups; and inputting the sample group into a corresponding classification model for training until a training condition is met. According to the method, the classification labels in the sample data set are unbalanced in proportion, the quality of the processed sample data set is greatly improved, the training effect of the classification model can be further ensured, and the classification accuracy of the trained classification model is greatly improved during actual classification prediction of the trained classification model.

Description

technical field [0001] The present application relates to the technical field of data classification, and in particular to a classification model training method and device, a data classification method and device, computing equipment, and a computer-readable storage medium. Background technique [0002] Data classification is to automatically classify and mark data according to a certain classification system or standard, such as text classification, to automatically classify the input text according to a certain category system. Text classification technology has been widely used in natural language processing fields such as text review, advertisement filtering, sentiment analysis and anti-pornography recognition. [0003] In the training method of the existing classification model, the sample data is generally directly selected from the sample data set and input into the classification model to train the classification model, but the number of samples of one category in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06K9/62
CPCG06F16/35G06F18/24G06F18/214
Inventor 王献唐剑波李长亮
Owner BEIJING KINGSOFT DIGITAL ENTERTAINMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products