Classification model training and using method and device, equipment and medium

A classification model and training method technology, applied in the computer field, can solve problems such as poor model performance, unbalanced category distribution, failure to achieve application effect and application performance, and achieve the effect of overcoming poor processing performance

Pending Publication Date: 2020-05-29
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in real-world application scenarios, classification data sets mostly have obvious category distribution imbalance problems, that is, in the same data sample, the number of samples of some categories is much larger than that of other categories

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification model training and using method and device, equipment and medium
  • Classification model training and using method and device, equipment and medium
  • Classification model training and using method and device, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0088] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0089] figure 1 It is a schematic flowchart of a classification model training method provided according to an embodiment of the present application. At present, many classification algorithms are applied. Among them, the Bayesian method is used for classification, and it is difficult to accurately obtain the distribution probability of data. The decision tree metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a classification model training and using method and device, equipment and a medium, and relates to the technical field of artificial intelligence. The training method comprises the steps of utilizing a training sample set, adopting an ensemble learning algorithm to train a classification model to obtain at least one classifier set, wherein the classifier set comprises at least two base classifiers; testing the classifier set by adopting a test sample set to determine the classification performance of each base classifier, and determining the classification weight of each base classifier according to the classification performance, wherein the classification weight is used for determining the weight of the classification result of each base classifier when the classifier set is used for sample classification. According to the embodiment of the invention, the diversity of the base classifiers in ensemble learning can be ensured, and the classifiers have the common classification performance for minority classes and majority classes, so that the classification model is suitable for the condition of unbalanced samples.

Description

technical field [0001] This application relates to the field of computer technology, in particular to an artificial intelligence technology. Background technique [0002] In recent years, with the rapid development of big data and artificial intelligence technology, speech and image recognition, natural language processing and knowledge graph have become hot research fields. Classification problems are the most typical problems in the field of machine learning and data mining. [0003] However, in real-world application scenarios, most classification data sets have obvious category distribution imbalance problems, that is, in the same data sample, the number of samples of some categories is much larger than that of other categories or a certain category. This kind of data set is called a category-imbalanced data set. An unbalanced data set is likely to lead to poor performance of the trained model, making it unable to achieve the desired application effect and application p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N20/00
CPCG06N20/00G06F18/241G06F18/214G06N20/20G06N3/126G06N5/02G06F18/2415
Inventor 盛文佳吴明丹高春旭叶峻
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products