Model training and data classification method and device

A model training and classifier technology, applied in the computer field, can solve problems such as time-consuming, a large number of manual interventions, unstable modeling effects, etc., to achieve the effect of ensuring effectiveness, reliable classification, and improving efficiency

Pending Publication Date: 2022-05-06
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] When the number of target samples is small, when building a model (for example, a classification model), generally only logistic regression can be used to model, and the modeling effect is not good; while performing EDA (Exploratory Data Analysis, data exploration) on multiple labeled samples When analyzing, it is necessary to use a manual strategy to merge multiple samples to obtain the required training samples. The mixed modeling is performed through the training samples. The modeling effect is unstable and requires a lot of manual intervention, which takes a long time.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training and data classification method and device
  • Model training and data classification method and device
  • Model training and data classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0022] Aiming at the problem that the traditional small target sample modeling effect is not good, or the mixed modeling of multiple data source samples takes a long time and the effect is unstable, this disclosure proposes a model training method. figure 1 A process 100 according to an embodiment of the model training method of the present disclosure is shown, and the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a model training method and device, and relates to the technical field of big data, machine learning and the like. According to the specific implementation scheme, a training sample is obtained according to an auxiliary sample and a target sample which are obtained in advance; a preset weight rule is adopted to generate training weights of the training samples, and the weight rule is used for enabling the weight proportions of the auxiliary samples and the target samples to be the same; executing the following training steps: inputting a training sample and a training weight into a base learning device to obtain the output of the base learning device; and based on the output of the base learner, adopting a migration operator to adjust the training weight until a training stop condition is met, and obtaining a target model. According to the embodiment, the modeling efficiency is improved.

Description

technical field [0001] The present disclosure relates to the field of computer technology, specifically to big data, machine learning and other technical fields, and in particular to a method and device for model training and data classification, electronic equipment, computer-readable storage media, and computer program products. Background technique [0002] When the number of target samples is small, when building a model (for example, a classification model), generally only logistic regression can be used to model, and the modeling effect is not good; while performing EDA (Exploratory Data Analysis, data exploration) on multiple labeled samples When performing analysis, it is necessary to use manual strategies to merge multiple samples to obtain the required training samples, and to perform mixed modeling through training samples. The modeling effect is unstable, and a large amount of manual intervention is required, which takes a long time. Contents of the invention ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N20/00
CPCG06N20/00G06F18/214G06F18/24
Inventor 王天祺刘昊骋徐世界徐靖宇田建
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products