Unlock instant, AI-driven research and patent intelligence for your innovation.

A recommended model training method and related device

A model training and model technology, applied in the field of big data, can solve problems such as large amount of data, reduced model prediction accuracy, and model overfitting

Active Publication Date: 2022-05-10
HUAWEI TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, in the recommendation system of Huawei AppGallery, tens of billions of data are generated every day under user operations. If sampling technology is not used in the training process of these tens of billions of data, the following problems will arise: (1) Due to the large amount of data and too much homogeneous data, the trained model is quickly overfitted; (2) Due to the large amount of data, a large amount of hardware resources are consumed, and the model training is too slow, which leads to model update If it is not timely, the prediction accuracy of the model will be reduced; (3) If the negative samples are far more than the positive samples, it will lead to serious data imbalance, which will affect the global classification performance of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A recommended model training method and related device
  • A recommended model training method and related device
  • A recommended model training method and related device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0080] Embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.

[0081] At present, there are many machine learning scenarios that focus on TopN (top N) sorting, such as Figure 1A As shown, this scenario includes but is not limited to scenarios involving e-commerce product recommendation, search engine result recommendation, application market recommendation, music, application (APP) recommendation, video website recommendation, etc. The recommended items in various application scenarios are as follows: It is called "object" for the convenience of subsequent description. This machine learning scenario that focuses on TopN ranking usually involves user behavior log collection, log data preprocessing (such as quantization, sampling, etc.), sample learning to obtain a recommendation model, and object (such as APP, music, etc.), present the sorting results to the user, and the user operates the objec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An embodiment of the present invention provides a recommendation model training method and a related device, the method comprising: selecting a positive sample in a sample set and adding it to the training set; wherein, the sample set includes a positive sample and a negative sample, and each sample is composed of Composed of n sample features, where n≥1, the sample features of each sample include features used to characterize whether the sample is a positive sample or a negative sample; a preset is used for each multiple negative samples in the sample set The algorithm calculates the sampling probability; selects a negative sample from the sample set according to the sampling probability and adds it to the training set; trains the samples in the training set to obtain a recommendation model. Adopting the embodiment of the present invention in the field of artificial intelligence (AI) for model training can improve the performance of the model.

Description

technical field [0001] The present invention relates to the field of big data technology, in particular to a recommendation model training method and related devices. Background technique [0002] In many machine learning applications (such as recommendation systems) under the background of big data, sampling technology can improve the quality of data training, reduce computing resources, improve training efficiency, and solve the problem of data imbalance, so sampling technology is very important. For example, in the recommendation system of Huawei AppGallery, tens of billions of data are generated every day under user operations. If sampling technology is not used in the training process of these tens of billions of data, the following problems will occur: (1) Due to the large amount of data and too much homogeneous data, the trained model is quickly overfitted; (2) Due to the large amount of data, a large amount of hardware resources are consumed, and the model training i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06F16/735G06F16/78
CPCG06F16/78G06F16/735G06F16/9535G06F18/217G06N20/00G06N5/04
Inventor 祝宏董振华唐睿明张宇宙钱莉
Owner HUAWEI TECH CO LTD