Training sample processing method and device, equipment and storage medium

A training sample processing technology, applied in the field of data processing, that addresses the problem that training samples cannot be output in real time, and achieves fast and stable output, improved stability, and reduced memory usage.

Pending Publication Date: 2021-06-08
BIGO TECH PTE LTD

AI Technical Summary

Problems solved by technology

[0004] The embodiments of the present application provide a training sample processing method, device, equipment and storage medium that can solve the problem that training samples cannot be produced in real time and ensure the timeliness of the recommendation model and the recommendation system.



Examples


Embodiment 1

[0032] Figure 1 is a flow chart of a training sample processing method provided in Embodiment 1 of the present application. The training sample processing method provided in this embodiment can be executed by a training sample processing device, which can be implemented in software and/or hardware.

[0033] The following description takes the training sample processing device as the entity executing the training sample processing method. Referring to Figure 1, the training sample processing method includes:

[0034] S110. Acquire the dotting event data, parse the dotting event data through the stream computing engine to extract the user response data for the recommended content corresponding to the dotting event, and use the user response data as the label of the corresponding dotting event data.

[0035] Specifically, the dotting event data is the user behavior data collected by the client, which can ref...
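
As a rough illustration of step S110, the sketch below parses raw dotting events and derives a label from the user's response to the recommended content. A plain Python generator stands in for a real stream computing engine, and the event fields (`event_id`, `item_id`, `action`) as well as the rule mapping actions to labels are assumptions made for illustration; none of them are taken from the application.

```python
import json
from typing import Dict, Iterable, Iterator

# Hypothetical set of user actions treated as a positive response.
POSITIVE_ACTIONS = {"click", "like", "share"}


def parse_labels(raw_events: Iterable[str]) -> Iterator[Dict[str, object]]:
    """Parse JSON-encoded dotting events one at a time and attach a label."""
    for raw in raw_events:
        try:
            event = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed events instead of stopping the stream
        if not isinstance(event, dict):
            continue  # valid JSON, but not an event record
        action = event.get("action")
        if action is None:
            continue  # no user response recorded for this event
        yield {
            "event_id": event.get("event_id"),
            "item_id": event.get("item_id"),
            # Assumed rule: positive actions map to 1, everything else to 0.
            "label": 1 if action in POSITIVE_ACTIONS else 0,
        }
```

Because the function is a generator, each event is labeled as soon as it arrives and no batch has to be collected first, which is the property the stream computing engine provides in the described method.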

Embodiment 2

[0074] On the basis of the above embodiments, Figure 7 is a schematic structural diagram of a training sample processing device provided in Embodiment 2 of the present application. Referring to Figure 7, the training sample processing device provided in this embodiment specifically includes: a label parsing module 21, a feature preprocessing module 22, a data summary module 23 and a training sample generation module 24.

[0075] The label parsing module 21 is configured to acquire the dotting event data, parse the dotting event data through the stream computing engine to extract the user response data for the recommended content corresponding to the dotting event, and use the user response data as the label of the corresponding dotting event data;

[0076] The feature preprocessing module 22 is configured to acquire feature data corresponding to the recommended content, and preprocess the feature data through the stream computing engine based on a preset first preprocessing...

Embodiment 3

[0082] Embodiment 3 of the present application provides an electronic device. Referring to Figure 8, the electronic device includes: an input device 33, an output device 34, a memory 32 and one or more processors 31. The memory 32 is used to store one or more programs; when the one or more programs are executed by the one or more processors 31, the one or more processors 31 implement the training sample processing method provided in Embodiment 1 above. The electronic device provided above can be used to execute the training sample processing method provided in Embodiment 1 above, and has the corresponding functions and beneficial effects.


Abstract

The embodiments of the invention disclose a training sample processing method, device, equipment and storage medium. The method comprises the following steps: acquiring dotting event data, parsing the dotting event data through a stream computing engine to extract user response data for the recommended content corresponding to a dotting event, and taking the user response data as the label of the corresponding dotting event data; obtaining feature data corresponding to the recommended content, and preprocessing the feature data through the stream computing engine based on a preset first preprocessing rule; storing the dotting event data in a distributed column database, and storing the labels of the dotting event data and the corresponding preprocessed feature data in associated fields of the dotting event data; and, after the labels of the dotting event data have all been stored in the corresponding associated fields, taking the dotting event data and the data in the associated fields as training samples and storing the training samples in a distributed message system or a distributed file system. This solves the problem that training samples cannot be output in real time.
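
The store-then-emit step in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions: an in-memory dictionary stands in for the distributed column database, a Python list stands in for the distributed message system or file system, and the class name `SampleStore`, its methods, and the label names are hypothetical.

```python
from typing import Any, Dict, List, Set


class SampleStore:
    """In-memory stand-in for a column database keyed by event ID."""

    def __init__(self, expected_labels: Set[str], sink: List[Dict[str, Any]]):
        self.expected_labels = expected_labels  # hypothetical label names
        self.sink = sink  # stand-in for a message system or file system
        self.rows: Dict[str, Dict[str, Any]] = {}

    def _row(self, event_id: str) -> Dict[str, Any]:
        # Each row holds the event plus its associated fields.
        return self.rows.setdefault(
            event_id, {"event": None, "features": {}, "labels": {}}
        )

    def _maybe_emit(self, event_id: str) -> None:
        row = self.rows[event_id]
        # Emit only when the event and every expected label are stored.
        if row["event"] is not None and self.expected_labels.issubset(row["labels"]):
            self.sink.append({"event_id": event_id, **self.rows.pop(event_id)})

    def put_event(self, event_id: str, event: Dict[str, Any]) -> None:
        self._row(event_id)["event"] = event
        self._maybe_emit(event_id)

    def put_features(self, event_id: str, features: Dict[str, Any]) -> None:
        self._row(event_id)["features"].update(features)

    def put_label(self, event_id: str, name: str, value: Any) -> None:
        self._row(event_id)["labels"][name] = value
        self._maybe_emit(event_id)
```

Keying every write by the event ID lets labels and features arrive in any order, and removing the row once the sample is emitted keeps the amount of pending data bounded in this sketch.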

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of data processing, and in particular, to a training sample processing method, device, equipment, and storage medium.

Background

[0002] In scenarios such as short video recommendation, live broadcast recommendation, and advertising, the timeliness of recommended content is increasingly important. In a recommendation system, timeliness plays a very important role in the recommendation effect. The faster the recommendation system's model is updated, the better it reflects users' recent habits and the latest trends, and the more current the content of interest recommended to users. The timeliness of the recommendation system consists of two parts: the timeliness of the features and the timeliness of the model.

[0003] In order to achieve time-sensitive content recommendation, the ...


Application Information

IPC(8): G06F16/735, G06F16/78, G06Q30/02
CPC: G06F16/735, G06F16/7867, G06Q30/0255
Inventor: 胡志勇, 孟蕊, 张冠星
Owner: BIGO TECH PTE LTD