A training method and training system for a machine learning system

A technology of machine learning and training methods, applied in the field of big data processing, can solve problems such as large influence of model noise, model training stuck, training failure, etc., to enhance the ability to resist data noise, improve service capabilities, and ensure normal output Effect

Active Publication Date: 2020-11-06
ZHEJIANG TMALL TECH CO LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The accuracy of online learning is relatively high, but due to the short period of data collection, the model is greatly affected by the noise in the data, resulting in unstable model training; at the same time, because online learning uses asynchronous model update, training data continues to flow into the model, and the model continues to For update learning, model training is often stuck due to some uncontrollable problems, and training failures occur, resulting in the inability to produce usable models, affecting online services, and damaging user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A training method and training system for a machine learning system
  • A training method and training system for a machine learning system
  • A training method and training system for a machine learning system

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0026] The first embodiment of the present application proposes a training method for a machine learning system, such as figure 1 Shown is a flow chart of the training method of the machine learning system according to the first embodiment of the present application. The machine learning system is preferably a distributed machine learning system, including a parameter server (parameter server). The parameter server may include, for example, multiple workers (workers or slavers), multiple servers (servers), and a coordinator (coordinator). like figure 1 As shown, the training method includes the following steps:

[0027] Step S101, distributing the training data to multiple working machines;

[0028] In this step, for example, each working machine can read its own training data according to its identification, and the data among the working machines do not overlap. In this step, for example, the coordinator may divide the training data into training data belonging to each wo...

no. 2 example

[0045] The second embodiment of the present application proposes a training method for a machine learning system, such as figure 2 Shown is a flow chart of the training method of the machine learning system according to the second embodiment of the present application. The machine learning system is preferably a distributed machine learning system, such as figure 2 As shown, the training method includes the following steps:

[0046] S201, distribute the training data to multiple working machines;

[0047] S202, divide the training data obtained by each working machine into multiple data slices;

[0048] S203. Obtain the local weight and local loss function value calculated by each working machine based on each data piece;

[0049] S204, summarizing the local weights and local loss function values ​​calculated by each working machine based on each piece of data, to obtain the current weight and the current loss function value;

[0050] S205, perform model anomaly detectio...

no. 3 example

[0073] The third embodiment of the present application proposes a training method for a machine learning system, such as image 3 Shown is a flowchart of the training method of the machine learning system according to the third embodiment of the present application. The machine learning system is preferably a distributed machine learning system, such as image 3 As shown, the training method includes the following steps:

[0074] S301. Distributing the training data to multiple working machines;

[0075] S302, divide the training data obtained by each working machine into multiple data slices;

[0076] S303. Obtain the local weight and local loss function value calculated by each working machine based on each data slice;

[0077] S304, summarizing the local weights and local loss function values ​​calculated by each working machine based on each piece of data, to obtain the current weight and the current loss function value;

[0078] S305, perform model anomaly detection b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present application discloses a training method and system for a machine learning system, using training data to train the machine learning system. The training method includes: distributing the training data to multiple working machines; Divide the training data into multiple data slices; obtain the local weight and local loss function value calculated by each worker based on each data slice; summarize these local weights and local loss function values ​​to obtain the current weight and current loss function value; use The current weight and / or current loss function value performs model anomaly detection; when the detection result is the first type of anomaly, the weight after the last summary and the loss function value after the last summary are input into the machine learning system for training; when the detection The result is the second type of abnormality, the current weight and / or current loss function value is corrected to the current weight and / or current loss function value within the first threshold, and input to the machine learning system for training.

Description

technical field [0001] The present application relates to the field of big data processing, and in particular to a training method and a training system for a machine learning system. Background technique [0002] Nowadays, serving users well is the goal that every Internet company hopes to achieve. To this end, most companies use machine learning to capture user preferences and habits and provide personalized services. For example, various websites can use machine learning systems to collect user behavior data online, provide different search results for users of different genders / ages, and provide services based on user preferences to the greatest extent. [0003] The above-mentioned machine learning system can collect a large amount of user behavior data, such as user browsing / clicking / purchasing / upvoting / posting opinions on posts, etc., and use certain machine learning algorithms to train the machine learning system offline. After obtaining a predictive model, put it o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06N20/00
CPCG06N20/00G06N5/00G06F16/00G06N20/20G06N5/045
Inventor 周俊
Owner ZHEJIANG TMALL TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products