Deep learning model training method, working node and parameter server

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for model training and working nodes, applied in the field of deep learning, which can solve problems such as low model training efficiency

Pending Publication Date: 2020-12-01

LYNXI TECH CO LTD

View PDF4 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] Embodiments of the present invention provide a deep learning model training method, working nodes, and parameter servers to solve the problem of low model training efficiency in the process of using multiple working nodes to train the same deep learning model in the related art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0034] In the training process of large-scale deep learning models, in order to speed up model convergence and improve training efficiency, and considering that the total number of samples may be large (it is impossible to use all sample data for model iteration), the small batch gradient descent method is usually used for model training. . Among them, when using the small batch gradient descent method for model training, each iteration uses batch size...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a deep learning model training method, a working node and a parameter server. The deep learning model training method applied to the working node comprises thesteps that a first statistical parameter sent by a parameter server is received, and the first statistical parameter is determined by the parameter server according to historical training data of a target layer of a target model; when the target layer is trained based on target batch training samples, target statistical parameters of the target layer are obtained, and the target statistical parameters are statistical parameters of the target batch training samples; and an actual statistical parameter of the target layer is determined based on the first statistical parameter and the target statistical parameter, batch standardization is performed on the target batch training samples based on the actual statistical parameter, and the target statistical parameter is sent to the parameter server. According to the embodiment of the invention, the training efficiency of the deep learning model can be improved.

Description

technical field [0001] The invention relates to the technical field of deep learning, in particular to a deep learning model training method, a working node and a parameter server. Background technique [0002] With the development of information technology, the use of deep learning models for training to use the trained models to predict target data has been more and more widely used. In order to further improve the accuracy of the trained models, the training sample The number is also increasing, which results in the complexity of training and longer training time. [0003] In related technologies, usually multiple working nodes can be used to train the same model. For example, different working nodes are responsible for training different training layers in the same model. At this time, the next training layer needs to wait for the training of the previous training layer to complete. Being able to perform a training process whose waiting time greatly increases the overal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06N20/00

CPCG06N20/00

Inventor徐茂轩吴臻志

OwnerLYNXI TECH CO LTD

Deep learning model training method, working node and parameter server

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. A technology for model training and working nodes, applied in the field of deep learning, which can solve problems such as low model training efficiency

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology for model training and working nodes, applied in the field of deep learning, which can solve problems such as low model training efficiency

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology