Model training method, device and equipment based on pipeline parallelism

A model training and assembly line technology, applied in the field of machine learning, to achieve the effect of improving parallelism, ensuring training effect, and improving equipment utilization

Active Publication Date: 2021-07-27
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional model parallel technology can only ensure that one wo

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method, device and equipment based on pipeline parallelism
  • Model training method, device and equipment based on pipeline parallelism
  • Model training method, device and equipment based on pipeline parallelism

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0049] Embodiments of this specification provide a model training method, apparatus, apparatus, and storage medium based on pipeline parallel.

[0050] In order to better understand the technical solution in the present specification, the technical solutions in the present specification are clear and completely described below in conjunction with the drawings in the present specification. The embodiment is merely the embodiment of the present application, not all of the embodiments. Based on the present specification, one of ordinary skill in the art shall belong to the scope of the claims of the present application without the pre-creative labor premise.

[0051]In one or more embodiments of the present specification, the workfare means that the device, process, etc. for performing model training, etc., generally a single worker (can be referred to as a worker), or a working machine composed of a plurality of workers. Cluster (can be called Stage), etc. The model is parallel, and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a model training method based on pipeline parallelism. The model training method comprises the steps of segmenting a to-be-trained model according to a normal training sequence to obtain a plurality of sub-slices; dividing a part of sub-slices with the normal training sequence in the front into a first group to form a first sub-model, and dividing the remaining sub-slices into a second group to form a second sub-model; distributing the sub-slices contained in the first sub-model to different working parties, and distributing the sub-slices contained in the second sub-model to different working parties; according to the training sequence obtained through adjustment, distributing training time blocks to the sub-slices on the working party to which the sub-slices are distributed, wherein the training sequence comprises the following steps: for a single-round training sample, sequentially executing the forward propagation of the first sub-model in the current period, the backward propagation of the first sub-model in the previous period, the forward propagation of the second sub-model in the current period and the backward propagation of the second sub-model in the current period; and according to the training time block, carrying out parallel training on the to-be-trained model on the plurality of working parties.

Description

technical field [0001] This specification relates to the field of machine learning technology, and in particular to a pipeline-based parallel model training method, device and equipment. Background technique [0002] In recent years, deep learning has greatly promoted the development of artificial intelligence technology, and has achieved good results in many fields. However, with the development of artificial intelligence technology, the parameter scale of the model is also increasing, and its growth rate has even far exceeded the growth rate of hardware processing performance. [0003] In order to solve this contradiction, model parallel technology has gradually attracted the attention of researchers. Model parallelism can divide a large-scale model into multiple model slices and distribute them to different working machines, so as to realize the training or reasoning of large-scale models. The traditional model parallel technology can only ensure that one working machin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/084G06N3/045
Inventor 杨鹏程张晓明杨明魏宏
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products