Unlock instant, AI-driven research and patent intelligence for your innovation.

Gradient fusion method and device and computer readable storage medium

A gradient and algorithm technology, applied in the field of deep learning, can solve problems such as complicated selection process and difficult selection of buffer threshold

Inactive Publication Date: 2018-12-11
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF4 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to provide a gradient fusion method, device and computer-readable storage medium to solve the problem that the selection of the buffer threshold in the traditional gradient fusion method is difficult and the selection process is complicated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gradient fusion method and device and computer readable storage medium
  • Gradient fusion method and device and computer readable storage medium
  • Gradient fusion method and device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The core of the present invention is to provide a gradient fusion method, device and computer-readable storage medium, which simplifies the gradient fusion process and improves the gradient fusion efficiency.

[0035] In order to enable those skilled in the art to better understand the solution of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0036] A method embodiment of gradient fusion provided by the present invention is introduced below, see figure 1 , this embodiment is applied to deep learning distributed training, specifically...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a gradient fusion method, which includes: the input tensor on each node in the depth learning distributed training architecture is determined, then the input tensor is transferred to the fusion buffer one by one to determine whether the number of input tensors in the fusion buffer is greater than the number of preset tensors, if the number of input tensors in the fusion buffer is greater than the number of preset tensors, the input tensors in the fusion buffer are processed by an allreduce algorithm to obtain the output tensors. As can be seen, the present invention determines whether to continue transferring tensor according to the magnitude of the number of tensors in the fusion buffer and the number of preset tensors. Compared with a method of gradient fusion based on the magnitude of all tensors in the buffer, the method of the invention solves the problem that the threshold is difficult to select due to the large difference of tensor magnitude, avoids the process of calculating and accumulating the magnitude of each tensor in the fusion buffer, thus simplifies the process of gradient fusion and improves the efficiency. The invention also provides a gradient fusion device and a computer-readable storage medium, the function of which corresponds to the function of the method.

Description

technical field [0001] The present invention relates to the field of deep learning, in particular to a gradient fusion method, device and computer-readable storage medium. Background technique [0002] Deep learning has made great progress in the past few years, especially in the fields of voice, image, machine translation, and natural language processing. Deep learning training requires massive data, which requires ultra-large-scale parameters. network model fitting. If the training data is insufficient, if the network model parameters are too few, it will cause underfitting and the model accuracy will be low. At present, the common network model parameters have reached hundreds of millions, and the parameter size has reached several gigabytes. The data parallel training method requires each GPU node to have a complete copy of the model parameters, and to send and receive complete gradient data when merging gradients. The huge amount of communication data brings huge netw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T5/50
CPCG06T5/50G06T2207/20081G06T2207/20221
Inventor 黄雪刘姝
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND