Unlock instant, AI-driven research and patent intelligence for your innovation.

Parallel training method and device of neural network model and electronic equipment

A neural network model and training method technology, applied in the field of devices, parallel training methods of neural network models, electronic equipment and storage media, can solve the problems of low efficiency of BERT's overall reasoning tasks, improve parallel training speed, and improve processing capabilities Effect

Pending Publication Date: 2022-04-05
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Softmax (normalization) is an important operator in the BERT model. If this operator runs on the GPU for too long, it will lead to inefficiency in the overall reasoning task of BERT. Therefore, it is necessary to improve the neural network model through parallel processing. The training speed reduces the training time of the neural network model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel training method and device of neural network model and electronic equipment
  • Parallel training method and device of neural network model and electronic equipment
  • Parallel training method and device of neural network model and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with the accompanying drawings, and the described embodiments should not be considered as limiting the present invention, and those of ordinary skill in the art do not make any All other embodiments obtained under the premise of creative labor belong to the protection scope of the present invention.

[0066] In the following description, references to "some embodiments" describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or a different subset of all possible embodiments, and Can be combined with each other without conflict.

[0067] Before further describing the embodiments of the present invention in detail, the nouns and terms involved in the embodiments of the present invention are described, and the nouns and terms involved in the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a parallel training method of a neural network model. The method comprises the following steps: segmenting a target neural network model to obtain a block structure of the target neural network model; when the target neural network model is trained, monitoring the state change of the tensor information; determining the state of a block structure of the target neural network model according to the state change of the tensor information; in response to the state of the block structure of the target neural network model, the storage positions of the parameters of the target neural network model corresponding to different block structures are adjusted through the expelling strategy of the block structure, and the invention further provides a parallel training device of the neural network model, electronic equipment and a storage medium. According to the invention, parallel training of the target neural network model can be realized.

Description

technical field [0001] The invention relates to a training technology of a neural network model, in particular to a parallel training method, device, electronic equipment and storage medium of a neural network model. Background technique [0002] A machine learning model usually includes multiple operators. If the calculation time of each operator is too long, the service response will be too slow. For example, for the BERT model (Bid directiona l Encode eRepresentation from Transformers, a general pre-trained language representation model) in the field of natural language, in actual application scenarios, rich online service scenarios can be deployed based on the BERT model. In this scenario, people often use Graphics Processing Unit (GPU) to parallelize the BERT service calculation process to improve online response speed and reduce service delay. Softmax (normalization) is an important operator in the BERT model. If this operator runs on the GPU for too long, it will lea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04G06N3/063
Inventor 方佳瑞
Owner TENCENT TECH (SHENZHEN) CO LTD