Adaptive distributed parallel training method for neural network based on reinforcement learning

A reinforcement learning and neural network technology, applied in the field of model parallel training schemes, can solve the problems of inability to guarantee other performance of the strategy, single parallel dimension, etc., to achieve the effect of expanding offline learning capabilities, speeding up, and improving comprehensive performance
CN113128702APending Publication Date: 2021-07-16HANGZHOU DIANZI UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
HANGZHOU DIANZI UNIV
Publication Date
2021-07-16

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses an adaptive distributed parallel training method for a neural network based on reinforcement learning, and provides an optimal solution for segmentation and scheduling of a large-scale complex neural network. Firstly, the influence of a neural network model structure and calculation attributes on execution performance is analyzed, on this basis, performance factors including calculation cost, communication cost, memory utilization rate and the like are extracted, a multi-dimensional performance evaluation model capable of comprehensively reflecting distributed training performance is constructed, and comprehensive performance of a parallel strategy is improved; secondly, self-adaptive grouping of operators is realized according to attribute characteristics of the operators by utilizing a feed-forward network, the degree of parallelism is determined, and end-to-end strategy search is realized while the search space is reduced; and finally, based on importance sampling, a near-end strategy gradient iteration optimization reinforcement learning model is adopted, an optimal segmentation and scheduling strategy is searched, the strategy network offline learning capability is expanded, and algorithm stability, convergence rate and strategy search performance are improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to a neural network adaptive distributed parallel training method based on reinforcement learning, which provides an optimal model parallel training scheme for large-scale complex neural networks. Background technique

[0002] In recent years, benefiting from the development of AI algorithms, hardware computing power, and data sets, deep neural network technology has been widely used in natural language processing, computer vision, and search recommendation. As these fields continue to iteratively develop larger-scale and more complex neural networks, it is difficult for "Moore's Law" to match the computing needs, and a single device can no longer support large-scale deep network training. Therefore, it has become a common method to solve large-scale neural network training by researching and dividing the neural network calculation graph, and scheduling the divided network to clusters containing multiple CPUs and GPUs to achieve m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More