Adaptive distributed parallel training method for neural network based on reinforcement learning
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- HANGZHOU DIANZI UNIV
- Publication Date
- 2021-07-16
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to a neural network adaptive distributed parallel training method based on reinforcement learning, which provides an optimal model parallel training scheme for large-scale complex neural networks. Background technique
[0002] In recent years, benefiting from the development of AI algorithms, hardware computing power, and data sets, deep neural network technology has been widely used in natural language processing, computer vision, and search recommendation. As these fields continue to iteratively develop larger-scale and more complex neural networks, it is difficult for "Moore's Law" to match the computing needs, and a single device can no longer support large-scale deep network training. Therefore, it has become a common method to solve large-scale neural network training by researching and dividing the neural network calculation graph, and scheduling the divided network to clusters containing multiple CPUs and GPUs to achieve m...