Distributed depth learning system based on momentum and pruning

A deep learning and distributed technology, applied in the direction of neural learning methods, neural architecture, biological neural network models, etc., can solve the problems of many cluster nodes, slow batch synchronization, and large weight dimensions of deep learning models, so as to accelerate the model The effects of slow convergence, increased utilization, and improved parameter update mechanism

Active Publication Date: 2019-02-01
ANHUI UNIVERSITY OF TECHNOLOGY +1
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In 2012, some scholars proposed a distributed stochastic gradient descent algorithm. However, the batch synchronization implementation of stochastic optimization is usually slow in practice because it needs to wait for the slowest machine in each synchronization batch.
Based on the traditional distributed deep learning system, the weight dimension of the deep learning model is large, there are many cluster nodes, and the interactive communication load is relatively serious

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed depth learning system based on momentum and pruning
  • Distributed depth learning system based on momentum and pruning
  • Distributed depth learning system based on momentum and pruning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to better understand the technical content of the present invention, specific embodiments are given together with the attached drawings for description as follows.

[0047] Aspects of the invention are described in this disclosure with reference to the accompanying drawings, which show a number of illustrated embodiments. Embodiments of the present disclosure are not necessarily intended to include all aspects of the invention. It should be appreciated that the various concepts and embodiments described above, as well as those described in more detail below, can be implemented in any of numerous ways, since the concepts and embodiments disclosed herein are not limited to any implementation. In addition, some aspects of the present disclosure may be used alone or in any suitable combination with other aspects of the present disclosure.

[0048] The present invention integrates the Apache Spark big data cluster framework and Caffe deep computing capability, des...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a distributed depth learning system based on momentum and pruning, which relates to the field of cloud computing and depth learning. A spark distributed cluster is adopted, comprising a main control node and a plurality of working nodes. Any working node is connected with the main control node through a communication link, and the working node stores part of training data. According to the batch of training data stored locally, the work node propagates the training of the depth learning model forward and backward, obtains the update quantity of the weight parameter of the work node, and sends it to the main control node for interactive communication with the global weight parameter of the main control node. The main control node records the node information of the work node, balances the weight parameters of the obtained work node and transmits the obtained weight parameters to each work node. Repeating the interactive communication between the master control node and the working node until the iterative number or convergence condition is reached; The invention solves the problem that the model of the asynchronous algorithm converges slowly under the distributed environment, and at the same time, the operation speed of the asynchronous algorithm is improved.

Description

technical field [0001] The invention relates to the field of cloud computing and deep learning, in particular to a distributed deep learning system based on momentum and pruning. Background technique [0002] In the field of machine learning problems, deep learning has made remarkable progress. Relying on massive training data, deep learning methods can train more accurate models. However, the main disadvantage of the deep model is that there are many model parameters, and the calculation amount increases exponentially with the increase of the number of layers, so a distributed system is needed to manage such a huge calculation task and data. ApacheSpark is one of the mainstream distributed systems at present. It has high-performance memory data processing speed and fault tolerance. Therefore, the combination of deep learning and Spark framework has attracted extensive attention. Relevant researchers have proposed SparkNet, CaffeOnSpark, DeepSpark and Frameworks such as Te...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/084G06N3/045
Inventor 郑啸邵光梅梁越永何诗兴后盾
Owner ANHUI UNIVERSITY OF TECHNOLOGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products