Parameter synchronization optimization method and system suitable for distributed machine learning

A machine learning, distributed technology, applied in the direction of program synchronization, resource allocation, multi-program device, etc., can solve the bottleneck of parameter synchronization, reduce the communication frequency and other problems

Active Publication Date: 2015-06-17
HUAZHONG UNIV OF SCI & TECH
View PDF5 Cites 75 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

By monitoring the resource occupancy of the parameter server, different synchronization time intervals are selected for different working nodes to avoid request bursts, and at the same time ensure that the selected time interval can meet the r

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parameter synchronization optimization method and system suitable for distributed machine learning
  • Parameter synchronization optimization method and system suitable for distributed machine learning
  • Parameter synchronization optimization method and system suitable for distributed machine learning

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0063] In order to make the objectives, technical solutions and advantages of the present invention clearer, the following further describes the present invention in detail with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not conflict with each other.

[0064] figure 1 It is the structural block diagram of the parameter synchronization optimization system of the present invention. Such as figure 1 As shown, the parameter synchronization optimization system of the present invention includes a resource monitoring and allocation module and a parameter maintenance module at the parameter server side, a server resource request module at each working node sid...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a parameter synchronization optimization method and system suitable for distributed machine learning. A machine learning algorithm achieved in a parameter server distribution mode is used for overcoming bottlenecks, such as a large amount of parallel machine learning training time delay caused by insufficient network and parameter server resources, of an existing algorithm in the parameter synchronization process. The system comprises a resource monitoring and distributing module at the parameter server end, a parameter maintaining module at the parameter server end, server resource request modules of all working nodes, parameter synchronization time interval control modules of the working nodes, non-synchronization time accumulation modules of the working nodes, parameter calculation modules of the working nodes and parameter synchronization modules of the working node. According to the parameter synchronization optimization method and system, different synchronization time intervals are selected for the different working nodes to avoid request emergency situations by monitoring resource occupancy conditions of a parameter server; meanwhile, it is guaranteed that the selected time intervals can meet the requirements for communication frequency reducing and training accurate rate guaranteeing at the same time, and the bottlenecks of an existing distributed machine learning system in the parameter synchronization process are effectively avoided.

Description

technical field [0001] The invention belongs to the interdisciplinary technical field of distributed computing and machine learning, and specifically relates to a parameter synchronization optimization method and system suitable for distributed machine learning. Background technique [0002] With the advent of the era of big data, machine learning algorithms, especially deep learning algorithms suitable for large-scale data, are receiving more and more attention and applications, including speech recognition, image recognition, and natural language processing. However, with the increase of the input training data (a type of data used to solve the neural network model in machine learning) and the neural network model, there are memory limitations and weeks or even months of training time for single-node machine learning training. problem, distributed machine learning came into being. Distributed machine learning has received widespread attention in both industry and academia...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/52G06F9/50
Inventor 廖小飞王思远范学鹏金海姚琼杰
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products