Machine learning method, master node, work nodes and system

A working node and machine learning technology, applied in the field of computer communication, can solve problems such as large synchronization overhead, long idle time, and weak processing capacity
CN107944566AActive Publication Date: 2018-04-20HANGZHOU CLOUDBRAIN TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
HANGZHOU CLOUDBRAIN TECH CO LTD
Publication Date
2018-04-20

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The embodiment of the present invention provides a machine learning method, a master node, work nodes and a distributed machine learning system which are used to reduce the synchronization overhead ofmachine learning. The master node of the distributed machine learning system starts a parameter training process, the work nodes that join the parameter training process are determined, and time information corresponding to the parameter training process is sent to the work nodes, wherein the time information includes an end time of the parameter training process. After the work nodes receive a notification that the master node determines that the work nodes join the parameter training process, the time information corresponding to the parameter training process sent by the master node is obtained. The parameter training is carried out in a time range indicated by the time information, and the master node updates global parameters based on obtained training sub results after the master node receives training sub results sent by work nodes which join the parameter training process.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of computer communication, in particular to a machine learning method, a master node, a working node and a distributed machine learning system. Background technique

[0002] With the advent of the era of big data, big data processing technology has gradually developed. With the increase of input training data and data models, there are problems such as memory limitation and time limitation in machine learning training of a single node, so distributed machine learning came into being. The distributed machine learning system includes a master node and multiple working nodes. Its core goal is that the master node disassembles computing tasks into multiple small tasks, and distributes them to the processors of multiple working nodes for calculation. That is to say, different working nodes correspond to the same training model. After each working node is assigned different data for parameter training, the traini...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More