Distributed system and method for performing machine learning on data records

A distributed system and data recording technology, applied in the field of artificial intelligence, can solve the problems of network overhead and large amount of computation, and achieve the effect of reducing network transmission overhead and achieving homogenization

Active Publication Date: 2019-09-17
THE FOURTH PARADIGM BEIJING TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Exemplary embodiments of the present invention aim to overcome the shortcomings of existing distributed machine learning systems that have large network overhead and computational load when performing machine learning

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed system and method for performing machine learning on data records
  • Distributed system and method for performing machine learning on data records
  • Distributed system and method for performing machine learning on data records

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] In order to enable those skilled in the art to better understand the present invention, exemplary embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings and specific implementation methods.

[0030] Machine learning is an inevitable product of the development of artificial intelligence research to a certain stage. It is committed to improving the performance of the system itself by means of calculation and using experience. In a computer system, "experience" usually exists in the form of "data". Through machine learning algorithms, a "model" can be generated from the data. The model can be expressed as an algorithm function under specific parameters, that is, experience When the data is provided to the machine learning algorithm, a model can be generated based on these empirical data (that is, the parameters of the function are learned based on the data), and when faced with a new situation, the model ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a distributed system for performing machine learning on data record and a method thereof. The system comprises multiple computing devices of which each performs the same data flow computation about machine learning on its own data record; and a parameter server which is used for maintaining the parameters of a machine learning model, wherein when data flow computation for training the machine learning model is performed, the computing devices use the parameters acquired from the parameter server to perform the same operation about machine learning model training on the respective data record, and the parameter server updates the parameters according to the operation result of the computing devices; besides / or when data flow computation for estimation through the machine learning model is performed, the computing devices use the parameters acquired from the parameter server to perform the same operation about machine learning model estimation on the respective data record. Therefore, homogenization between the computing devices can be realized and the network transmission overhead can be reduced.

Description

technical field [0001] Exemplary embodiments of the present invention relate generally to the field of artificial intelligence, and more particularly, to a distributed system for performing machine learning on data records and a method for performing machine learning on data records using the distributed system. Background technique [0002] With the rapid growth of data scale, machine learning is widely used in various fields to mine the value of data. However, in order to perform machine learning, the memory of general physical machines is far from enough. Therefore, in practice, it is often necessary to use a distributed machine learning platform to complete the training of machine learning models or corresponding predictions. [0003] In existing distributed machine learning systems (for example, in Google's deep learning framework TensorFlow), there are usually one or more control nodes, which are responsible for scheduling tasks and computing resources of other computi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06N20/00
Inventor 戴文渊陈雨强杨强焦英翔涂威威石光川
Owner THE FOURTH PARADIGM BEIJING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products