Automated task parallel method suitable for distributed machine learning and system thereof

A machine learning and distributed technology, applied in database distribution/replication, instruments, computer components, etc.

Active Publication Date: 2016-09-21
HUAZHONG UNIV OF SCI & TECH
View PDF5 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The above method and system can effectively solve the network transmission bottleneck problem in the existing distributed machine learning system and improve the system concurrency, thereby improving the overall performance of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automated task parallel method suitable for distributed machine learning and system thereof
  • Automated task parallel method suitable for distributed machine learning and system thereof
  • Automated task parallel method suitable for distributed machine learning and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0063] figure 1 It is a module block diagram of the automatic task parallel execution system of the present invention. Such as figure 1 As shown, the automatic task parallel system of the present invention specifically includes a working node module, a service node module, a master node module, a tensor module, a scheduling module, a message tracking module, a stage module, a stage group modul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an automated task parallel method suitable for distributed machine learning and a system thereof. The method and the system solve defects of a programming interface in existing distributed machine learning, and tight coupling of system data access behaviors and application logic caused by just providing a reading-writing interface of key value pairs. The defect intensifies network bandwidth resource competition in distributed cluster, and causes that programming personnel is difficult to perform parallelization on a task. The system comprises a work node module, a service node module, a host node module, a tensor module, a scheduling module, a message tracking module, a stage module, a stage group module, and an executing engine module. Through providing higher-level programming abstraction, the system decouples logic of reading-writing access behaviors and an application program. In operation, the system firstly dynamically partitions tasks according to the load condition of a service node, and then machine learning tasks are automatically executed in a parallel manner, so as to greatly reduce burden of programming personnel to compile high-concurrency machine learning applications.

Description

technical field [0001] The invention belongs to the cross-technical field of distributed computing and machine learning, and in particular relates to a parallel method and system for automatic tasks suitable for distributed machine learning. Background technique [0002] As a traditional method of mining data value, machine learning algorithms are widely used in fields such as natural language processing, text analysis, speech recognition, automatic driving of motor vehicles, and bioinformatics. With the advent of the era of big data, the value of data is becoming more and more prominent, especially the commercial value contained in it, so machine learning has been valued. However, as the scale of data and the corresponding model parameters that need to be learned become larger and larger, a single computing node cannot meet the needs of large-scale machine learning due to its limited memory resources, computing resources, and memory access bandwidth resources. Distributing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/24532G06F16/27G06F18/2323
Inventor 廖小飞曹镇山郭人通刘海坤金海陆枫
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products