Distributed machine learning task online scheduling method based on side cloud cooperation

A technology for machine learning and scheduling methods, applied in machine learning, instrumentation, program startup/switching, etc., can solve problems such as low latency, inability to capture edge network characteristics, and limited edge server resources.

Pending Publication Date: 2021-03-30
南京万般上品信息技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, training ML models in EI faces many challenges: (1) Compared with remote clouds, edge servers have the advantage of low latency, but their computing resources are relatively scarce. Therefore, how to balance the edge-cloud time Delay and computing power are very important for model training; (2) due to limited edge server resources, it is difficult for all workers and PSs of an ML task to be placed on the same edge server, so workers and PSs need to communicate frequently, which will further slow down Ho...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed machine learning task online scheduling method based on side cloud cooperation
  • Distributed machine learning task online scheduling method based on side cloud cooperation
  • Distributed machine learning task online scheduling method based on side cloud cooperation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative work all belong to the protection scope of the present invention.

[0053] see Figure 1-4 , an online scheduling method for distributed machine learning tasks based on edge-cloud collaboration. In this embodiment, we use S=200 edge servers, each edge server is equipped with 0:5 workers and 0:3 PSs, The types of worker and PS are set to 8:10, and their bandwidths are set to [100,5*1024]Mbps and [5,20]Gbps respectively. The configuration of the ML task is as follows: E j ∈[50,100],D j ∈[50,100], B j ∈[10,50],m j ∈[0....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed machine learning task online scheduling method based on edge cloud cooperation. The method comprises the following steps: step 1, modeling an ML task scheduling problem of minimizing total completion time by using integer linear programming; 2, establishing a lower bound of a target function based on the irrelevant parallel machine model, replacing xjsw and njof the original problem with a new variable njsw, and customizing the original problem again; 3, decoupling the re-customized problem into two optimization problems; 4, for each arrived ML task, sequentially distributing Dj data blocks of the ML task to a worker; and step 5, recording a newly arrived ML task set as [Ja], and after all tasks in the [Ja] are allocated with works, sequentially judging whether each task is using the worker or not. The method has the advantages that cloud and edge server providers can be helped to maximize resource utilization of the cloud and edge server providers so as to obtain economic benefits as high as possible, and meanwhile, dynamic scheduling is performed according to the task load and the demand difference of each user so as to enable the overall training time to be shortest.

Description

technical field [0001] The present invention relates to the technical field of edge computing, in particular to an online scheduling method for distributed machine learning tasks based on edge-cloud collaboration. Background technique [0002] With the breakthrough of 5G and the emergence of edge computing, it has become a trend to push the frontier of artificial intelligence (AI) to the edge, and a new field - edge intelligence (EI) has been generated. Many large companies have conducted extensive research on EI and produced corresponding products. For example, IBM has developed IEAM for synchronous training of machine learning (ML) models, and Google has designed special application integrated circuits for running AI interfaces on edge networks. Edge TPU. In the field of EI, how to train ML tasks is an important issue. ML tasks with the characteristics of computation-intensive and time-consuming usually use data parallel mode and parameter server (PS) framework for model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/48G06N20/00
CPCG06F9/4806G06N20/00
Inventor 王讷周睿婷李宗鹏黄浩
Owner 南京万般上品信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products