DNN reasoning task batch scheduling method oriented to heterogeneous cluster

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A heterogeneous cluster and batch scheduling technology, which is applied in the direction of reasoning methods, electrical digital data processing, multi-programming devices, etc., can solve problems such as research on the distribution characteristics of few tasks

Pending Publication Date: 2022-05-27

SUZHOU INST FOR ADVANCED STUDY USTC

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In traditional task scheduling, little research has been done on the distribution characteristics of tasks in the task flow

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0097] Large data centers usually have very large heterogeneous computing clusters that can accommodate hundreds to thousands of computing nodes. There are a lot of trained neural network models deployed. Outside the cluster, a wide range of user groups use various DNN applications to send DNN inference task requests to the data center for inference. These tasks correspond to different inference models and allowable inference delays. The DNN task inference scheduling in this scenario includes the following three processes: cluster DNN inference request collection, DNN inference task initialization, and computing node DNN inference. The DNN-BS scheduling optimization method mainly acts on how to schedule each task to each computing node with different computing power after these DNN inference tasks reach the cluster. DNN-BS can analyze the dynamic hardware processing capability of the current cluster heterogeneous computing node division, and find the most suitable task-compu...

Embodiment 2

[0099] With the development of technologies such as the Internet of Things, artificial intelligence, and smart embedded devices, it is no longer just deploying a few large data centers to handle all DNN inference tasks. Instead, multiple small processing clusters are deployed where the network edge devices are concentrated. In this case, local-scale DNN inference task scheduling is required on small processing clusters at the network edge.

[0100] For such scenarios, the DNN-BS scheduling method can analyze all DNN applications at the edge of the network, and first classify the DNN reasoning task. At the same time, all computing nodes of the edge small cluster are divided, and then the dynamic hardware processing capability matrix is obtained. When the local DNN inference task reaches the small processing cluster, the weight parameters in the optimization objective function can be adjusted according to the scheduling target, so as to continue the complete DNN inference tas...

Embodiment 3

[0102] Smart homes also face such problems. In a smart home there is usually a processing center that processes the data from all the sensors. These data arrive at the processing center and turn into DNN inference tasks. And often there is more than one hardware processing unit in the processing center. Then all types of data analysis inside the smart home can be attributed to a limited number of DNN inference task categories, which are given to multiple hardware processing units in the processing center for inference.

[0103] In this scenario, the DNN-BS scheduling optimization method can be deployed on the scheduling nodes of the home small processing center, and the dynamic hardware processing capability matrix can be obtained by obtaining the inference time of all types of DNN inference tasks in each hardware processing unit. Afterwards, all DNN inference tasks can be scheduled to the corresponding hardware processing units for inference through the remaining DNN-BS sch...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a DNN reasoning task batch scheduling method oriented to a heterogeneous cluster. The method comprises a DNN reasoning task representation module, a heterogeneous cluster dynamic hardware processing capability extraction module and a DNN reasoning task batch scheduling module. The DNN reasoning task characterization module is used for classifying DNN reasoning tasks according to task types to construct a DNN reasoning task category set; the heterogeneous cluster dynamic hardware processing capability extraction module is used for dividing computing nodes and constructing a heterogeneous computing node set according to a division result; and the DNN reasoning task batch scheduling module is used for scheduling the DNN tasks according to the DNN reasoning task division set and the heterogeneous computing node set through a target search algorithm. Through the arrangement of the modules, effective classification of the DNN inference tasks is realized, measurement of computing power of different nodes of a heterogeneous cluster is solved, optimal scheduling of the DNN inference tasks is realized, and optimization of batch scheduling of the DNN inference tasks oriented to the heterogeneous cluster is realized.

Description

technical field [0001] Embodiments of the present invention relate to a DNN inference task batch scheduling technology, and in particular, to a DNN inference task batch scheduling method oriented to heterogeneous clusters. Background technique [0002] In recent years, with the vigorous development of deep neural network (DNN), artificial intelligence (AI) applications have been fully developed, and the application category and quantity scale have grown significantly. General purpose graphics processing unit (GPGPU) has become the mainstream deep learning network accelerator due to its superior massively parallel computing capability. [0003] Accelerating a DNN network on GPU usually consists of two stages. First, a large-scale training dataset is required to train a selected DNN model. This training process is very time-consuming and is often placed on a computing cluster with high computing power. , and then when the training is completed, the network model will be deplo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F9/48G06K9/62G06N5/04

CPCG06F9/4881G06N5/04G06F18/241Y02D10/00

Inventor 王超张仁宇朱宗卫周学海李曦

Owner SUZHOU INST FOR ADVANCED STUDY USTC

DNN reasoning task batch scheduling method oriented to heterogeneous cluster

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology