Supercharge Your Innovation With Domain-Expert AI Agents!

Method and system for scheduling and deploying deep learning models under limited hardware resources

A hardware resource, deep learning technology, applied in the field of deep learning model scheduling and deployment, can solve the problems of inability to finely control computing resources, difficult to expand, complex dependencies, etc., to achieve the effect of convenient subsequent expansion and upgrade

Active Publication Date: 2021-10-08
上海体素信息科技有限公司
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The optimization direction of these engines and tools is more focused on their own hardware or the framework itself, and it is impossible to optimize the effective automatic loading and unloading and reasonable scheduling of the deployed multi-model according to the characteristics of the inference task under the condition of limited computing resources.
In the existing medical image recognition scenario, the overall business process is jointly completed by multiple large-scale deep model reasoning tasks with complex dependencies, including multiple modules such as pre-processing, model reasoning, and post-processing. complex
Because the GPU memory is limited and difficult to expand, the overall reasoning task is limited by the memory, and the reasoning process takes a long time
Although most of the existing inference deployment engines can deploy multiple model services at the same time, under the condition of video memory limitation, they cannot reasonably schedule model inference tasks according to the resource consumption of model inference runtime, and cannot finely control computing resources.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for scheduling and deploying deep learning models under limited hardware resources
  • Method and system for scheduling and deploying deep learning models under limited hardware resources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0080] The present invention relates to the scheduling and deployment of depth learning models. The present invention provides a multi-task scheduling method that can make full utilization of multi-task, flexible scheduling model, and allocation of a defined computational resource, flexibility scheduling model, and allocation resource. It can realize that when there is a complex dependency of multiple depth learning models, based on the deployment of environmental resources, the depth learning model will be flexibly scheduled, so that limited computing resources will be reasonably utilized to reduce the time consumption of overall reasoning tasks.

[0081] Complex depth learning model reasoning tasks can generally be combined by multiple model reasoning, phase tasks, and phase tasks associated with the pre-processing before and after the introduction of these model reinforcing tasks. A complex reasoning task is completed in accordance with certain dependent order.

[0082] These s...

Embodiment 2

[0161] Example 2 is a modification of Example 1

[0162] When the framework is deployed to process the system task of the medical image identifier. The framework will initialize the processing component and phase component information, start the scheduling component thread, and provide services in the main thread.

[0163] like figure 2 As shown, after receiving the medical image input request of 5 patients, the main thread will build 5 task components JOB, JOB_1-JOB_5, and generate each task component job as needed to complete the four models to generate 4 STAGE 4 Phase Status Component Task, Task_a_1 that identifies the phase status component TASK when the corresponding task component JOB_1 needs to execute the phase component STAGE_A model. And according to the status of the stage status component TASK, add Phase State Component Task to the waiting cell or in-pool.

[0164] like figure 1 As shown, the scheduling component Scheduler in the scheduling component SCHEDULER is expec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a deep learning model scheduling and deployment method and system under limited hardware resource conditions, including: Step M1: Obtain multiple phase tasks with dependencies in deep learning reasoning tasks; Step M2: Perform multiple phase tasks Resource consumption is evaluated separately; Step M3: Evaluate the currently available deployment hardware resources; Step M4: Select the currently runnable stage tasks from multiple dependent stage tasks; schedule the currently runnable stage tasks according to the currently available deployment hardware resources Stage tasks and update the evaluation of available deployment hardware resources, and repeat step M4 until all stage tasks in the deep learning reasoning task are completed. The invention realizes efficient adaptation of computing resources in different deployment environments and flexible adaptation of subsequent reasoning model changes.

Description

Technical field [0001] The present invention relates to the scheduling and deployment of depth learning models, and in particular, to the deployment method and system scheduling deployment method and system for depth learning models under limited hardware resource conditions. Background technique [0002] With the rapid development of computer hardware and the rapid development of deep learning, various neural network models are applied to various fields such as life health, retail, industrial. The successful application of depth learning models is in business sectors depending on multiple links. In addition to model training, it is often necessary to optimize and deploy training well-trained models for use. The user passed to the deployed model to the data, and after the input data passes the model reasoning, the user gets the corresponding output result. [0003] In order to optimize the model reasoning process, the model reasoning engine that is convenient for industrial deplo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/50G06F9/54G06N5/04
CPCG06F9/5038G06F9/546G06N5/04
Inventor 陈伟睿党康王子龙
Owner 上海体素信息科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More