GPU resource-oriented task scheduling method, device and system

A task scheduling and resource technology, applied in the computer field, can solve the problems of occupying video memory resources, waste of video memory resources, low utilization rate of video memory resources, etc., to achieve the effect of refining granularity and improving utilization rate

Inactive Publication Date: 2019-07-09
北京朗镜科技有限责任公司
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, a single working server in a server cluster is configured with multiple GPUs, and each GPU has a certain amount of video memory resources. Deep learning tasks need to occupy the video memory resources in the GPU.
Since a single task often does not need to use the memory resources of the entire GPU, the traditional method is likely to cause waste of memory resources in the GPU, and the utilization rate of memory resources in the GPU is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU resource-oriented task scheduling method, device and system
  • GPU resource-oriented task scheduling method, device and system
  • GPU resource-oriented task scheduling method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0058] In the traditional method of managing and scheduling with a single GPU as the smallest unit, since the video memory resources included in the server are relatively large, generally, a single GPU has a video memory resource of 16G. However, the demand for video memory resources for each task will be much smaller than 16G. If each task is assigned a correspon...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a GPU resource-oriented task scheduling method. . The method comprises the following steps of when a task (such as a first target task) needs to be executed by using a video memory resource in a server cluster, calculating the size of a video memory resource required by the first target task according to the attribute information of the first target task included in the first task request and a task resource use configuration table, and then, according to the available resource state table of the server cluster, determining the GPU meeting the video memory resource size required by the first target task, so that the task execution module executes the first target task by utilizing the target GPU. According to the scheme, when the task request is received, video memory resource estimation of the task is carried out, the available GPU is selected with the video memory resource as the minimum unit to complete the task, and the granularity of resource management and task scheduling is refined. If the video memory resources in one GPU meet the task requirements, the purpose that one GPU executes multiple tasks in parallel can be achieved, and the utilization rate of the video memory resources in the GPU is increased.

Description

technical field [0001] The present application relates to the field of computers, in particular to a GPU resource-oriented task scheduling method, device and system. Background technique [0002] In recent years, artificial intelligence technology has developed rapidly and matured, and many industries have introduced related technologies to improve production efficiency and competitiveness. For example, the intelligent retail industry introduces image intelligent recognition technology to realize unmanned sales, intelligent statistical prediction of sales data, etc. Related deep learning tasks such as image processing, speech recognition, and machine translation usually use a Graphic Processing Unit (GPU for short) on a server cluster as the main computing resource. GPU resource management and task scheduling of server clusters are important links in business systems. [0003] The traditional GPU resource-oriented task scheduling method is to assign a GPU to each task when...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50G06F9/48
CPCG06F9/4881G06F9/505G06F9/5083
Inventor 许欢庆牟永奇李洁汤劲武
Owner 北京朗镜科技有限责任公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products