Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

GPU (Graphics Processing Unit) resource scheduling method and system based on multi-dimensional combination parallelism

A technology of resource scheduling and resource scheduling module, applied in the field of graphics processor and computing, can solve the problems of high transformation cost and insufficient utilization of resources static pooling, and achieve the effect of improving utilization and scalability.

Active Publication Date: 2022-07-12
ZHEJIANG LAB
View PDF10 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, for the problem of resource utilization, the traditional solution is generally based on the optimization of the cloud-native cluster scheduler, and the transformation cost is relatively high; and for the problem of exclusive resource sharing, each cloud platform generally adopts node locking and resource pooling solutions. On the one hand, reasonable resource allocation needs to be carried out in advance; on the other hand, the static pooling of resources has the disadvantage of insufficient utilization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU (Graphics Processing Unit) resource scheduling method and system based on multi-dimensional combination parallelism
  • GPU (Graphics Processing Unit) resource scheduling method and system based on multi-dimensional combination parallelism
  • GPU (Graphics Processing Unit) resource scheduling method and system based on multi-dimensional combination parallelism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the objectives, technical solutions and technical effects of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments of the description.

[0037] like figure 1 As shown, a GPU resource scheduling method based on multi-dimensional combination and parallelism, the method specifically includes the following steps:

[0038] Step 1, add plug-in modules of various types of GPU resources in the GPU resource management center.

[0039] Each plug-in module runs based on GPU resources and initialized configuration information. The configuration information includes GPU type, GPU specific model, list of supported resource scheduling labels, driver installation scripts suitable for GPU heterogeneous clusters, and various types of GPU heterogeneous clusters. The IP list and connection method corresponding to the GPU node; the resource scheduling label of the supported...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a GPU (Graphics Processing Unit) resource scheduling method and system based on multi-dimensional combination parallelism. The method comprises the following steps of: 1, adding a GPU resource plug-in module in a GPU resource management center; 2, running a plug-in module, and completing corresponding linkage of various types of GPU resources of a data layer and a physical layer; 3, the GPU resource management center collects real-time basic information of GPU resources from the GPU heterogeneous cluster and issues the real-time basic information to a GPU resource scheduling module; 4, when the GPU resources are called, a request task supporting multi-dimensional combined scheduling is sent to a GPU resource scheduling module; and 5, the GPU resource scheduling module converts the main task of the request task into an executable single-dimensional scheduling sub-task and issues the sub-task to the GPU heterogeneous cluster, and the GPU heterogeneous cluster allocates GPU resources as required according to the sub-task. According to the invention, the utilization rate of GPU resources can be effectively improved.

Description

technical field [0001] The invention relates to the fields of graphics processors and computing technologies, and in particular to a GPU resource scheduling method and system based on multi-dimensional combined parallelism. Background technique [0002] In the era of AI (English full name Aritificial Intelligence, Chinese translation for artificial intelligence), algorithm engineers need to perform a large number of deep learning tasks, usually using Docker containers as training environments, using expensive Graphics Processing Unit (Graphics Processing Unit, referred to as GPU) cards can be Significantly improve the training speed, and distributed training is also the most frequent training scenario at the moment. When algorithm engineers need to use GPU resources, they need to allocate GPU resources. On the one hand, how to maximize the use of idle GPU resources is a problem that needs to be solved; on the other hand, the performance of different types and models of GPU c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/50G06F9/48
CPCG06F9/5038G06F9/5066G06F9/4881G06F2209/484G06F2209/5021Y02D10/00
Inventor 叶玥里哲崔广章
Owner ZHEJIANG LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products