Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

GPU-based cluster resource allocation method and device

A GPU cluster and resource allocation technology, applied in the field of resource allocation based on GPU clusters, can solve the problems of GPU resource waste, slow task execution speed, low work efficiency, etc., and achieve the effect of improving resource utilization

Active Publication Date: 2019-11-05
SUZHOU LANGCHAO INTELLIGENT TECH CO LTD
View PDF6 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Currently, resource scheduling software commonly used in GPU clusters (systems that manage a large number of GPU cards) include pbs (PortableBatch System) and slurm (Simple Linux Utility for Resource Management). Using existing resource scheduling software makes the utilization rate of GPU clusters low , the GPU (Graphic Processing Unit, Graphics Processing Unit) card that allocates processing tasks has a utilization rate of about 30% to 60%, and there is about 30% of resources that are not used and cause waste
[0003] The unit of the scheduler software pbs and slurm to allocate GPU cards is the number of cards. When scheduling GPU resources, one or more GPU cards are allocated to tasks. When tasks are running, the GPU utilization rate is not high, resulting in Some GPU resources are wasted
At the same time, the resource scheduling of the existing scheduler software pbs and slurm has multiplexing of GPU cards, that is, more than two tasks can be run on one GPU card. Although this can make full use of GPU resources, but because a GPU card simultaneously Running multiple tasks slows down task execution, resulting in very low work efficiency and longer task completion cycles
Moreover, the current technology only implements resource scheduling, and there is no related technology for rational allocation of resources. In view of the problems existing in the existing GPU cluster resource scheduling, it is urgent to propose a method based on GPU cluster resource allocation. Method and device capable of improving GPU cluster utilization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU-based cluster resource allocation method and device
  • GPU-based cluster resource allocation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] In order to make the purpose, technical solution and advantages of the present invention more clear, the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined arbitrarily with each other.

[0071] The steps shown in the flowcharts of the figures may be performed in a computer system, such as a set of computer-executable instructions. Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0072] figure 1 It is a flow chart of a method for allocating resources based on a GPU cluster according to an embodiment of the present invention. According to the flow chart, this embodiment is based on a method for allocating resources of a GPU cluster...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a GPU-based cluster resource allocation method. A GPU cluster comprises a plurality of GPU cards. The method comprises the steps of obtaining to-be-processed tasks, wherein theto-be-processed tasks comprise a large task and a small task; the big task refers to a to-be-processed task of which the required resource quantity is greater than or equal to one GPU card; the small task refers to a to-be-processed task of which the required resource quantity is smaller than that of one GPU card; according to the priority sequence of the to-be-processed tasks, allocating one ormore GPU cards to the big tasks for execution; only one big task is allocated to each GPU card; obtaining the resource surplus of each GPU card executing the task; and for each GPU card with the resource surplus, traversing the unexecuted small tasks according to a priority sequence, if the resource surplus of the GPU cards is found to meet the unexecuted small tasks, allocating the resource surplus of the GPU cards to the small tasks, and updating the resource surplus of the GPU cards. Through the scheme of the invention, the GPU cluster resource utilization rate is improved.

Description

technical field [0001] The invention relates to the field of cloud computing, in particular to a method and device for resource allocation based on GPU clusters. Background technique [0002] Currently, resource scheduling software commonly used in GPU clusters (systems that manage a large number of GPU cards) include pbs (PortableBatch System) and slurm (Simple Linux Utility for Resource Management). Using existing resource scheduling software makes the utilization rate of GPU clusters low A GPU (Graphic Processing Unit, Graphics Processing Unit) card that allocates processing tasks has a usage rate of about 30% to 60%, and about 30% of the resources are not used, resulting in waste. [0003] The unit of the scheduler software pbs and slurm to allocate GPU cards is the number of cards. When scheduling GPU resources, one or more GPU cards are allocated to tasks. When tasks are running, the utilization rate of GPUs is not high, resulting in Some GPU resources are wasted. At...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/50
CPCG06F9/5038
Inventor 姬贵阳
Owner SUZHOU LANGCHAO INTELLIGENT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products