Method and device for managing container resources in cluster

A resource management and container technology, applied in the field of computing clusters, can solve problems such as reduced resource utilization, low operating efficiency, and container coexistence, and achieve the effects of improving independence, throughput, and efficiency

Pending Publication Date: 2022-07-22
杭州幻方人工智能基础研究有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the existing technology, for nodes that use forced deletion to release resources, there is a certain lag time from when the forced deletion occurs to when the real container is deleted and the resources are released.
During system resource scheduling, this lag time will lead to container coexistence problems. Newly started containers need to compete for resources with containers that have not been actually deleted, resulting in low operating efficiency and may even cause serious problems such as insufficient resources.
If the container is actually deleted before scheduling, it will also lead to low efficiency of the scheduling system and lower resource utilization.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for managing container resources in cluster

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. The components of the embodiments of the invention generally described and illustrated in the drawings herein may be arranged and designed in a variety of different configurations.

[0021] In a specific embodiment, the scheduling trigger time may be set to 1 second.

[0022] like figure 1 As shown, a container resource management method in a cluster is as follows: the following operation steps are triggered cyclically:

[0023] S101 presets a seat for each node in the cluster to mark the number of resources that the node can use to create a new training container.

[0024] The preset seat is used to mark the number of resources that the n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for managing container resources in a cluster. The method comprises the following steps: (1) presetting a seat for each node in the cluster; (2) after each round of scheduling, if the task running on the node needs to be interrupted and suspended, executing forced deletion on the task; (3) performing forced deletion on the task execution of the scheduling program, and enabling all nodes which terminate task operation to serve as idle available nodes to enter a new round of scheduling; (4) acquiring the existence state of a container in a single node every other seat monitoring trigger time, and updating the seat state in real time; (5) applying for resources of a seat when a container is started for a node needing to execute a new training task in a new round of scheduling, and waiting for the seat resources if no seat resources exist; and (6) creating a new container, starting a new training task, and changing the seat state. The efficiency of the scheduling system can be improved, risks caused by coexistence of training containers are eliminated, and meanwhile the independence of the scheduling module is improved.

Description

technical field [0001] The invention relates to the field of computing clusters, in particular to a method and device for managing container resources in a cluster. Background technique [0002] With the development of cloud computing technology, complex models need to be trained using large-scale computing clusters, and Kubernetes container technology is usually used for large-scale deployment of training environments. The process of model training needs to consider the problem of computing power allocation and task scheduling. The invention application (patent application number: 2021114594024) discloses a method for cluster management task scheduling. Allocate available nodes for tasks with running permission. For tasks that have not obtained running permission: if they are in a non-running state, suspend them and continue to wait; if they are already running, they need to be suspended and the node resources occupied by them need to be released. . However, in the prior ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/455G06F9/48G06F9/50
CPCG06F9/45558G06F9/4881G06F9/5016G06F9/5022G06F9/5038G06F9/5077G06F2009/45583G06F2009/45595G06F2009/45562
Inventor 徐进黄盼盼张浩威
Owner 杭州幻方人工智能基础研究有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products