Cluster job dispatching method and system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A job scheduling and clustering technology, applied in the field of biological information, can solve the problems of inefficient operation of analysis tools, waste of cluster computing resources, and characteristics of tool operation.

Active Publication Date: 2016-01-20

INSPUR BEIJING ELECTRONICS INFORMATION IND

View PDF4 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] However, each tool in the Galaxy bioinformatics analysis platform has corresponding operating characteristics and is suitable for running on different types of nodes. However, it is currently impossible to submit jobs to a specific node according to the operating characteristics of each tool, resulting in The operating efficiency of analysis tools is not high, and at the same time, it causes a waste of cluster computing resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0036] The invention provides a cluster job scheduling method, figure 1 A flow chart of Embodiment 1 of the cluster job scheduling method of the present invention is shown, including:

[0037] Step S101: Detect configuration information of all nodes in the high-performance cluster;

[0038] First, in the high-performance cluster, install and configure the PBS job scheduling system, install the Galaxy bioinformatics analysis platform, and install analysis tools such as gene sequencing tools to detect the configuration information of all nodes in the HPC high-performance cluster, including the number of CPUs, CPU frequency, and memory size and other configuration information, such as detecting the configuration information of node1 in the cluster, and detecting that the configuration of node1 is: fat node, high frequency, and large memory.

[0039] Step S102: In the PBS job scheduling system, according to the detected configuration information of each of the nodes, mark the cor...

Embodiment 2

[0050] The present invention also provides a cluster job scheduling system, Figure 4 A schematic structural diagram of Embodiment 2 of the cluster job scheduling system of the present invention is shown, including:

[0051] Configuration information acquisition module 101, configured to detect configuration information of all nodes in the high-performance cluster;

[0052]The node queue attribute marking module 102 is used to mark the corresponding node queue attributes for each of the nodes according to the detected configuration information of each of the nodes in the PBS job scheduling system;

[0053] The tool queue attribute marking module 103 is used to mark the corresponding tool queue attributes for each analysis tool according to the operating characteristics of each analysis tool in the Gxlaxy biological information analysis platform;

[0054] A matching module 104, configured to match the corresponding target node according to the tool queue attribute of the targe...

Embodiment 3

[0058] Figure 5 It shows a schematic structural diagram of Embodiment 3 of the cluster job scheduling system of the present invention, corresponding to Figure 4 ,Also includes:

[0059] The configuration module 100 is used for configuring the PBS job scheduling system and the Galaxy biological information analysis platform in a high-performance cluster.

[0060] The configuration information of the node in this embodiment includes the number of CPUs, CPU main frequency and memory value.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a cluster job dispatching method and system. The method comprises the following steps: detecting configuration information of all nodes in a high-performance cluster; in a PBS job dispatching system, according to the detected configuration information of the nodes, respectively marking corresponding node queue attributes to the nodes; in a Gxlaxy bioinformation analysis platform, according to the operation features of analysis tools, respectively marking corresponding tool queue attributes to the analysis tools; matching corresponding targeted nodes according to the tool queue attributes of the targeted analysis tools; executing the targeted analysis tools on the targeted nodes. The high-performance cluster, the Gxlaxy bioinformation analysis platform and the PBS job dispatching system are combined, the cluster node types are classified, and aiming at the operation characteristics of different analysis tools of the Gxlaxy platform, the corresponding nodes are bound, so that the analysis tools work in the suitable nodes, and the operation efficiency of the analysis tools of the Gxlaxy platform is improved.

Description

technical field [0001] The invention relates to the field of biological information, in particular to a cluster-based job scheduling method and system. Background technique [0002] The traditional Galaxy bioinformatics analysis platform generally simply integrates the Galaxy platform with a high-performance cluster. After each tool in the platform runs, the job is directly submitted to the cluster for operation. [0003] However, each tool in the Galaxy bioinformatics analysis platform has corresponding operating characteristics and is suitable for running on different types of nodes. However, it is currently impossible to submit jobs to a specific node according to the operating characteristics of each tool, resulting in The operating efficiency of analysis tools is not high, and at the same time, it causes a waste of cluster computing resources. Contents of the invention [0004] In view of this, the main purpose of the present invention is to provide a cluster job sch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F9/48

Inventor 王荣廷

Owner INSPUR BEIJING ELECTRONICS INFORMATION IND

Cluster job dispatching method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology