Unlock instant, AI-driven research and patent intelligence for your innovation.

Cluster job dispatching method and system

A job scheduling and clustering technology, applied in the field of biological information, can solve the problems of inefficient operation of analysis tools, waste of cluster computing resources, and characteristics of tool operation.

Active Publication Date: 2016-01-20
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, each tool in the Galaxy bioinformatics analysis platform has corresponding operating characteristics and is suitable for running on different types of nodes. However, it is currently impossible to submit jobs to a specific node according to the operating characteristics of each tool, resulting in The operating efficiency of analysis tools is not high, and at the same time, it causes a waste of cluster computing resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cluster job dispatching method and system
  • Cluster job dispatching method and system
  • Cluster job dispatching method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0036] The invention provides a cluster job scheduling method, figure 1 A flow chart of Embodiment 1 of the cluster job scheduling method of the present invention is shown, including:

[0037] Step S101: Detect configuration information of all nodes in the high-performance cluster;

[0038] First, in the high-performance cluster, install and configure the PBS job scheduling system, install the Galaxy bioinformatics analysis platform, and install analysis tools such as gene sequencing tools to detect the configuration information of all nodes in the HPC high-performance cluster, including the number of CPUs, CPU frequency, and memory size and other configuration information, such as detecting the configuration information of node1 in the cluster, and detecting that the configuration of node1 is: fat node, high frequency, and large memory.

[0039] Step S102: In the PBS job scheduling system, according to the detected configuration information of each of the nodes, mark the cor...

Embodiment 2

[0050] The present invention also provides a cluster job scheduling system, Figure 4 A schematic structural diagram of Embodiment 2 of the cluster job scheduling system of the present invention is shown, including:

[0051] Configuration information acquisition module 101, configured to detect configuration information of all nodes in the high-performance cluster;

[0052]The node queue attribute marking module 102 is used to mark the corresponding node queue attributes for each of the nodes according to the detected configuration information of each of the nodes in the PBS job scheduling system;

[0053] The tool queue attribute marking module 103 is used to mark the corresponding tool queue attributes for each analysis tool according to the operating characteristics of each analysis tool in the Gxlaxy biological information analysis platform;

[0054] A matching module 104, configured to match the corresponding target node according to the tool queue attribute of the targe...

Embodiment 3

[0058] Figure 5 It shows a schematic structural diagram of Embodiment 3 of the cluster job scheduling system of the present invention, corresponding to Figure 4 ,Also includes:

[0059] The configuration module 100 is used for configuring the PBS job scheduling system and the Galaxy biological information analysis platform in a high-performance cluster.

[0060] The configuration information of the node in this embodiment includes the number of CPUs, CPU main frequency and memory value.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cluster job dispatching method and system. The method comprises the following steps: detecting configuration information of all nodes in a high-performance cluster; in a PBS job dispatching system, according to the detected configuration information of the nodes, respectively marking corresponding node queue attributes to the nodes; in a Gxlaxy bioinformation analysis platform, according to the operation features of analysis tools, respectively marking corresponding tool queue attributes to the analysis tools; matching corresponding targeted nodes according to the tool queue attributes of the targeted analysis tools; executing the targeted analysis tools on the targeted nodes. The high-performance cluster, the Gxlaxy bioinformation analysis platform and the PBS job dispatching system are combined, the cluster node types are classified, and aiming at the operation characteristics of different analysis tools of the Gxlaxy platform, the corresponding nodes are bound, so that the analysis tools work in the suitable nodes, and the operation efficiency of the analysis tools of the Gxlaxy platform is improved.

Description

technical field [0001] The invention relates to the field of biological information, in particular to a cluster-based job scheduling method and system. Background technique [0002] The traditional Galaxy bioinformatics analysis platform generally simply integrates the Galaxy platform with a high-performance cluster. After each tool in the platform runs, the job is directly submitted to the cluster for operation. [0003] However, each tool in the Galaxy bioinformatics analysis platform has corresponding operating characteristics and is suitable for running on different types of nodes. However, it is currently impossible to submit jobs to a specific node according to the operating characteristics of each tool, resulting in The operating efficiency of analysis tools is not high, and at the same time, it causes a waste of cluster computing resources. Contents of the invention [0004] In view of this, the main purpose of the present invention is to provide a cluster job sch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/48
Inventor 王荣廷
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND