Method and system for distributing data in high-performance computer cluster

一种高性能计算、数据的技术,应用在计算机领域,能够解决作业启动时间长等问题,达到分发时间减少的效果

Inactive Publication Date: 2012-06-06
INT BUSINESS MASCH CORP
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

A job starts from the allocation of resources by the central control node to the calculation of each computing node. This period is called the job startup time. Obviously, the above-mentioned transmission of the network connection table of a job is too long, which will cause the problem that the job startup time is too long

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for distributing data in high-performance computer cluster
  • Method and system for distributing data in high-performance computer cluster
  • Method and system for distributing data in high-performance computer cluster

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Preferred embodiments of the invention will be described in more detail with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. However, the present invention may be embodied in various forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

[0028] Since a high-performance computing cluster contains a large number of nodes, and the jobs running in the cluster generally use most or all of the nodes in the cluster, a network that includes connection information of all nodes involved in running a job The connection table is usually very large, for example, the example given in the background art is 50M bytes. The transmission of such a large network connection table takes a long time; for a high-performance computing cluster, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a system for distributing data in a high-performance computer cluster. The high-performance computer cluster comprises a central control node and M computing nodes, wherein M is an integer more than or equal to 2, and the appointed data is distributed to the M computing nodes by the central control node. The method comprises the following steps: dividing the M computing nodes into m layers, wherein m is the integer more than or equal to 2; dividing the appointed data into k parts, wherein k is the integer more than or equal to 2; distributing the k parts of data to a first layer of computing nodes as sub-nodes of the central control node by the central control node; obtaining at least one part of the data by each computing node in the first layer; distributing the at least one part of data, which is distributed by a father node of the computing nodes, to the sub-computing nodes of the computing nodes; and asking the other computing nodes for the rest of the appointed data by the computing nodes, thereby obtaining all the appointed data. By utilizing the method and the system, the data can be quickly distributed to all the computing nodes in the high-performance computer cluster.

Description

technical field [0001] The present invention generally relates to the computer field, and specifically relates to a method and system for distributing data in a high-performance computing cluster. Background technique [0002] A computer cluster, referred to as a cluster, is a computer system in which multiple computers (also referred to as computing resources, such as software and / or hardware resources) are connected to cooperate to complete computing tasks. These computing resources are located in the same management domain, which has a unified management policy and provides services to users as a whole. Individual computers in a cluster system are often called nodes. Cluster systems have many advantages. For example, when the cluster system works in a load balancing manner, it can achieve higher efficiency by having multiple computers complete the same work. [0003] High-Performance Computing (High-Performance Computing) is a branch of computer science, which is dedic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/46G06F9/50H04L29/08
CPCG06F9/5066G06F9/3891G06F15/161
Inventor 李广磊贺军李获鼎陈琦刘伟
Owner INT BUSINESS MASCH CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products