Cluster monitoring and early warning method

A cluster, real-time monitoring technology, applied in the direction of digital transmission system, electrical components, transmission system, etc., can solve the problem of not supporting single-point failure processing, not supporting single-point failure processing, and the lack of generality of cluster monitoring technology.

Inactive Publication Date: 2012-10-31
CHINA UNIV OF PETROLEUM (EAST CHINA)
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, Ganglia does not support the handling of single point of failure, that is, when the server fails, manual handling is required
At the same time, due to the rapid development of the Internet in recent years, the size of the cluster has far exceeded 2000 nodes, and the monitoring performance of Ganglia cannot get a timely response as the scale of the cluster expands.
At present, the cluster monitoring technology is designed for a specific cluster platform, which leads to the fact that the cluster monitoring technology does not have certain versatility. At the same time, the traditional monitoring technology does not support single-point failure processing and cannot provide early warning solutions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cluster monitoring and early warning method
  • Cluster monitoring and early warning method
  • Cluster monitoring and early warning method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described below in conjunction with an example of an oilfield research institute having a cluster scale of 2000 nodes.

[0047] (1) Fleet grouping

[0048] According to the formula (1), the fleet is divided into (group), then according to formula (2), get the number of nodes in each group The number of redundant nodes is evenly distributed to random groups. Each group has a server called Group. There are 20 Groups and 20 Secondary Groups in the cluster. The remaining nodes are used as ControlNodes. All the nodes below are responsible for collecting the static information in Table 1 and the dynamic information in Table 2 by the Agent.

[0049] (2) Environment construction

[0050] According to the above grouping method, the following specific environment is obtained:

[0051] ControlNode (1): cp2001, IP address: 168.173.2.1

[0052] Group (20): cp2002~cp2021, IP address: 168.173.2.2~168.173.2.21

[0053] SecondaryGroup (20): ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cluster monitoring and early warning method. The grouping mechanism is adopted to adapt to clusters in different scales and respond to large-scale clusters in real time, a topological structure is adopted to solve Group single point of failures, and monitoring and early warning are combined to monitor the clusters in real time. Data collected by a monitor are analyzed in real time and compared with performance index of a system, and once a certain datum is found to surpass the threshold of the performance index, the datum is sent to a user in short message mode to inform the user to solve failures timely.

Description

Technical field: [0001] The invention relates to a cluster monitoring and early warning method, especially adopting a grouping mechanism to adapt to clusters of different scales and real-time response to large-scale clusters, and using a topology structure to solve single-point failures of Groups, and adopting a method of combining monitoring and early warning To achieve the purpose of real-time monitoring of the user cluster. Background technique: [0002] In the traditional cluster monitoring system, the open source project Ganglia has achieved a good monitoring of the cluster size with 2000 nodes. Ganglia is a cross-platform scalable distributed monitoring system under high-performance computing systems. It is based on a hierarchical design, utilizing well-designed data structures and algorithms to achieve low concurrency between nodes. However, Ganglia does not support the handling of single point of failure, that is, when the server fails, manual handling is required....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/24
Inventor 俞辉高传俊
Owner CHINA UNIV OF PETROLEUM (EAST CHINA)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products