Failure detection method and device for nodes in cluster system

A cluster system and fault detection technology, applied in the field of communication, can solve the problem of long cycle of node fault detection

Active Publication Date: 2017-01-04
HUAWEI TECH CO LTD
View PDF9 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention provides a node fault detection method and device in a cluster system, which is used to solve the problem in the prior art that node f

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Failure detection method and device for nodes in cluster system
  • Failure detection method and device for nodes in cluster system
  • Failure detection method and device for nodes in cluster system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0075] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0076] The embodiment of the present invention is applicable to a cluster system, and it is specifically applicable to the scene of node fault detection in a distributed cluster system. The distributed cluster system includes at least two nodes, and the nodes may be computers, for example. Optionally, the difference between the nodes in the clust...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a failure detection method and device for nodes in a cluster system. The method comprises the following steps that: a first node judges whether a first heartbeat message sent by a second node is received in preset time or not, wherein the first node is a neighbor node of the second node, and the first heartbeat message is a heartbeat message which is sent to each neighbor node of the second node in parallel by the second node; under the condition that the first node does not receive the heartbeat message sent by the second node, the first node sends a request message to other neighbor nodes other than the first node in all the neighbor nodes of the second node; the first node receives response messages carrying reception states sent by the other neighbor nodes; and under the condition that the first node determines that other neighbor nodes do not receive the heartbeat message according to the reception states, the first node determines that the second node fails. Through adoption of the failure detection method and device for the nodes in the cluster system, the node failure detection efficiency can be increased.

Description

technical field [0001] Embodiments of the present invention relate to communication technologies, and in particular to a method and device for detecting node faults in a cluster system. Background technique [0002] In a distributed cluster system, it usually includes a central node and multiple ordinary nodes. When the central node or ordinary nodes fail, it will have a great impact on the reliability of the distributed cluster system. Therefore, how to effectively implement node fault detection is very important. [0003] figure 1 is a schematic diagram of a node fault detection method in the prior art, such as figure 1 As shown, the ordinary nodes (B, C, D, E) send heartbeat messages to the central node (M) according to the heartbeat period, and the central node (M) detects the common Whether the node is faulty, wherein, one detection cycle can include multiple heartbeat cycles. At the same time, the central node (M) can also periodically send heartbeat messages to or...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L12/24H04L12/26
CPCH04L41/0631H04L43/10H04L41/00
Inventor 胡琳伍湘平彭佩星
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products