Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method, device and system for handling failures in at least one distributed cluster

A distributed cluster and fault technology, applied in the transmission system, digital transmission system, data exchange network, etc., can solve the problem of long detection time of Master node faults

Active Publication Date: 2019-06-21
HUAWEI TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, there are the following disadvantages in the above fault detection. The Salve node usually considers that the Master node has failed when it has judged that it has not received the heartbeat message from the Master node for many times, and then initiates the process of determining a new node in the cluster. The election strategy of the Master node, therefore, in the current technology, the failure detection time of the Master node is too long

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for handling failures in at least one distributed cluster
  • Method, device and system for handling failures in at least one distributed cluster
  • Method, device and system for handling failures in at least one distributed cluster

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0136] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0137] In order to facilitate the understanding of the data processing solutions provided by the embodiments of the present invention, the following concepts are first introduced:

[0138] heartbeat message

[0139] A Heartbeat Message is a message sent by a source to a receiver that allows the receiver to determine if and when the source fails or terminates. Usually, the heartbeat message is sent from the start of the sending source until the sending...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention provide a method, device and system for handling faults in at least one distributed cluster, at least one distributed cluster includes a first distributed cluster, and the first distributed cluster includes a first Master node and a first Slave A node, a first reference node, and a first standby node serving as a backup of the first Master node, the first standby node receiving a message sent by the first reference node including a message indicating that the first reference node and the first Master node are in a disconnected state The heartbeat message of the first indication information; the first backup node determines that the first reference node and the first Master node are in a disconnected state according to the first indication information; the first backup node sends a message to the first backup node after detecting that the first Master node In the case that the heartbeat message is interrupted, it is determined that the first backup node is also in a disconnected state with the first Master node; the first backup node determines that the first Master node has failed. In the embodiment of the present invention, the fault detection time can be effectively shortened.

Description

technical field [0001] Embodiments of the present invention relate to the field of cluster management, and more specifically, relate to a method, device and system for handling faults in at least one distributed cluster. Background technique [0002] At present, most high availability (High Available, referred to as "HA") distributed clusters are usually centralized with one master (Master node) and multiple slaves (Slave nodes). Among them, the Master node sends heartbeat messages to all Slave nodes in the cluster , each Salve node in the cluster also sends a heartbeat message to the Master node. The Slave node judges whether the Master node fails by detecting the heartbeat message sent by the Master node, and the Master node judges whether the Salve node fails by detecting the heartbeat message sent by the Slave node. [0003] However, there are the following disadvantages in the above failure detection. The Salve node usually considers that the Master node has failed whe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/24H04L1/22H04L69/40
CPCH04L43/00H04L67/145H04L69/40H04L65/40H04L41/0668H04L41/0677H04L43/0817H04L43/10
Inventor 袁健清倪绍基
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products