A stateful cluster recovery method, device, equipment and readable storage medium

A stateful cluster recovery technique, applied in digital transmission systems, data exchange networks, electrical components, etc., that addresses problems such as agent incompatibility, tight coupling between agent implementations and business modules, and the complexity of the pacemaker state machine.

Active Publication Date: 2021-07-13
HANGZHOU DT DREAM TECH

AI Technical Summary

Problems solved by technology

[0004] Disadvantage 1: pacemaker depends on corosync, which is less stable under network turbulence and adds to the complexity of the system.
[0005] Disadvantage 2: pacemaker requires an agent to be configured for each business module it manages, and every agent is implemented differently. When a business module is upgraded, its agent may no longer be compatible.
[0006] Disadvantage 3: pacemaker suits business modules that are started natively, but it cannot manage containerized business modules.
[0007] Disadvantage 4: the pacemaker state machine is complex, and agent implementations are tightly coupled to the business modules, which makes maintenance difficult.


Examples

Embodiment 1

[0059] Refer to Figure 2, a flowchart of a stateful cluster recovery method in an embodiment of the present invention. The method can be applied to each node in the stateful cluster and includes the following steps:

[0060] S101. After the target node restarts, obtain the identity file recorded by the distributed coordination service.

[0061] Note that the target node in this embodiment may be any node in the stateful cluster. The restart may be caused by a power loss, a fault, or a planned restart. It may be a single-node restart, or it may occur as part of a restart of the entire stateful cluster; in the latter case, every node is treated as a target node and executes the recovery method provided by this embodiment. Among them, statefu...
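
Step S101 can be illustrated with a minimal sketch. It assumes a plain in-memory dictionary in place of a real distributed coordination service, and the key name `/cluster/identity` and the `master_id` field are hypothetical; the source does not specify how the identity file is stored or laid out.

```python
import json
from typing import Optional

# Hypothetical key for the identity file; the source does not name it.
IDENTITY_KEY = "/cluster/identity"


class InMemoryCoordinationService:
    """Stand-in for a distributed coordination service (a dict, not a real client)."""

    def __init__(self):
        self._store = {}

    def put(self, key: str, value: str) -> None:
        self._store[key] = value

    def get(self, key: str) -> Optional[str]:
        return self._store.get(key)


def fetch_identity_file(service: InMemoryCoordinationService) -> Optional[dict]:
    """S101: after a restart, read the identity file recorded by the service."""
    raw = service.get(IDENTITY_KEY)
    return json.loads(raw) if raw is not None else None


service = InMemoryCoordinationService()
# Recorded while the cluster was running, before the restart (assumed layout).
service.put(IDENTITY_KEY, json.dumps({"master_id": "node-1"}))
identity = fetch_identity_file(service)
print(identity)
```

Returning `None` when no record exists keeps the "no identity file yet" case explicit, so a caller can decide how to handle a first boot separately from a restart.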

Embodiment 2

[0097] To help those skilled in the art better understand the technical solution provided by the embodiments of the present invention, it is described in detail below, taking ETCD as the distributed coordination service.

[0098] Note that this embodiment presupposes that the load-balancing function of the SLB is disabled, so all business processing falls on the master node. This places limits on the overall system scale and business load, so the approach is suitable only for small and medium-sized business architectures. Each business service can be restarted automatically after an abnormal exit by systemd or docker. When the cluster is restarted, the previous master node must be able to start normally. For example, suppose the cluster contains nodes 1, 2, and 3, of which node 1 is the master. After the cluster restarts as a whole, node 1 must be in place, ...

Embodiment 3

[0148] Corresponding to the above method embodiment, an embodiment of the present invention also provides a stateful cluster recovery device. The descriptions of the device below and of the method above may be read with reference to each other.

[0149] As shown in Figure 10, the device includes the following modules:

[0150] The identity file obtaining module 101, used to obtain the identity file recorded by the distributed coordination service after the target node restarts;

[0151] The identity judging module 102, used to determine the master node identifier from the identity file and to judge whether it is the same as the local node identifier;

[0152] The master identity determination module 103, used, if it is, to obtain the distributed lock of the distributed coordination service and to set the VIP of the stateful cluster to provide external access services i...

Abstract

The invention discloses a stateful cluster recovery method. The method comprises the following steps: after the target node restarts, the identity file recorded by the distributed coordination service is obtained; the master node identifier is determined from the identity file, and it is judged whether the master node identifier is the same as the local node identifier; if so, the distributed lock of the distributed coordination service is obtained, and the VIP through which the stateful cluster provides external access services is set on the local network card; if not, after the master node has obtained the distributed lock, the node joins the stateful cluster as a slave node and joins the queue of applicants for the master identity. The data integrity of the stateful cluster can thus be guaranteed while the cluster is running, when the cluster is restarted, and when a single node is restarted. The invention also discloses a stateful cluster recovery device, equipment and readable storage medium, which have corresponding technical effects.
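
The decision flow in the abstract can be sketched as a small simulation. This is an illustration under stated assumptions, not the patented implementation: `FakeCoordinationService` is an in-memory stand-in rather than a real etcd client, binding the VIP is reduced to a boolean flag, and the names `Node`, `recover`, `master_queue`, and the intermediate `"waiting"` role are hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Optional


@dataclass
class FakeCoordinationService:
    """In-memory stand-in for the coordination service (not a real etcd client)."""
    identity_file: dict
    lock_holder: Optional[str] = None
    master_queue: Deque[str] = field(default_factory=deque)

    def try_acquire_lock(self, node_id: str) -> bool:
        # The lock is granted if it is free or already held by the caller.
        if self.lock_holder in (None, node_id):
            self.lock_holder = node_id
            return True
        return False


@dataclass
class Node:
    node_id: str
    role: str = "unknown"
    vip_bound: bool = False

    def recover(self, svc: FakeCoordinationService) -> str:
        """Run one recovery attempt after a restart and return the resulting role."""
        master_id = svc.identity_file["master_id"]
        if master_id == self.node_id:
            # Previous master: take the lock, then expose the VIP on the local NIC.
            if svc.try_acquire_lock(self.node_id):
                self.vip_bound = True  # placeholder for real VIP configuration
                self.role = "master"
        elif svc.lock_holder == master_id:
            # The master already holds the lock: join as a slave and queue up.
            self.role = "slave"
            svc.master_queue.append(self.node_id)
        else:
            # Assumed behavior: the master is not ready yet, so retry later.
            self.role = "waiting"
        return self.role


svc = FakeCoordinationService(identity_file={"master_id": "node-1"})
nodes = {name: Node(name) for name in ("node-1", "node-2", "node-3")}

# node-2 comes up first but must wait until the previous master holds the lock.
first_try = nodes["node-2"].recover(svc)
roles = [nodes[n].recover(svc) for n in ("node-1", "node-2", "node-3")]
print(first_try, roles, list(svc.master_queue))
```

In this sketch a non-master node never takes the lock during recovery, which mirrors the ordering in the abstract: slaves join only after the previous master has obtained the distributed lock.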

Description

Technical field

[0001] The invention relates to the field of computer application technology, in particular to a stateful cluster recovery method, device, equipment and readable storage medium.

Background

[0002] In cloud computing, big data, artificial intelligence and other IT systems, many key services store the core data of the business, and their normal operation is a prerequisite for the stable operation of the system. Such services are deployed redundantly across nodes, which form a cluster that provides service externally as a single unit. Services with mutable data are called stateful services. Examples include the mariadb cluster based on Galera technology as a database service, the active-standby clusters of ovn-db and mongo, and the active-standby cluster of rabbitmq-server as a message forwarding service. When the node providing the service is abnormal (such as power outage, network abnormality), the services of other ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04L29/08, H04L12/24, H04L29/06, H04L29/12
CPC: H04L41/0668, H04L41/50, H04L41/5041, H04L69/163, H04L61/5007, H04L67/51, H04L67/01
Inventor: 杜鹏飞
Owner: HANGZHOU DT DREAM TECH