A fault-tolerant cluster system and method based on message logs

The invention concerns cluster systems and messaging technology, applied in transmission systems, digital transmission systems, error prevention, and related fields. It addresses the system performance overhead of existing fault-tolerance approaches: it avoids that overhead, makes it easy to add load balancing, and reduces the impact of node failures.

Active Publication Date: 2008-03-19
ZTE CORP

AI Technical Summary

Problems solved by technology

[0005] The technical problem solved by the present invention is to overcome a defect of current fault-tolerant methods based on message logs: they rely too heavily on additional storage devices or computing nodes in the cluster.




Embodiment Construction

[0040] The implementation of the technical solutions of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0041] FIG. 1 is a structural diagram of a cluster system implementing the fault-tolerant functions of the present invention. In the figure:

[0042] A cluster system containing n computing nodes runs m processes, and all computing node failures are fail-stop: when a node fails, the other nodes detect its failure immediately. A process running on a node can be described as a two-tuple P = (pm, bk), where pm and bk denote the primary version and the backup version of the process, respectively. Since each process in this embodiment has only one corresponding copy, the fault-tolerant model tolerates only a single point of failure. If more process copies are used, the model can be extended to multi-point failure. The fault-tolerant method of the present invention is completely ba...
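The two-tuple model in [0042] can be illustrated with a minimal sketch. It assumes that pm and bk are read as the nodes hosting the primary and backup copies of a process; the round-robin placement and the names `Process`, `place`, and `fail_node` are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Process:
    """Two-tuple P = (pm, bk), stored here as the node index hosting each copy."""
    name: str
    pm: int            # node running the primary version
    bk: Optional[int]  # node holding the backup version, None once it is lost


def place(m: int, n: int) -> List[Process]:
    """Hypothetical round-robin placement: the backup sits on the next node."""
    if n < 2:
        raise ValueError("a backup needs a second node")
    return [Process(f"P{i}", i % n, (i + 1) % n) for i in range(m)]


def fail_node(procs: List[Process], failed: int) -> None:
    """Apply a fail-stop failure of one node to every process."""
    for p in procs:
        if p.pm == failed:
            if p.bk is None:
                raise RuntimeError(f"{p.name} lost both copies")
            p.pm, p.bk = p.bk, None  # backup takes over on a surviving node
        elif p.bk == failed:
            p.bk = None              # primary keeps running without a backup


procs = place(m=4, n=3)
fail_node(procs, failed=1)
print([(p.name, p.pm, p.bk) for p in procs])
```

With one copy per process, the first node failure is absorbed by promoting backups, while a second failure that hits a process with no remaining backup raises an error. This mirrors the single-point-failure limit stated above.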



Abstract

The invention discloses a fault-tolerant cluster system and method based on message logs. A backup process stores the checkpoint and message log, and the checkpoint and message log are simultaneously recorded in memory on the message sender side, so the system avoids the overhead of synchronous log recording without adding extra reliable equipment. This not only reduces the cost of log recording but also eliminates the dependency on a stable storage medium. The invention uses no reliable storage equipment for checkpoints and logs, and it does not depend on spare computing nodes to replace a failed node during recovery: processes continue running on the remaining nodes without being restarted. The system can also readily add a load balancing function, effectively reducing the impact of a node failure on the entire system.
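As a rough illustration of the idea in the abstract, the sketch below keeps the message log in the sender's memory and rebuilds a failed receiver from a checkpoint plus replayed messages. It is a simplified single-sender sketch under stated assumptions, not the patented method: the sequence numbers, the additive state, and the names `Sender`, `Worker`, and `recover` are all hypothetical.

```python
from typing import Dict, List, Tuple

Message = Tuple[int, int]  # (sequence number, payload); both are assumptions


class Sender:
    """Keeps every sent message in its own memory instead of on stable storage."""

    def __init__(self) -> None:
        self.seq = 0
        self.log: List[Message] = []  # in-memory, sender-side message log

    def send(self, receiver: "Worker", payload: int) -> None:
        self.seq += 1
        msg = (self.seq, payload)
        self.log.append(msg)          # log first, then deliver
        receiver.deliver(msg)

    def replay(self, after_seq: int) -> List[Message]:
        """Messages the receiver had not yet covered by a checkpoint."""
        return [m for m in self.log if m[0] > after_seq]


class Worker:
    """Receiver whose state is simply the sum of the payloads it has seen."""

    def __init__(self, state: int = 0, last_seq: int = 0) -> None:
        self.state = state
        self.last_seq = last_seq

    def deliver(self, msg: Message) -> None:
        seq, payload = msg
        if seq <= self.last_seq:      # ignore duplicates during replay
            return
        self.state += payload
        self.last_seq = seq

    def checkpoint(self) -> Dict[str, int]:
        """Snapshot that would be handed to the backup process."""
        return {"state": self.state, "last_seq": self.last_seq}


def recover(ckpt: Dict[str, int], sender: Sender) -> Worker:
    """Rebuild the worker from the backup's checkpoint plus the sender's log."""
    worker = Worker(ckpt["state"], ckpt["last_seq"])
    for msg in sender.replay(worker.last_seq):
        worker.deliver(msg)
    return worker


sender, primary = Sender(), Worker()
sender.send(primary, 1)
sender.send(primary, 2)
ckpt = primary.checkpoint()           # held by the backup process
sender.send(primary, 3)
sender.send(primary, 4)
restored = recover(ckpt, sender)      # primary's node has failed
print(restored.state, restored.last_seq)
```

Because the log lives in the sender's memory, no synchronous write to stable storage happens on the send path. The trade-off in this sketch is that the log is lost if the sender and the receiver fail together, which is consistent with a single-failure model.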

Description

Technical field

[0001] The invention relates to a fault-tolerant cluster system and method in the computer field, and in particular provides an efficient fault-tolerant system and method based on message logs for cluster environments without reliable storage devices or standby computing nodes.

Background art

[0002] With the rapid development of network and computing technologies, network services and application services have become increasingly large and complex, and cluster systems have come into wide use. These cluster systems often contain a large number of computing nodes, which are prone to frequent local failures. Without a fault-tolerant method, it is difficult for a cluster system to ensure long-term normal operation. Saving the process state and inter-process communication messages is an effective fault-tolerant means. When the cluster system encounters a failure, calling the previously saved checkpoint and message log can help the process recover to its st...


Application Information

IPC(8): H04L12/24, H04L12/26, H04L1/22
Inventors: 王继刚, 谢世波, 李翌
Owner: ZTE CORP