Method for setting checkpoints and recovering failure process based on message number checking and non-blocking

A process recovery and checkpoint technology, applied in digital transmission systems, data exchange networks, electrical components, etc., can solve problems such as loss

Inactive Publication Date: 2011-03-16
SHANDONG UNIV

AI Technical Summary

Problems solved by technology

[0004] Usually, when a process fails under a synchronous or asynchronous checkpoint algorithm, all processes roll back to their previously saved checkpoints and re-execute; the resulting loss is the repeated computation caused by the rollback.
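To make the rollback cost concrete, the following sketch compares the recomputed work when every process rolls back with the work when only a smaller set of processes does. The function name, the process names, and the step counts are illustrative assumptions, not values taken from the patent.

```python
def recomputed_work(steps_since_checkpoint, rolled_back):
    """Total steps that the rolled-back processes must re-execute."""
    return sum(steps_since_checkpoint[p] for p in rolled_back)


# Hypothetical number of steps each process has executed since its last checkpoint.
steps = {"P0": 120, "P1": 45, "P2": 300}

# Conventional scheme: every process falls back to its saved checkpoint.
global_rollback = recomputed_work(steps, steps.keys())

# Smaller rollback set: here only the failed process P1 re-executes.
single_rollback = recomputed_work(steps, ["P1"])

print(global_rollback, single_rollback)
```

Under these assumed numbers, the difference between the two totals is the repeated computation that a smaller rollback set would avoid.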



Examples


Embodiment Construction

[0154] The present invention is further described below with reference to an embodiment.

[0155] In engineering disciplines such as computer science, the occurrence of a situation or an activity can be called an event. A distributed system is therefore regarded as an event system: driven by events, the system undergoes state transitions and generates the corresponding events or operations.

[0156] According to their effect on the distributed system, events are divided into two types, input events and output events:

[0157] 1. Input events are events that correspond to internal or external input operations of a process. The input events of a distributed system come either from the process itself or from the process's external environment, such as other processes. Such events affect the state transitions of the process itself and may also affect the state of other processes.

[0158] Input events are further divided by their source into:

[...



Abstract

The invention discloses a non-blocking method, based on message-count checking, for setting checkpoints and recovering failed processes. It addresses two problems of earlier checkpointing schemes for distributed systems: excessive control-message overhead while checkpoints are being established, and excessive loss from recomputation when a failed process forces a rollback. To deal with the large number of control messages and the process blocking that occur during checkpoint establishment, the invention replaces the qualitative, causality-based analysis used in previous studies with a quantitative analysis of message counts. Building on an extended model of the distributed system, it provides a non-blocking method for setting synchronous checkpoints and recovering crashed processes. All processes continue to run without blocking while checkpoints are set, and when a process crashes, rollback recovery is carried out with minimal system rollback, which improves the performance of the distributed system.
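The idea of judging checkpoints by message counts rather than by causal relationships can be illustrated with a minimal sketch. It is not the patented procedure, whose details are not given here; it only shows the general counting test for orphan messages. A set of checkpoints is treated as inconsistent if some receiver has recorded more messages from a sender than that sender has recorded sending. The `Checkpoint` layout and the function names are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Checkpoint:
    """Per-process message counters saved with a checkpoint (hypothetical layout)."""
    sent: dict = field(default_factory=dict)      # peer -> messages sent to that peer
    received: dict = field(default_factory=dict)  # peer -> messages received from that peer


def orphan_pairs(checkpoints):
    """Return (sender, receiver) pairs whose receive count exceeds the send count."""
    pairs = []
    for receiver, cp in checkpoints.items():
        for sender, n_received in cp.received.items():
            n_sent = checkpoints[sender].sent.get(receiver, 0)
            if n_received > n_sent:
                pairs.append((sender, receiver))
    return pairs


def is_consistent(checkpoints):
    """A checkpoint set passes the counting test if it has no orphan pairs."""
    return not orphan_pairs(checkpoints)


# Counts agree in both directions, so the counting test passes.
good = {
    "P0": Checkpoint(sent={"P1": 3}, received={"P1": 1}),
    "P1": Checkpoint(sent={"P0": 1}, received={"P0": 3}),
}

# P1 recorded 4 messages from P0, but P0 recorded sending only 3.
bad = {
    "P0": Checkpoint(sent={"P1": 3}, received={"P1": 1}),
    "P1": Checkpoint(sent={"P0": 1}, received={"P0": 4}),
}

print(is_consistent(good), orphan_pairs(bad))
```

Comparing integer counters needs no extra coordination messages in this sketch, which is one plausible reading of why a quantitative check could reduce control-message overhead.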

Description

Technical field

[0001] The invention relates to a non-blocking method, based on message-count checking, for setting checkpoints and recovering failed processes.

Background

[0002] Logically, a distributed system can be regarded as a set of independent processes with complex interactions. These processes communicate by exchanging messages in order to share system resources and cooperate on a task. Common faults in distributed systems include crash faults, omission faults, timing faults, response faults, and arbitrary faults. Faults can be handled in hardware or in software; the software solutions mainly comprise active replication and passive replication. Passive replication can use either forward recovery or backward recovery: forward recovery assumes that the nature of the fault can be accurately determined and that such faults can be eliminated so that the system...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L12/24, H04L12/26, H04L12/56, H04L29/08, H04L12/801
Inventor: 高胜法, 左凤艳, 石传青, 刘冬波
Owner SHANDONG UNIV