Fault tolerance method and system used for distributed program

A fault-tolerant system and distributed technology, applied in the field of fault-tolerant methods and systems of distributed programs, can solve the problems of consuming administrator's time and energy, difficult to deal with distributed program status monitoring tasks, etc.

Active Publication Date: 2010-07-14
北京讯鸟软件有限公司
View PDF0 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But a new problem is that the existing native process status monitoring program has been difficult to cope with the status monitoring task of distributed programs
Therefore, when a distributed program makes an error, the administrator must go to each computer in the distributed computer system to check the error situation of the process separately, which will consume a lot of time and energy for the administrator.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fault tolerance method and system used for distributed program
  • Fault tolerance method and system used for distributed program
  • Fault tolerance method and system used for distributed program

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Hereinafter, specific embodiments of the present invention will be described with reference to the accompanying drawings.

[0017] Figure 1A is a block diagram of a fault-tolerant system for distributed programs according to the present invention. As shown in Figure 1, the fault-tolerant system includes a fault-tolerant server 10, a fault-tolerant client 20, and a fault-tolerant client 30. All fault-tolerant servers and fault-tolerant clients are connected together through a network, and the network includes but is not limited to a local area network, WAN, etc. And, it should be understood that the configuration described here is only for the purpose of illustration, and the system may include any number of fault-tolerant servers and fault-tolerant clients.

[0018] The fault-tolerant server includes a communication module 102 , a policy specification module 104 , a policy execution module 106 , a policy database 108 , and a process dependency database 110 . The comm...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention aims to provide a fault tolerance method and a system used for a distributed program, which can start program processes disposed on different fault tolerance clients according to a sequence. When any one process with a dependency relationship is collapsed, a fault tolerance server can execute the corresponding operation according to a strategy configured on the fault tolerance server, and the fault tolerance clients can start the processes according to the sequence as long as the fault tolerance clients utilize a traditional monitoring program to execute monitoring, take charge of reporting the process status and receive and execute an instruction transmitted by the fault tolerance server or issued by the operation maintenance personnel so as to ensure that the operation canbe normally carried out. The system is provided with a fault tolerance server and at least one fault tolerance client, wherein the fault tolerance clients monitor the status of the processes of the distributed program run by the fault tolerance clients per se. When the abnormity of the status is monitored, the fault tolerance clients inform the fault tolerance server, and the fault tolerance server utilizes a strategy execution module to execute restarting fault-tolerance treatment according to an automatic restarting strategy or a manual restarting strategy and the interprocess dependency relationship specified in a process dependency relationship form.

Description

technical field [0001] The present application relates to a fault-tolerant method and system, in particular, to a fault-tolerant method and system for distributed programs. Background technique [0002] A monitoring program is a program that monitors the running status of a computer process. When the monitoring program determines that the monitored computer process is not running or has an abnormal situation, the monitoring program tries to start or restart the monitored computer process. Up to now, this kind of monitoring program can only monitor the computer process of the machine, and it operates the processes on the machine in order. If one of the processes is abnormal, it will start the monitored process according to the startup sequence. [0003] However, with the increasing demand for computer operating capabilities, many tasks can no longer be handled by a single computer. In this context, distributed programs emerged as the times require. Because of their advantage...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/36
Inventor 不公告发明人
Owner 北京讯鸟软件有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products