Cloud platform failure recovery method and system

A cloud platform fault recovery technology, applied in the field of cloud computing, that can solve problems such as system complexity and achieve the effect of low latency requirements.

Active Publication Date: 2013-09-11
四川电子科技大学教育发展基金会

AI Technical Summary

Problems solved by technology

However, "soft" failures can have many causes. For example, slow execution may stem from server hardware failures, network failures, disk failures, or operating system software failures. Checking each possible cause one by one would make the system too complicated.



Embodiment Construction

[0036] The technical solutions of the present invention are further described below with reference to the accompanying drawings and specific embodiments.

[0037] The flow of the first embodiment of the cloud platform failure recovery method of the present invention is shown in Figure 1:

[0038] Step 101: perform fault detection on the server according to task execution time and disk read/write speed.

[0039] Step 102: calculate the server failure rate of the system based on the result of the fault detection.

[0040] Step 103: if the server failure rate is less than the preset threshold, automatically recover the server; otherwise, prohibit automatic recovery of the server.
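Steps 101 to 103 can be illustrated with a minimal Python sketch. It is not the patented implementation: the baseline values, the slowdown factors, the `ServerSample` type, and all function names are hypothetical, and the sketch assumes that "server failure rate of the system" means the fraction of servers flagged as faulty, which the text above does not spell out.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ServerSample:
    """One observation of a server (hypothetical type for illustration)."""
    name: str
    task_seconds: float    # observed execution time of a reference task
    disk_mb_per_s: float   # observed disk read/write throughput


# Assumed baselines and tolerances; the source gives no concrete values.
BASELINE_TASK_SECONDS = 10.0
BASELINE_DISK_MB_PER_S = 100.0
SLOW_TASK_FACTOR = 3.0      # task counts as slow if it takes over 3x the baseline
SLOW_DISK_FRACTION = 0.5    # disk counts as slow below 50% of the baseline


def is_faulty(sample: ServerSample) -> bool:
    """Step 101: detect a fault from task execution time and disk speed."""
    slow_task = sample.task_seconds > SLOW_TASK_FACTOR * BASELINE_TASK_SECONDS
    slow_disk = sample.disk_mb_per_s < SLOW_DISK_FRACTION * BASELINE_DISK_MB_PER_S
    return slow_task or slow_disk


def server_failure_rate(samples: List[ServerSample]) -> float:
    """Step 102: fraction of servers flagged as faulty (assumed definition)."""
    if not samples:
        return 0.0
    return sum(is_faulty(s) for s in samples) / len(samples)


def plan_recovery(samples: List[ServerSample],
                  threshold: float) -> Tuple[bool, List[str]]:
    """Step 103: allow automatic recovery only while the rate is below threshold.

    Returns (auto_recovery_allowed, names_of_faulty_servers).
    """
    rate = server_failure_rate(samples)
    faulty = [s.name for s in samples if is_faulty(s)]
    return rate < threshold, faulty


if __name__ == "__main__":
    cluster = [ServerSample(f"s{i}", 10.0, 100.0) for i in range(9)]
    cluster.append(ServerSample("slow-disk", 10.0, 10.0))
    allowed, faulty = plan_recovery(cluster, threshold=0.2)
    print(allowed, faulty)
```

With one slow server out of ten and an assumed threshold of 0.2, the sketch allows automatic recovery of the flagged server. If half of the servers were flagged, the rate would be at or above the threshold and automatic recovery would be prohibited.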

[0041] This embodiment proposes a cloud platform fault recovery method. By detecting the server and disk access time, a copy backup is added for the problem disk, so as to realize the hardware reliability at a macro level and preven...


Abstract

The invention discloses a cloud platform failure recovery method and system. The method includes the following steps: fault detection is performed on a server based on task execution time and disk read/write speed; the server failure rate of the system is calculated from the result of the fault detection; if the server failure rate is less than a preset threshold, the server is recovered automatically; otherwise, automatic recovery of the server is prohibited. Because hardware failures are detected from macro-level phenomena and an effective automatic failure recovery strategy is applied, the macroscopic hardware reliability of the cloud platform is guaranteed, and hardware-related problems are prevented from adversely affecting user experience.

Description

Technical field

[0001] The present invention relates to the field of cloud computing, in particular to a cloud platform failure recovery method and system.

Background

[0002] With the rapid development of cloud platforms, their large-scale application has come into view. As the scale of cloud computing expands, unreliable hardware has become the most basic challenge for a cloud platform. Once a cluster reaches thousands of machines, events that are low-probability on a single machine become inevitable and frequent. Downtime caused by failures of hard disks, hard disk controllers, CPUs, memory, motherboards, power supplies, etc. occurs every day. We call this type of hardware failure a "hard" failure (fail-stop failure). In addition, there is a less obvious class of failures called "soft" failures, e.g. disks are accessible but only 1/10th as fast as normal, servers are not down but programs run ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): H04L12/24, H04L29/08
Inventor: 戴元顺
Owner: 四川电子科技大学教育发展基金会