Online identification method and system for high-frequency consecutive failure tasks in cloud computing system

An online identification method, applied in the field of cloud computing, that addresses the waste of system resources, the added cluster scheduler load, and the harm to the cloud computing system caused by repeatedly failing tasks. It avoids that resource waste and scheduling load and improves the reliability and availability of the system.

Active Publication Date: 2015-12-23
PEKING UNIV
3 cites, 0 cited by

AI Technical Summary

Problems solved by technology

Existing technology offers no effective method for identifying high-frequency consecutive failure tasks.
Although the system reschedules such a task immediately after each failure, a restart does not recover it quickly; the task simply fails again after every rescheduling.
These repeated failures waste a large amount of system resources and increase the load on the cluster scheduler, which can harm the cloud computing system and makes its high-availability requirements difficult to meet.

Detailed Description of Embodiments

[0069] The present invention is further described below through an embodiment, with reference to the accompanying drawings. The embodiment does not limit the scope of the invention in any way.

[0070] Figure 1 is a flow chart of the online identification method for high-frequency consecutive failure tasks in a cloud computing system provided by an embodiment of the present invention. Figure 2 is a structural block diagram and data processing flow chart of the corresponding online identification system. The following specific example illustrates how the method is carried out:

[0071] 1) First, the ETL module reads the offline monitoring data from the offline data source and converts the data into a specific data structure;

[0072] Data can be read through the API provided by the system. For fil...
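The ETL step in [0071] can be pictured with a minimal sketch. The source does not give the record layout or the target data structure, so everything here is an assumption made for illustration: the CSV columns (task_id, timestamp, event), the TaskRecord class, and the load_task_records helper are all hypothetical.

```python
import csv
import io
from dataclasses import dataclass, field


@dataclass
class TaskRecord:
    """Hypothetical per-task structure: a time-ordered list of events."""
    task_id: str
    events: list = field(default_factory=list)  # (timestamp, event) pairs


def load_task_records(source):
    """Read monitoring rows and group them by task (assumed CSV layout)."""
    records = {}
    for row in csv.DictReader(source):
        rec = records.setdefault(row["task_id"], TaskRecord(row["task_id"]))
        rec.events.append((float(row["timestamp"]), row["event"]))
    # Sort each task's events by time so later steps see a time series.
    for rec in records.values():
        rec.events.sort()
    return records


# Made-up sample rows standing in for the offline data source.
SAMPLE = """task_id,timestamp,event
t1,0,SCHEDULE
t1,5,FAIL
t2,1,SCHEDULE
t1,6,SCHEDULE
t2,9,FINISH
"""

records = load_task_records(io.StringIO(SAMPLE))
```

Grouping by task and sorting by timestamp is one plausible choice because the later analysis is described as time-series based; the actual structure used by the invention may differ.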

Abstract

The invention discloses an online identification method and system for high-frequency consecutive failure tasks in a cloud computing system. Offline analysis and learning are performed over time series of offline monitoring data to obtain a failure frequency threshold that, at a given confidence level, characterizes the failure frequency of all non-high-frequency consecutive failure tasks; this threshold is then used to identify the high-frequency consecutive failure tasks in the online data. The invention analyzes tasks in the cloud computing system from the perspectives of events and resources, obtaining the frequency of failure events and the system resources a task consumes within a time period. By analyzing the failure frequency characteristics of tasks together with a resource usage time series model, it identifies in real time the high-frequency consecutive failure tasks that fail repeatedly and are hard to repair, and notifies the cloud computing system to take proactive failure recovery measures in advance. This saves system resources and improves the reliability and availability of the cloud computing system.
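The two phases described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the source does not say how the threshold is derived from the time series, so the nearest-rank empirical quantile used here is a stand-in, and the sliding-window counter, the function frequency_threshold, and the class OnlineIdentifier are hypothetical names. The resource usage time series model is not covered.

```python
import math
from collections import deque


def frequency_threshold(frequencies, confidence=0.95):
    """Nearest-rank quantile of offline failure counts (assumed method)."""
    if not frequencies:
        raise ValueError("need at least one offline observation")
    ordered = sorted(frequencies)
    rank = max(1, math.ceil(confidence * len(ordered)))
    return ordered[rank - 1]


class OnlineIdentifier:
    """Flags a task whose failures inside a sliding window exceed the threshold."""

    def __init__(self, threshold, window):
        self.threshold = threshold
        self.window = window      # window length in the timestamps' unit
        self.failures = {}        # task_id -> deque of failure timestamps

    def observe_failure(self, task_id, timestamp):
        q = self.failures.setdefault(task_id, deque())
        q.append(timestamp)
        # Drop failures that have fallen out of the window.
        while q and q[0] <= timestamp - self.window:
            q.popleft()
        return len(q) > self.threshold


# Made-up failure counts per window for tasks assumed to be non-high-frequency.
offline_counts = [0, 0, 1, 1, 1, 2, 2, 2, 3, 8]
threshold = frequency_threshold(offline_counts, confidence=0.9)

identifier = OnlineIdentifier(threshold, window=60)
flags = [identifier.observe_failure("t9", ts) for ts in (0, 10, 20, 30)]
later = identifier.observe_failure("t9", 200)
```

With these sample numbers the threshold is 3, the fourth failure within the window flags the task, and a failure long after the window has passed does not. A flagged task would then be reported to the system so that it can take a proactive recovery measure instead of rescheduling again.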

Description

Technical field

[0001] The invention belongs to the technical field of cloud computing, and in particular relates to an online identification method and system for high-frequency consecutive failure tasks in a cloud computing system.

Background

[0002] With its on-demand consumption model, cloud computing is increasingly used in fields such as finance and business. High availability of the system in a cloud computing environment is becoming key to the maturity of cloud computing technology. However, as cloud computing systems grow in scale and heterogeneity, failures of many types occur frequently, and they have become one of the key factors threatening the availability and reliability of these systems. In a cloud computing system, a task, as the smallest scheduling unit running on a single node, is the basic guarantee for the normal execution of user appli...

Application Information

IPC(8): G06F11/00
Inventors: 李影, 唐红艳, 贾统, 吴中海, 张齐勋
Owner: PEKING UNIV