An Iterator Based on Optimistic Fault Tolerance

An iterator and iterative technology, applied in the field of distributed iterative computing, can solve problems such as large overhead, no comprehensive consideration of the size of the iterative task and task failure rate, and the impact of iterative computing efficiency, etc., to achieve the best fault-free performance, The effect of reducing the delay of calculation and improving the efficiency of iterative processing

Active Publication Date: 2021-04-02
NORTHEASTERN UNIV LIAONING +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, when the existing iterator has a small amount of tasks or a low probability of task failure, the checkpoint will bring a large additional overhead, which affects the real iterative computing efficiency.
In addition, only pessimistic fault-tolerant strategies are used in existing iterators, and there is no fault-tolerant scheme that comprehensively considers the size of iteration tasks and task failure rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Iterator Based on Optimistic Fault Tolerance
  • An Iterator Based on Optimistic Fault Tolerance
  • An Iterator Based on Optimistic Fault Tolerance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0052] In this embodiment, an iterator based on an optimistic fault-tolerant method is applied to Gaia, a big data analysis system. Gaia is a real-time stream processing system, and its fault-tolerant mechanism is realized by periodically creating snapshots of distributed data streams. These snapshots act as a fallback consistency checkpoint when the system encounters a failure, and resend the data after the snapshot from the data source. Although the fault-tolerant mechanism based on checkpoints is relatively simple to implement, frequent checkpoint settings increase the delay of calculations, especially for more complex tasks such as iterative calculations. This fault-t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an iterator based on an optimistic fault tolerant method. The invention belongs to the technical field of distributed iterative computation in a big data environment. The iterator comprises an incremental iterator and a batch iterator, comprehensively considers iterative tasks of different sizes and iterative computation tasks of different failure rates, introduces a compensation function, and enables a system to use the function to re-initialize lost partitions. And when a fault occurs, the system suspends the current iteration, ignores the failed task, reallocates thelost calculation to the newly obtained node, and calls the compensation function on the partition to recover the consistent state and recover execution. And for the condition of relatively low faultfrequency, the calculation delay is greatly reduced, and the iterative processing efficiency is improved. For the condition of higher fault frequency, the iterator can ensure that the iterative processing efficiency is not lower than that of the iterator before optimization. The iterator based on an optimistic fault tolerant method does not need to add additional task operation, thus effectively reducing the fault-tolerant overhead.

Description

technical field [0001] The invention relates to the technical field of distributed iterative computing in a big data environment, in particular to an iterator based on an optimistic fault-tolerant method. Background technique [0002] With the rapid development of Internet technology, iterative processing and analysis of massive data has great application value. With the continuous increase of computing nodes in distributed systems, the failure rate also increases. The design of iterators and fault-tolerant methods is one of the main bottlenecks restricting computing efficiency, and it is a key technology that cannot be ignored. [0003] In the prior art, when the iterator processes batch tasks, if a node fails or a task fails, the system will restart the iterative task according to the feature that the file can be accessed repeatedly. When processing stream computing tasks, fault tolerance is implemented based on distributed snapshots and partially resendable data sources...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/07G06F9/448
CPCG06F11/0793G06F9/448
Inventor 郭文鹏赵宇海王国仁季航旭李博扬
Owner NORTHEASTERN UNIV LIAONING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products