A scheduling method based on backup task running time estimation in hadoop big data platform
A big data platform and backup task technology, which is applied in the scheduling field based on backup task running time estimation, can solve problems such as insufficient efficiency of backup task scheduling due to estimation accuracy, meaningless backup task speculative execution mechanism, invalid backup task allocation and operation, etc. , to achieve the effect of shortening the operation turnaround time, increasing the reliability and improving the efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0054] The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
[0055] figure 1 It is a macro flow chart of the scheme of the present invention, figure 2 It is the flow chart of the scheduling method based on backup task running time estimation of the present invention, as shown in the figure, the scheduling method based on backup task running time estimation in the Hadoop big data platform of the present invention mainly includes the following seven steps: Step 1: Determine whether the task process entity TaskTracker in the Job (job) on the JobTracker node, that is, the task requester, is a slow node; Step 2: Check whether the number of tasks already started in the Job (job) on the JobTracker node exceeds Threshold; step 3: filter out all tasks that meet the conditions in the Job (job), and save them in the candidates table, calculate the remaining time leftTime of the task according to LATE (the lon...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


