A Distributed Task Assignment Method

A distributed task and allocation method technology, applied in the field of task allocation, can solve problems such as task failure, waste of computing resources, dynamic adjustment, etc., and achieve the effect of preventing the generation of failed tasks, reducing running time delay, and ensuring execution efficiency

Active Publication Date: 2019-04-09
CHINA UNIONPAY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although this can solve the above problems to a certain extent, the configuration value is fixed and cannot be dynamically adjusted according to the needs of different tasks, which may cause a waste of computing resources
If the configuration value is too low, the task will still fail due to insufficient disk space; if the configuration value is too high, the compute nodes will not be fully utilized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Distributed Task Assignment Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Introduced below are some of the various embodiments of the invention, intended to provide a basic understanding of the invention. It is not intended to identify key or critical elements of the invention or to delineate the scope of protection.

[0036] A distributed job is usually divided into two types of tasks: slice parallel task (map) and merge reduction task (reduce). Each type of task will have several identical subtasks. Subtasks of the same type usually have the same input data split size (split). For example, most of the multi-cluster distributed jobs now use the Hadoop framework. In the Hadoop framework, distributed tasks are divided into two stages, the map stage and the reduce stage. Therefore, there are correspondingly two types of subtasks, the map subtask (sharding the data) and the reduce subtask (reducing the sliced ​​data), and the reduce phase can only be entered after the map phase is completed. The input data slice size of the same type of subtas...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a distributive task allocation method, which comprises the following steps that: a job is subjected to initialization decomposition to obtain a plurality of tasks, and a main control node builds a task operation state table for the tasks; the first task is allocated to a calculation node, and meanwhile, a backup task identical to the first task is allocated to a prediction node; the prediction node calculates the disc space required by the backup task, and feeds back the calculation result to the main control node; the main control node updates the task operation state table according to the calculation result from the prediction node; before the task is allocated to the calculation node, the main control node predicts the task state of the calculation node and estimates available task operation space according to the predicted task state and the task operation state table; and under the condition that the estimated available task operation space is greater than the space required for the task operation, the task is allocated to the calculation node, and otherwise, the calculation node is re-selected.

Description

technical field [0001] The invention relates to a task allocation method in a distributed computing system, in particular to an optimization method capable of preventing distributed task computing time delays. Background technique [0002] In the prior art, in a distributed computing process, a job (job) is usually decomposed into multiple subtasks, which are assigned by the main control node to multiple computing nodes for parallel computing. When each computing subtask (task) runs on the computing node, it needs to write the intermediate data to the local file system. Usually, when multiple subtasks are running on the same computing node, they can write data to the file system through different disk write points, so as to improve the throughput of the disk when the job is running. [0003] If the disk space where a write point is located is insufficient, the task will be denied access because the disk space is full when the file is written halfway, causing the task to fai...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/46
Inventor 王骏赵金涛杨鸿超邱雪涛
Owner CHINA UNIONPAY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products