Map/Reduce type mass data processing platform-orientated job scheduling method

A job scheduling and platform technology, applied in the direction of program startup/switching, resource allocation, multi-program device, etc., can solve the problem of reducing the utilization rate of Map/Reduce platform resources

Inactive Publication Date: 2015-01-28
BEIJING UNIV OF TECH
View PDF4 Cites 56 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing scheduling method, when the Reduce task is idle and waiting, the computing resources all

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Map/Reduce type mass data processing platform-orientated job scheduling method
  • Map/Reduce type mass data processing platform-orientated job scheduling method
  • Map/Reduce type mass data processing platform-orientated job scheduling method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0111]The present invention will be described below in conjunction with the accompanying drawings and specific embodiments.

[0112] figure 1 It is a system deployment diagram of a Map / Reduce massive data processing platform adopting the method of the present invention. The system adopting the method of the invention can be deployed on a computer cluster. The cluster includes multiple servers (cluster nodes), and the servers are connected through a network. Cluster nodes are divided into two categories, including a management node and multiple computing nodes. The Map / Reduce massive data processing platform adopting the method of the present invention includes four core modules: a job scheduling module, an application management module, a task execution module and a node management module. Among them, the job scheduling module is deployed on the management node; the application management module, node management module and task execution module are all deployed on the compu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Map/Reduce type mass data processing platform-orientated job scheduling method belonging to the field of mass data processing, and particularly relates to job scheduling and resource management in a Map/Reduce type mass data processing platform. The invention provides a Map/reduce job scheduling method based on Reduce task resource preemption, which realizes that when a Reduce task is in an idle waiting state, computing resources occupied by the Reduce task are distributed to a Map task for being used. The Map/Reduce type mass data processing platform-orientated job scheduling method mainly comprises: preemptible resources, a job scheduling module, an application management module, a task executing module, a node management module, a task suspending and recovering method, and a preemptible-resource-orientated job scheduling method. The Map/Reduce type mass data processing platform-orientated job scheduling method can be applied to job scheduling and resource management of a data center, and realizes reasonable distribution of computing resources of the data center by preemption scheduling of a Reduce task in a Map/Reduce job, thereby improving the computing resource utilization rate and shortening the job running time.

Description

technical field [0001] The invention belongs to the field of massive data processing, and in particular relates to job scheduling and resource management in a Map / Reduce type massive data processing platform. Background technique [0002] The Map / Reduce massive data processing platform is the latest technological development in the field of massive data processing, mainly serving big data applications with a write-once, multiple-read data access mode and an Embarrassing Parallel computing mode. The Map / Reduce platform provides the Map / Reduce parallel computing model and its corresponding runtime environment. The Map / Reduce parallel computing model abstracts the data processing flow of the application into the Map stage and the Reduce stage. The Map stage and the Reduce stage can be respectively mapped to multiple Map tasks and Reduce tasks to be executed in parallel. Among them, the Map stage mainly performs data transposition and deformation, and the Reduce stage mainly p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/48G06F9/50
Inventor 梁毅王玉凤樊明璐张辰
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products