Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A job scheduling method for map/reduce massive data processing platform

A job scheduling and job technology, applied in the direction of program startup/switching, resource allocation, multi-programming device, etc., can solve problems such as reducing the resource utilization rate of the Map/Reduce platform

Inactive Publication Date: 2018-05-01
BEIJING UNIV OF TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the existing scheduling method, when the Reduce task is idle and waiting, the computing resources allocated to it are not released, which greatly reduces the resource utilization of the Map / Reduce platform

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A job scheduling method for map/reduce massive data processing platform
  • A job scheduling method for map/reduce massive data processing platform
  • A job scheduling method for map/reduce massive data processing platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0111] The present invention will be described below in conjunction with the drawings and specific embodiments.

[0112] figure 1 It is a system deployment diagram of a Map / Reduce type massive data processing platform using the method of the present invention. The system adopting the method of the present invention can be deployed on a computer cluster. The cluster contains multiple servers (cluster nodes), and the servers are connected through a network. Cluster nodes are divided into two categories, including one management node and multiple computing nodes. The Map / Reduce type massive data processing platform adopting the method of the present invention includes four core modules: a job scheduling module, an application management module, a task execution module and a node management module. Among them, the job scheduling module is deployed on the management node; the application management module, the node management module, and the task execution module are all deployed o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A job scheduling method for a Map / Reduce massive data processing platform belongs to the field of massive data processing, and particularly relates to job scheduling and resource management in the Map / Reduce massive data processing platform. The present invention proposes a Map / Reduce job scheduling method based on Reduce task resource preemption, which realizes that when the Reduce task is idle and waiting, the computing resources occupied by it are allocated to the Map task for use. The invention mainly includes: preempting resources, a job scheduling module, an application management module, a task execution module, a node management module, a task suspension and recovery method, and a job scheduling method oriented to preempting resources. The present invention can be applied to the job scheduling and resource management of the data center. Through the preemptive scheduling of the Reduce task in the Map / Reduce job, the reasonable allocation of computing resources in the data center can be realized, the utilization rate of computing resources can be improved, and the running time of jobs can be shortened.

Description

Technical field [0001] The invention belongs to the field of massive data processing, and particularly relates to job scheduling and resource management in a Map / Reduce type massive data processing platform. Background technique [0002] The Map / Reduce massive data processing platform is the latest technological advancement in the field of massive data processing. It mainly serves big data applications with a one-time write, multiple read data access mode and an Embarrassing Parallel computing mode. The Map / Reduce platform provides the Map / Reduce parallel computing model and its corresponding runtime environment. The Map / Reduce parallel computing model abstracts the application data processing flow into the Map phase and the Reduce phase. The Map phase and the Reduce phase can be respectively mapped to multiple Map tasks and Reduce tasks for parallel execution. Among them, the Map phase mainly performs data transposition and deformation, and the Reduce phase mainly performs dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/48G06F9/50
Inventor 梁毅王玉凤樊明璐张辰
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products