
A Hadoop scheduling method and system based on bandwidth awareness

A bandwidth-aware scheduling method, applied in the field of multiprogramming arrangements, that addresses the slow job response time and low overall performance of the Hadoop platform and improves job response speed.

Inactive Publication Date: 2017-07-18
HUAZHONG UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0006] In view of the above defects or improvement needs of the prior art, the present invention provides a bandwidth-aware Hadoop scheduling method and system, aimed at the technical problems of slow job response time and low overall performance of the Hadoop platform.



Embodiment Construction

[0049] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0050] The general idea of the present invention is a bandwidth-aware Hadoop scheduling algorithm that uses the bandwidth awareness provided by SDN (software-defined networking) to optimize Hadoop's own local-node-priority algorithm. The data migration time is calculated from the link bandwidth obtained through the software-defined network, the idle time of the nodes is estimated, ...
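The relationship between link bandwidth, data migration time, and node idle time can be illustrated with a minimal Python sketch. It is not the patented algorithm: the names `Candidate`, `migration_time_s`, and `ready_time_s` are hypothetical, and the sketch assumes that a data transfer can overlap with waiting for the node to become idle.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A candidate compute node for one task (all fields are illustrative)."""
    name: str
    idle_at_s: float        # estimated time at which the node becomes idle
    bandwidth_mbps: float   # link bandwidth, e.g. as reported by an SDN controller
    holds_data: bool        # True if the input block is already on this node


def migration_time_s(block_mb: float, bandwidth_mbps: float) -> float:
    """Seconds needed to move block_mb megabytes over a bandwidth_mbps (megabit/s) link."""
    return block_mb * 8.0 / bandwidth_mbps


def ready_time_s(c: Candidate, block_mb: float) -> float:
    """Earliest start time on c: the node must be idle and the data must have arrived.

    Assumption: the transfer runs while the node is still busy, so the two
    delays overlap and the later of the two determines the start time.
    """
    transfer = 0.0 if c.holds_data else migration_time_s(block_mb, c.bandwidth_mbps)
    return max(c.idle_at_s, transfer)


candidates = [
    Candidate("local", idle_at_s=30.0, bandwidth_mbps=1000.0, holds_data=True),
    Candidate("remote", idle_at_s=0.0, bandwidth_mbps=1000.0, holds_data=False),
]
best = min(candidates, key=lambda c: ready_time_s(c, block_mb=128.0))
```

In this example the data-local node is busy for 30 s, while moving a 128 MB block to the idle remote node takes about 1 s, so a strictly locality-first choice would wait needlessly.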


Abstract

The invention discloses a bandwidth-aware Hadoop scheduling method. The method establishes a job completion time model for Hadoop task scheduling, builds a mathematical model of the Hadoop scheduling system, and converts the Hadoop task scheduling problem into the problem of finding, for a job to be scheduled, the task assignment that minimizes the job completion time. Using the real-time network management and flow control functions provided by SDN, it proposes a time-slot-based network bandwidth allocation mechanism in which the period over which each link's residual bandwidth is occupied is divided into equal time slots. Based on the job completion time model and the time-slot bandwidth allocation mechanism, the method jointly considers the locality of a task and the real-time network bandwidth before assigning it a computing node, and assigns each task to the computing node that can provide the earliest completion time. The invention solves the problem that existing methods cannot schedule tasks from a global perspective and with regard to the actually available network bandwidth at the same time.
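As a rough illustration of the mechanism summarized in the abstract, the following Python sketch tracks the residual bandwidth of a link in equal time slots and assigns a task to the node with the earliest estimated completion time. Everything here is an assumption made for illustration: the names `SlottedLink`, `Node`, and `assign` are hypothetical, the greedy slot-filling policy is only one possible choice, and transfer finish times are rounded up to the slot boundary. The patent text shown here does not specify these details.

```python
from dataclasses import dataclass, field


@dataclass
class SlottedLink:
    """Residual bandwidth of one link, tracked per equal-length time slot."""
    capacity_mbps: float
    slot_s: float = 1.0
    used_mbps: dict = field(default_factory=dict)  # slot index -> reserved Mbit/s

    def residual(self, slot: int) -> float:
        return self.capacity_mbps - self.used_mbps.get(slot, 0.0)

    def plan(self, size_mbit: float, start_slot: int = 0):
        """Greedily fill residual bandwidth slot by slot (illustrative policy).

        Returns (finish_time_s, {slot: rate_mbps}) without reserving anything.
        """
        remaining, slot, plan = size_mbit, start_slot, {}
        while remaining > 1e-9:
            rate = self.residual(slot)
            if rate > 0:
                sent = min(remaining, rate * self.slot_s)
                plan[slot] = sent / self.slot_s
                remaining -= sent
            slot += 1
        return slot * self.slot_s, plan  # rounded up to the end of the last slot

    def commit(self, plan: dict) -> None:
        """Reserve the bandwidth described by a plan."""
        for slot, rate in plan.items():
            self.used_mbps[slot] = self.used_mbps.get(slot, 0.0) + rate


@dataclass
class Node:
    name: str
    idle_at_s: float          # estimated time at which the node becomes idle
    link: SlottedLink         # link over which input data would reach the node
    is_local: bool = False    # True if the node already holds the input data


def assign(task_mbit: float, compute_s: float, nodes: list) -> tuple:
    """Pick the node with the earliest estimated completion time and reserve its slots."""
    best = None
    for n in nodes:
        data_ready, plan = (0.0, {}) if n.is_local else n.link.plan(task_mbit)
        finish = max(n.idle_at_s, data_ready) + compute_s
        if best is None or finish < best[0]:
            best = (finish, n, plan)
    finish, node, plan = best
    node.link.commit(plan)
    node.idle_at_s = finish  # the node is busy until the task completes
    return node.name, finish


link = SlottedLink(capacity_mbps=1000.0, used_mbps={0: 600.0})
nodes = [
    Node("local", idle_at_s=20.0, link=SlottedLink(1000.0), is_local=True),
    Node("remote", idle_at_s=0.0, link=link),
]
first = assign(1024.0, 10.0, nodes)   # 128 MB block, 10 s of compute
second = assign(1024.0, 10.0, nodes)
third = assign(1024.0, 10.0, nodes)
```

With these numbers the first two tasks go to the remote node (finishing at 12 s and 22 s) because the local node is busy until 20 s, and the third goes to the local node (finishing at 30 s). Committing each plan is what lets later decisions see the bandwidth already promised to earlier transfers.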

Description

Technical field

[0001] The invention belongs to the field of information processing and data computing, and more specifically relates to a bandwidth-aware Hadoop scheduling method and system.

Background

[0002] With the advancement of science and technology, Internet technology has developed rapidly, which has not only promoted social development but also greatly enriched people's online lives. The arrival of Web 2.0 changed the Internet profoundly. A prominent feature of Web 2.0 is user-generated content, and the large volume of such content has led to explosive growth in data. To meet the challenges of large-scale data processing, cloud computing was proposed as a new model for computing on and processing large-scale data. Thanks to the joint evolution of multiple technologies such as distributed computing and virtualization, cloud computing has emerged as a new type of big data processi...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F9/46
Inventor: 戴彬, 秦鹏, 邵翔, 邹云飞
Owner: HUAZHONG UNIV OF SCI & TECH