MapReduce bandwidth optimization method

An optimization method and bandwidth technology, applied in the field of Hadoop cloud computing, can solve problems such as insufficient automation of scheduling and deployment, no separation of task scheduling and resource allocation, and inability to optimize scheduling, so as to shorten data migration time, solve network congestion problems, and improve work efficiency. The effect of efficiency

Inactive Publication Date: 2016-02-24
HUAZHONG UNIV OF SCI & TECH
View PDF14 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In response to this problem, researchers have proposed some optimization schemes, such as MapReduce column storage optimization, MapReduce connection optimization, MapReduce scheduling optimization, etc., but most of the above schemes have the following problems: task scheduling and resource allocation are not separated, scheduling deployment is not automatic enough, and cannot be quickly implemented. Ok, optimize scheduling according to network conditions, etc.
However, when a large number of data streams emerge, the congestion avoidance path is also unable to change the congestion situation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • MapReduce bandwidth optimization method
  • MapReduce bandwidth optimization method
  • MapReduce bandwidth optimization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0033] figure 1 It is a schematic diagram of the MapReduce bandwidth optimization method of the present invention under the SDN and Hadoop systems. include:

[0034] (1) After the Hadoop job is submitted, the JobTracker sends the task execution node information to the OpenFlow controller;

[0035](2) According to the received task execution node information, the OpenFlow controller further de...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a MapReduce bandwidth optimization method which comprises the following steps: after Hadoop work is submitted, an OpenFlow controller, according to task execution node information sent by JobTracker and through a Map intermediate value routing strategy, determines nodes for carrying out a Map intermediate value merge task, updates corresponding flow entry and sends the flow entry to an OpenFlow switch; and the OpenFlow switch receives and installs the flow entry through a secure channel, then, carries out flow entry matching on the received data packets, and if behavior type for merging the Map intermediate values is matched therewith, carries out merging of the Map intermediate values. Through the combination of the OpenFlow and Hadoop, and by utilizing data handling capacity of the OpenFlow switch, the Map intermediate values, that is, the intermediate tuple data obtained after the action of a map() function, are subjected to merge processing in advance, thereby greatly relieving the problem of network congestion in the data migration process and improving Hadoop work efficiency substantially.

Description

technical field [0001] The design of the present invention belongs to the Hadoop cloud computing field, and more specifically relates to a MapReduce bandwidth optimization method. Background technique [0002] MapReduce distributed computing requires a large number of one-to-many or many-to-many communications between servers. This makes the data center network often congested in the current technical environment, resulting in increased packet loss, increased transmission delay, and decreased throughput. Especially in the MapReduce process of Hadoop cloud computing, when the main server assigns the Map task and the Reduce task, the Map server starts to perform calculation work and migrates the calculated intermediate value to the Reduce server. During the migration process of a large amount of data, it is very difficult to It is easy to cause network congestion, making the Reduce server wait for a long time or even fail. The time it takes for data to travel across the netw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L12/733H04L12/741H04L29/08H04L45/122H04L45/74
CPCH04L45/122H04L45/745H04L67/10H04L67/63
Inventor 戴彬杨军王曼吕璐徐冠
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products