Storm-based stream computing bipartite graph task scheduling method

A technology of task scheduling and bipartite graph, which is applied in computing, resource allocation, program control design, etc., can solve the problems of decreased system throughput, heavy physical load, and increased read data bandwidth and delay overhead, so as to reduce delay and avoid The effect of resource load imbalance

Inactive Publication Date: 2018-07-03
NORTHWEST UNIV(CN)
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] (1) A certain type of task is sensitive to CPU or memory. If the same CPU-sensitive task is scheduled on the same physical machine, it may make the multi-dimensional resource utilization of the machine unbalanced. For example, the CPU resource load is too high, while the memory resources are idle;
[0011] (2) In a heterogeneous cluster, different physical machines have different resources (CPU, memory, network bandwidth, etc.), and a simple even distribution strategy for task threads may lead to excessive physical loads with scarce resources, resulting in system throughput. decline;
[0012] (3) The data of a certain task is on node A, but it is scheduled to be executed on node B, which undoubtedly increases the bandwidth and delay overhead of reading data. The influence in can not be ignored;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Storm-based stream computing bipartite graph task scheduling method
  • Storm-based stream computing bipartite graph task scheduling method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The present invention will be further described in detail below in conjunction with the accompanying drawings and examples. The following examples are explanations of the present invention and the present invention is not limited to the following examples.

[0025] see Figure 1-Figure 2 , the present embodiment is based on a Storm-based stream computing bipartite graph task scheduling method, which is characterized in that: the Storm job directed acyclic graph DAJG (Directed Acyclic Job Graph) node and cluster physical machine node undirected graph (Undirected Node Graph ) nodes are regarded as two types of vertices of the bipartite graph, construct a bipartite graph model, integrate the computing power of each node of the cluster physical machine and the network transmission delay in the cluster, and according to the schedulable relationship between tasks and node resources, use the maximum weight The value matching algorithm is used for task scheduling, and the speci...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Storm-based stream computing bipartite graph task scheduling method. The method is characterized in that Storm DAJG (Directed Acyclic Job Graph) nodes and cluster physical machine UNG (Undirected Node Graph) nodes are seen as two types of vertexes of a bipartite graph; a bipartite graph model is built; and in combination with computing capabilities of nodes of physical machines in a cluster and network transmission delay in the cluster, and according to schedulable relationships between tasks and node resources, task scheduling is performed by adopting an algorithm for solving maximum weight matching of the bipartite graph. According to the method, under the condition of ensuring resource load balance of the physical machines in the cluster, the network delay in adata stream migration process during task execution is shortened, so that the overall performance of a system is improved.

Description

technical field [0001] The invention relates to a Storm-based stream computing bipartite graph task scheduling method. Background technique [0002] With the rapid development of information science and technology, applications in the cloud computing and Internet of Things environment present the characteristics of large data volume, continuous data flow, multi-source concurrency, and real-time processing. The real-time processing of these streaming data is called streaming data processing or streaming computing. In the big data streaming computing system Storm, multi-task scheduling is a key factor affecting the performance of the streaming computing system Storm. Tasks in a streaming computing system have two typical characteristics: [0003] (1) Multi-task and multi-stage features. [0004] From a theoretical model point of view, the tasks submitted to the system can be represented by a directed acyclic graph (DAG) during processing [1]. That is to say, task schedulin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50
CPCG06F9/5027G06F2209/502
Inventor 马力吴江田小伟
Owner NORTHWEST UNIV(CN)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products