Storm task deployment and configuration platform with cluster adaptability

An adaptive cluster technology, applied in special data processing applications, instruments, electrical digital data processing, etc., that addresses problems such as degraded cluster processing performance, node overload, and downtime, and that achieves good portability and inheritability with convenient deployment and implementation.

Active Publication Date: 2016-10-12
DALIAN UNIV OF TECH

AI Technical Summary

Problems solved by technology

(1) If the number of running processes is set too high, the executors are distributed across more nodes, which inevitably increases the communication traffic between nodes. However well the scheduling is optimized, an ideal scheduling scheme is then difficult to achieve.
[0007] (2) If the number of running processes is set too low, the executors are concentrated in one or a few workers. This can cause thread context-switching overhead and, more importantly, can overload some nodes with executors and cause downtime.
Although these scheduling algorithms can alleviate problems such as node overload and internal communication overhead to a certain extent, they cannot fundamentally solve them.
Because users cannot track the global state of the cluster in real time, blindly setting the number of running processes for a task in this situation inevitably has a serious impact on cluster processing performance.
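
The trade-off described above can be illustrated with a small sketch. It assumes executors are spread evenly (round-robin) over the workers and that every pair of executors may exchange data; both are simplifications made here for illustration, and the class `WorkerSpread` is hypothetical rather than part of Storm or of the platform. It reports, for a given number of workers, how many executors the busiest worker holds and how many executor pairs end up in different workers.

```java
import java.util.Arrays;

// Hypothetical illustration; not part of Storm or of the patented platform.
final class WorkerSpread {
    // Assumption: executors are assigned to workers round-robin (even spread).
    static int[] executorsPerWorker(int executors, int workers) {
        int[] load = new int[workers];
        for (int e = 0; e < executors; e++) {
            load[e % workers]++;
        }
        return load;
    }

    // Counts executor pairs placed in different workers. Each such pair
    // would communicate across a process (and possibly a node) boundary.
    static long crossWorkerPairs(int executors, int workers) {
        int[] load = executorsPerWorker(executors, workers);
        long totalPairs = (long) executors * (executors - 1) / 2;
        long sameWorkerPairs = 0;
        for (int n : load) {
            sameWorkerPairs += (long) n * (n - 1) / 2;
        }
        return totalPairs - sameWorkerPairs;
    }

    public static void main(String[] args) {
        int executors = 12;
        for (int workers : new int[] {1, 2, 4, 12}) {
            int busiest = Arrays.stream(executorsPerWorker(executors, workers))
                    .max().getAsInt();
            System.out.printf("workers=%d busiestWorker=%d crossWorkerPairs=%d%n",
                    workers, busiest, crossWorkerPairs(executors, workers));
        }
    }
}
```

With 12 executors, one worker holds all 12 executors and has no cross-worker pairs, while 12 workers hold one executor each and make all 66 pairs cross-worker. Neither extreme is attractive, which is the motivation for choosing the process number from the cluster state instead of by hand.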



Examples


Embodiment Construction

[0032] The implementation of this patent is described in detail below with reference to the accompanying drawings.

[0033] As shown in Figure 1, the system architecture of the invention is logically divided into four layers: the resource layer, the data layer, the application layer, and the user layer.

[0034] The resource layer mainly comprises the hardware resources of the Storm cluster and a MySQL database deployed on the master node, which caches monitoring data and cluster resource information. The monitoring thread on each worker node of the Storm cluster is triggered when a topology task is submitted.

[0035] The data layer obtains data from the monitoring threads through Java objects and reads and writes the database through the JDBC driver. The data layer includes three modules: node management, traffic management, and data management. The main function of node management is to obtain node data from data management and repackage it, so as to provide the application layer...
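
As a rough illustration of the repackaging role attributed to node management, the sketch below collapses raw monitoring samples into one current view per node. The types `NodeSample` and `NodeManagement`, their fields, and the "newest sample wins" rule are assumptions made for this example; the source does not specify the data layout, and the JDBC access to MySQL is deliberately left out so that the example runs without a database.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical record of one monitoring sample; the fields are assumptions.
final class NodeSample {
    final String host;
    final long timestampMs;
    final int freeSlots;

    NodeSample(String host, long timestampMs, int freeSlots) {
        this.host = host;
        this.timestampMs = timestampMs;
        this.freeSlots = freeSlots;
    }
}

// Hypothetical stand-in for the node management module.
final class NodeManagement {
    // Keeps only the newest sample per host, in first-seen host order.
    static Map<String, NodeSample> latestPerHost(List<NodeSample> samples) {
        Map<String, NodeSample> latest = new LinkedHashMap<>();
        for (NodeSample s : samples) {
            NodeSample previous = latest.get(s.host);
            if (previous == null || s.timestampMs > previous.timestampMs) {
                latest.put(s.host, s);
            }
        }
        return latest;
    }

    // Sums the free slots over the current per-host view.
    static int totalFreeSlots(Map<String, NodeSample> latest) {
        int total = 0;
        for (NodeSample s : latest.values()) {
            total += s.freeSlots;
        }
        return total;
    }

    public static void main(String[] args) {
        List<NodeSample> samples = Arrays.asList(
                new NodeSample("node-a", 1000L, 2),
                new NodeSample("node-b", 1000L, 1),
                new NodeSample("node-a", 2000L, 3));
        Map<String, NodeSample> latest = latestPerHost(samples);
        System.out.println("hosts=" + latest.keySet()
                + " totalFreeSlots=" + totalFreeSlots(latest));
    }
}
```

In a real deployment the sample list would presumably be read from the MySQL cache through JDBC; here it is built in memory to keep the example self-contained.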


Abstract

The invention discloses a Storm task deployment and configuration platform with cluster adaptability, belonging to the field of real-time streaming data computing and processing. Through the platform, a Storm cluster can sense the internal communication traffic among nodes and the remaining resources, and can adjust the configured number of running processes by itself according to the demands of the topology task submitted by the user and the remaining cluster resources. This removes the limitation of conventional Storm scheduling methods, in which the number of processes must be specified manually. The platform provides the user with a friendly, centralized interface for monitoring communication traffic, which is convenient to call from a task program, so that load and resource awareness is achieved. In addition, a one-phase commit scheduling method, which differs from the conventional Storm two-phase commit scheduling method, is embedded in the platform, and it optimizes the communication traffic among different processes on the same node. With the platform, optimal scheduling can be achieved by setting only basic optimization threshold parameters, which brings great convenience to cluster users and administrators.
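
The self-adjustment of the process number can be pictured with the following sketch. It is not the platform's algorithm, which the text available here does not spell out. It assumes a single hypothetical threshold, `maxExecutorsPerWorker`, and a list of free worker slots per node, and it picks the smallest number of workers that keeps every worker at or below the threshold, capped by the slots the cluster still has.

```java
// Hypothetical sketch; the real platform's selection rule is not given in the source.
final class AdaptiveWorkers {
    // Returns the smallest worker count that keeps each worker at or below
    // maxExecutorsPerWorker, capped by the free slots left in the cluster.
    static int chooseWorkerCount(int executors, int maxExecutorsPerWorker,
                                 int[] freeSlotsPerNode) {
        if (executors <= 0 || maxExecutorsPerWorker <= 0) {
            throw new IllegalArgumentException("counts must be positive");
        }
        int freeSlots = 0;
        for (int slots : freeSlotsPerNode) {
            freeSlots += Math.max(slots, 0);
        }
        if (freeSlots == 0) {
            throw new IllegalStateException("cluster has no free worker slots");
        }
        // Ceiling division: workers needed to respect the threshold.
        int needed = (executors + maxExecutorsPerWorker - 1) / maxExecutorsPerWorker;
        return Math.min(needed, freeSlots);
    }

    public static void main(String[] args) {
        int[] freeSlotsPerNode = {2, 2, 1};
        int workers = chooseWorkerCount(12, 4, freeSlotsPerNode);
        System.out.println("chosen workers = " + workers);
    }
}
```

When the cap applies (fewer free slots than workers needed), the threshold is exceeded; a fuller design would have to reject or queue the task in that case, which this sketch does not attempt.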

Description

Technical field

[0001] The invention relates to a Storm task deployment and configuration platform with cluster adaptability, belonging to the fields of massive data processing and real-time stream computing.

Background

[0002] With the development of information technology, the volume of information is growing explosively. Many information processing problems require complex real-time computation on streaming big data. This is a new data model: unlike data handled by traditional modeling methods, this type of data is suited to modeling as a transient data stream. Scenarios such as Weibo trending topics, shopping recommendations, and statistics on router datagrams all require complex decision-making on real-time streaming data.

[0003] In the traditional data processing mode, the data is often independent of the application; the system is responsible for storing the data centrally on disk, and the data is a static, fixed collection. The core value of stream co...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/2471, G06F16/27
Inventor: 李克秋, 邓衍, 齐恒, 李文信
Owner: DALIAN UNIV OF TECH