Distributed monitoring system based on Hadoop cluster and monitoring method thereof

A hadoop cluster and distributed monitoring technology, applied in the field of distributed computing, can solve the problems of unable to start or stop monitoring behavior, large impact on system performance, and large amount of data

Active Publication Date: 2011-07-20
SUZHOU INST FOR ADVANCED STUDY USTC
View PDF4 Cites 51 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing monitoring systems have very obvious shortcomings in practical applications
First of all, the data obtained by the monitoring framework is not accurate enough, and there is a large amount of interference data. Whether using black-box or white-box methods, for a computer cluster with thousands of servers, the amount of monitored data is too large, even in self-learning Or with the help of classification algorithms, the analysis workload of thes...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed monitoring system based on Hadoop cluster and monitoring method thereof
  • Distributed monitoring system based on Hadoop cluster and monitoring method thereof
  • Distributed monitoring system based on Hadoop cluster and monitoring method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0057] Such as figure 1 Shown is a schematic diagram of the architecture of the Hadoop cluster-based distributed monitoring system of the present invention. Contains one or more Client nodes, one Master node, and multiple Slave nodes. Among them, the Master node corresponds to the Master node in the Hadoop background system, the Slave node corresponds to multiple Slave nodes in Hadoop, and the Client node is the monitoring submission server allowed by the cluster.

[0058] The modules required on each Client node server are as follows:

[0059] 1. ProbeClient module

[0060] The user submits monitoring through the ProbeClient module, which is an important part of the task submission module in the present invention. The user needs to provide at least two files for system monitoring under the framework of the present invention, a script file probe.xml whose format is exactly the same as the configuration script of Hadoop itself. The script indicates the monitoring point of t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a distributed monitoring system based on a Hadoop cluster and a distributed monitoring method thereof. The system comprises a client, a master computer and a slave computer. The system is characterized in that: the master computer and the slave computer in the system are constructed based on a MapReduce framework of the Hadoop cluster; the client is used for submitting a monitoring work request to the master computer; after responding to the monitoring work request, the master computer divides monitoring work and distributes the divided monitoring work to the salve computer for independent completion; and the slave computer is used for completing an independent monitoring work task, integrating a result through the master computer and returning data to the client. By the invention, data such as the task schedule and the like of a distributed computing network MapReduce can be effectively monitored, the monitoring result can be precisely obtained in real time, and independent dynamic switching is realized under the condition of not influencing work execution.

Description

technical field [0001] The invention belongs to the technical field of distributed computing, and relates to a monitoring system applied to a Hadoop distributed operation system, in particular to a monitoring and control system for a large-scale distributed cluster running a MapReduce program. Background technique [0002] Hadoop is an open source distributed middleware for running MapReduce tasks. As an important technical component, it is widely used in various current cloud computing background systems. Generally, the number of commercial Hadoop clusters ranges from hundreds to thousands. As the scale of clusters managed by Hadoop becomes larger and larger, its management becomes more and more difficult. Generally, errors generated during Hadoop cluster operation can be divided into hardware errors, operating system errors, middleware errors, and user program errors according to their sources. There are many sources of errors, the interaction of various factors in the cl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L29/08
Inventor 周学海吕松武杨峰代栋孙明明陈涛
Owner SUZHOU INST FOR ADVANCED STUDY USTC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products