System and method for intelligently monitoring large-scale data center cluster computing nodes

A large-scale data and computing node technology, applied in the computer field, can solve problems such as equipment that cannot accurately locate faults, time-consuming and labor-intensive problems, etc.

Active Publication Date: 2014-11-19
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The personnel in charge of operation and maintenance need to check the running status of the cluster in real time. Once a problem occurs, although the location of the node can sometimes be located, it is often impossible to accurately locate the faulty device. It also takes time and effort to judge and troubleshoot through the experience of the staff; the cluster Although many users can understand their own job status through many job scheduling software, they can rarely count the historical analysis of jobs; moreover, cluster decision makers often cannot directly obtain relevant expenses, usage efficiency, and personnel work efficiency from the cluster. , cost-benefit ratio and other relevant decision-making information can only be made through manual analysis of a large amount of data, which is time-consuming and labor-intensive
In addition, application developers are often unable to obtain information such as hardware microarchitecture, system process, stack, and module error and crash statistics that are urgently needed to optimize application software from the cluster. They need to obtain it through a large number of experiments based on experience, which is time-consuming and laborious.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for intelligently monitoring large-scale data center cluster computing nodes
  • System and method for intelligently monitoring large-scale data center cluster computing nodes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0029] See attached figure 1 , shows a system for intelligently monitoring large-scale data center cluster computing nodes proposed by the present invention, including monitoring nodes installed on data center cluster computing nodes, monitoring equipment connected to each monitoring node, and user terminal equipment. Wherein the data center cluster computing node has corresponding hardware devices, such as processor CPU, memory, hard disk, Ethernet controller, etc., and the operating system and application software are run on the computing node; the monitoring equipment includes a main monitoring node and a database, and the main The monitoring node communicates with each monitoring node installed on the above computing nodes, and can obtain the hardware and sof...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a system and a method for intelligently monitoring large-scale data center cluster computing nodes. Hardware microarchitecture data indexes of the computing nodes and data indexes related to processes of running applications are acquired through monitoring nodes in the system, the data indexes are transmitted to monitoring equipment in the system, big data analysis is executed via the monitoring equipment, and results are sent to customer premise equipment to be displayed to a user. By the system and the method, the microarchitecture data indexes of the computing nodes and the data indexes of the processes of the running applications are acquired, so that intelligent big data analysis is realized, the faulted computing nodes are positioned automatically, and fault causes are provided.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a system and method for intelligently monitoring large-scale data center cluster computing nodes. Background technique [0002] With the continuous progress of human society and the development of science and technology, people not only have a wider understanding of the natural world, but also have an increasingly urgent need to explore the unknown world. This has led to a sharp increase in the amount of information and data held by human beings. At the same time, these massive information and data need to be analyzed and processed in a timely manner. For example, a large astronomical radio telescope array can generate more than 100GB of cosmic microwave data in one second, and these data need to be analyzed in time; Data is also measured in terabytes; in addition, fields such as human genome engineering, oil exploration, weather forecasting, etc. have also put forward higher ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30
Inventor 刘羽吕文静金莲陈博文于涛
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products