Cluster log feature extraction method and device and storage medium

A cluster log feature extraction technology, applied in special-purpose data processing, digital data information retrieval, instruments, etc. It addresses problems such as the inability to quickly locate error information, the inability to understand the operation of the cluster storage system as a whole, and the complexity of cluster system management, with the aim of reducing production accidents and facilitating fault prediction and fault classification.

Pending Publication Date: 2019-07-09
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

In the actual operation of cluster storage systems, a commonly used log management method sends system logs periodically or in real time, which achieves centralized transmission of logs. However, because the logs are not analyzed or managed, it is impossible to understand the operating status of the entire cluster storage system globally or to quickly locate error messages. Moreover, as the number of cluster nodes increases, managing the cluster system becomes increasingly complicated.

Embodiment Construction

[0055] Embodiments of the cluster log feature extraction method, device and storage medium of the present invention will be described below with reference to the accompanying drawings. Those skilled in the art would recognize that the described embodiments can be modified in various ways or combinations thereof without departing from the spirit and scope of the invention. Accordingly, the drawings and description are illustrative in nature and not intended to limit the scope of the claims. Also, in this specification, the drawings are not drawn to scale, and like reference numerals denote like parts.

[0056] As shown in Figure 1, the cluster log feature extraction method of this embodiment includes the following steps:

[0057] Step S10: collect the logs of the server cluster through the Flume client (Flume is a distributed system for collecting, aggregating and transmitting massive volumes of logs) and send them to the HBase database server. Flume takes the Agent process as the smallest indep...

Abstract

The invention relates to infrastructure operation and maintenance, and provides a cluster log feature extraction method, device and storage medium. The method comprises the following steps: collecting the logs of a server cluster through a Flume client and sending them to a database; performing data cleaning on the log data and screening out the original data; extracting characteristic values, including a mean value, an effective value, a peak value, a square root amplitude value, a waveform index, a pulse index and a kurtosis index, from the original data; and carrying out a Pearson correlation coefficient operation between each extracted characteristic value and the original data, then comparing the calculated correlation coefficient with a correlation threshold. Data whose correlation is higher than the threshold is regarded as valid, data whose correlation is lower than the threshold is regarded as invalid, and the invalid data is eliminated. According to the invention, the effective information in the production data of each host in the server cluster can be screened out, and the characteristic values of the production data are extracted from that effective information, which facilitates fault prediction and fault classification for the production system and reduces the occurrence of production accidents.
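The extraction and screening steps in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patent's implementation: the formulas are the common time-domain definitions (effective value taken as RMS, waveform index as RMS over mean absolute value, pulse index as peak over mean absolute value), the data is assumed to be split into numeric windows, and the correlation is assumed to be computed between each feature's per-window series and a caller-supplied reference series derived from the original data. The function names, the `reference` argument and the use of the absolute correlation are all hypothetical.

```python
import math


def time_domain_features(window):
    """Compute the seven characteristic values for one window of numeric samples.

    Assumes the window is non-empty, not all zeros and not constant,
    otherwise the ratio features would divide by zero.
    """
    n = len(window)
    mean = sum(window) / n
    abs_mean = sum(abs(v) for v in window) / n
    rms = math.sqrt(sum(v * v for v in window) / n)          # effective value
    peak = max(abs(v) for v in window)
    sra = (sum(math.sqrt(abs(v)) for v in window) / n) ** 2  # square root amplitude
    variance = sum((v - mean) ** 2 for v in window) / n
    fourth_moment = sum((v - mean) ** 4 for v in window) / n
    return {
        "mean": mean,
        "effective_value": rms,
        "peak_value": peak,
        "square_root_amplitude": sra,
        "waveform_index": rms / abs_mean,
        "pulse_index": peak / abs_mean,
        "kurtosis_index": fourth_moment / variance ** 2,
    }


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences.

    Returns 0.0 when either sequence is constant (an assumption made here
    so that constant features are treated as uncorrelated).
    """
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    denom = math.sqrt(var_a * var_b)
    return cov / denom if denom > 0 else 0.0


def select_valid_features(windows, reference, threshold):
    """Keep the features whose per-window series correlates with `reference`.

    `reference` holds one value per window (hypothetical input). A feature is
    treated as valid when |r| exceeds `threshold`; the rest are dropped.
    """
    per_window = [time_domain_features(w) for w in windows]
    kept = {}
    for name in per_window[0]:
        series = [features[name] for features in per_window]
        r = pearson(series, reference)
        if abs(r) > threshold:
            kept[name] = r
    return kept
```

Calling `select_valid_features(windows, reference, 0.8)` returns a dictionary that maps each retained feature name to its correlation coefficient. The threshold value 0.8 is an arbitrary example and is not taken from the source.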

Description

Technical field

[0001] The present invention relates to infrastructure operation and maintenance, in particular to a cluster log feature extraction method, device and storage medium.

Background

[0002] In the era of explosive information growth, very large file sizes and data scales have become a reality, and the number of nodes in a cluster storage system has reached 64. Managing such a large cluster system has become a severe challenge for the data center. It is particularly important to track the running status of cluster nodes in a timely manner and to locate node error information accurately. In the actual operation of cluster storage systems, a commonly used log management method sends system logs periodically or in real time, which achieves centralized transmission of logs. However, because the logs are not analyzed or managed, it is impossible to understand globally the running status of the entire cluster stora...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/17, G06F16/182, G06F16/215
CPC: G06F16/215, G06F16/17, G06F16/182
Inventor: 吴超勇, 陈仕财
Owner: PING AN TECH (SHENZHEN) CO LTD