Early warning method and device for memory fault

A fault warning and memory technology, applied in the computer field, can solve problems with higher requirements and achieve the effect of rich feature description

Pending Publication Date: 2021-08-24
ALIBABA GRP HLDG LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since not all exception logs are logs of memory exceptions, and at the same time, analysis based on a large number of exception logs requires high computing resources. Therefore, the prediction accuracy and applicable scenarios of the existing memory fault warning schemes have Certain defects require a more efficient and accurate memory fault early warning solution

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Early warning method and device for memory fault
  • Early warning method and device for memory fault
  • Early warning method and device for memory fault

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0050] First, for step 102, based on the memory exception logs in the exception logs, analyze the time-series change characteristics of the memory exception logs according to the preset time period, that is, the characteristics constructed from a macro perspective. The specific construction process is as follows figure 2 shown, including:

[0051] Step 201, setting multiple time windows.

[0052] Wherein, the time window is used to specify a time period. The set multiple time windows may be custom-set; or a relatively long time period may be specified, and the corresponding multiple time windows may be determined based on a sliding window algorithm.

[0053] Step 202, using the memory exception log template to filter corresponding types of memory exception logs from the exception logs.

[0054] The memory exception log template in this step is a log template obtained based on the parsing and abstraction of exception logs. The memory exception log with the identification of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an early warning method and device for a memory fault, relates to the technical field of computers, and mainly aims to describe features corresponding to a memory working state from multiple dimensions so as to predict whether the memory has a fault risk or not. According to the main technical scheme, the method comprises the steps of obtaining an abnormal log of equipment; analyzing time sequence change characteristics of the memory exception logs according to a preset time period on the basis of the memory exception logs in the exception logs; based on the address information of the memory error in the abnormal log, counting the running state characteristics of the memory in the equipment; fusing the time sequence change feature and the operation state feature into a memory fault early warning feature; and processing the memory fault early warning characteristics by using an early warning model to obtain memory early warning information of the equipment.

Description

technical field [0001] The invention relates to the technical field of computers, in particular to a method and device for early warning of memory faults. Background technique [0002] In large-scale data centers, maintaining server stability is crucial. Memory failures can lead to server performance degradation or even downtime, greatly affecting server reliability. Active fault-tolerant technology is an important means to improve the stability of data centers and systems. Fault prediction is an important aspect of active fault-tolerant technology. By monitoring the current and historical status of the server, the research determines whether it will fail within a certain period of time in the future, thereby improving the system. reliability. [0003] The proportion of downtime caused by memory failures has exceeded the proportion of other hardware failures, such as downtime caused by CPU, hard disk and other failures. Therefore, accurate early warning of memory failures, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/34
CPCG06F11/3452G06F11/3476
Inventor 陈品安张羽中李昕何诚王式文杜垠
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products