Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Memory error recovery method and device

A memory error and recovery method technology, applied in the field of operating systems, can solve problems such as memory uncorrectable errors and downtime, and achieve the effect of avoiding uncorrectable errors and downtime.

Inactive Publication Date: 2017-02-22
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
View PDF6 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Memory granules that have continuously occurred correctable errors are usually physically damaged. Damage to the memory on the granules will lead to more serious uncorrectable errors, and may even cause downtime problems. However, there is no relevant prevention in the prior art measure

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Memory error recovery method and device
  • Memory error recovery method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The core idea of ​​the present invention is to provide a memory error recovery method and device, which can prevent the memory particles that have continuously occurred correctable errors from being called by the operating system, thereby preventing the memory on the damaged particle from causing more serious uncorrectable errors. to avoid downtime.

[0041] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0042] The first memory error recovery method provided by the embodiment of the present application is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The application discloses a memory error recovery method and device, wherein the method comprises: adding up memory errors correctable; if the number of the memory errors correctable reaches a preset threshold, determining a physical address range affected by damaged memory particles causing the memory errors correctable; killing all processes using the physical address range, and releasing corresponding memory addresses; subjecting all pages comprising the memory addresses to offline processing; restarting all the killed processes. The method and device enable the memory particles already experiencing continuous correctable errors to be called again by an operating system, and accordingly more severe non-correctable errors due to the memory on the damaged particles are avoided to avoid downtime.

Description

technical field [0001] The invention belongs to the technical field of operating systems, and in particular relates to a memory error recovery method and device. Background technique [0002] With the promotion of big data-related industries, IT suppliers are using more and more large-scale servers to provide corresponding computing and storage capabilities. Taking the mainstream 4-socket server as an example, the maximum configuration is 96 pieces of memory. In the data center, the number of tens of thousands of sets makes the number of memory in the data center increase sharply. Along with the increase in the number, there is also the probability of memory errors. Memory errors are divided into correctable memory errors and uncorrectable memory errors. The memory itself has ECC error correction logic. A correctable error means that an error of 1 bit detected by the memory can be corrected. Errors of more than one bit are collectively referred to as uncorrectable errors. ....

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/07G06F11/10
CPCG06F11/0751G06F11/073G06F11/0793G06F11/1008
Inventor 黄家明乔英良曹光耀
Owner ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products