Method for cluster system data fault tolerance

A cluster system and data technology, applied in the direction of response error generation, redundant code error detection, etc., can solve the problems of poor RAID6 write performance, physical disk damage, complex implementation, etc., to simplify the process, reduce the burden, and solve the problem of high The effect of the fault tolerance problem

Inactive Publication Date: 2008-08-27
LANGCHAO ELECTRONIC INFORMATION IND CO LTD
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, a larger disk space needs to be allocated to the error check block, and there is a greater "write loss" compared with RAID5. Due to the poor write performance and complicated implementation of RAID6, it is difficult to implement RAID6
In order to overcome the above technical deficiencies, there must be a disk fault-tolerant method for high-performance cluster systems, which can use simple XOR operations to quickly solve the problem of data loss due to physical disk damage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for cluster system data fault tolerance
  • Method for cluster system data fault tolerance
  • Method for cluster system data fault tolerance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The method is further described below in conjunction with the accompanying drawings:

[0024] In the scientific computing cluster system, a large amount of computing data is frequently read and written on the storage device, which poses certain risks to the security of the data. The probability of security risks caused by frequent operations increases, and a better guarantee mechanism must be in place. To ensure the security of data, when a man-made or non-man-made data disaster occurs, it is very important to recover data quickly and effectively.

[0025] The storage structure framework of the cluster system is shown in Figure 3. Nodes such as computer nodes and management nodes perform read and write operations on disk array data through the storage manager and system bus or I / O bus.

[0026] In network storage, the structure of the disk array is divided as shown in Figure 4. The disk is divided into blocks. In this figure, a total of N disks form a disk array, and ea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data fault tolerance method applicable for a cluster system; when check data is constructed and generated, a data block is transferred to a data reproduction processor from the data stored in a disk driver of a storage unit array; through an exclusive OR operation, the check data is generated and written out to a corresponding check data block, or the data in the data block and the data in the check data block are operated and written out to a corresponding data block. When the data is lost caused by the physical damage of the disk or other causes, the data in the undamaged data block of a connecting relation chain is read to a data regeneration manager and operated to generate lost data; therefore, the reproduced and lost data is written to corresponding data block of a corresponding backup disk or an original data storage data block position, thereby realizing the regeneration and reconstruction of the lost data. Through implementing the invention, the data in a damaged disk of a disk array is restored and reconstructed in time; further the lost data in a single disk or a plurality of disks are reproduced or regenerated in the disk array.

Description

technical field [0001] The invention relates to a fault tolerance method of a disk array, in particular to a disaster recovery method for disk physical damage or data damage using technologies such as disk array or network storage in a cluster system. Background technique [0002] In a high-performance cluster system, a large number of computing nodes operate on storage devices, and users frequently perform data access operations on the cluster through terminals. Frequent reading and writing increases the probability of data loss caused by physical disk damage or misoperation. , how to ensure the security of data is particularly important, and the current fault-tolerant technology more or less has some insufficient I / O read and write efficiency, time efficiency, space efficiency, etc., even when the physical damage of the disk at the same time exceeds Just reach the situation that prior art is powerless when two pieces. [0003] There are currently many solutions to solve d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/08
Inventor 宁雄雁魏健李刚王守昊
Owner LANGCHAO ELECTRONIC INFORMATION IND CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products