Method and system for increasing parallelism of disk accesses when restoring data in a disk array system

a disk array system and parallelism technology, applied in the field of data protection methods for data storage, can solve the problems of increasing cpu overhead, complicated calculations that are also repetitive, and adding additional hardware required, so as to achieve less overhead and increase parallelism.

Inactive Publication Date: 2006-06-08
IBM CORP
View PDF23 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0018] In another aspect, parallelism may be increased in a disk array system by overlapping disk accesses associated with different parity stripes when restoring data in a disk array (e.g., to resynchronize parity and data, or to rebuild data for an exposed disk). Specifically, consistent with this aspect of the invention, restoring data to a disk array may include the retrieval of a first set of data associated with a first parity stripe, coupled with concurrent operations of writing to the disk array a result value generated by processing the first set of data, and reading from the disk array a second set of data associated with a second parity stripe. By overlapping read and write accesses associated with different parity stripes, data associated with multiple parity stripes may be restored with less overhead than if the accesses and operations associated with restoring data to different parity stripes were performed sequentially.

Problems solved by technology

In the former instance, the RAID algorithms are packaged into separate controller hardware coupled to the computer input / output (“I / O”) bus and, although adding little or no central processing unit (“CPU”) overhead, the additional hardware required nevertheless adds to the overall system cost.
On the other hand, software implementations incorporate the RAID algorithms into system software executed by the main processor together with the operating system, obviating the need and cost of a separate hardware controller, yet adding to CPU overhead.
The mathematics of implementing RAID-6 involve complicated calculations that are also repetitive.
For example, one limitation of existing RAID-6 designs relates to the performance overhead associated with performing resync (where parity data for a parity stripe is resynchronized with the current data), rebuild (where data from a faulty or missing drive is regenerated based upon the parity data) or other exposed mode operations such as exposed mode reads.
In each of these exposed mode operations, however, the requirements of reading data from certain disks and writing data back to certain disks results in substantial performance overhead, specifically with respect to the sequential nature of the various disk access operations on the disk array.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for increasing parallelism of disk accesses when restoring data in a disk array system
  • Method and system for increasing parallelism of disk accesses when restoring data in a disk array system
  • Method and system for increasing parallelism of disk accesses when restoring data in a disk array system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The embodiments discussed hereinafter utilize one or both of two techniques to increase parallelism and otherwise reduce the overhead associated with restoring data in a disk array environment such as a RAID-6 environment. One technique described hereinafter selects different subsets of disks to access in connection with an operation such as a rebuild or exposed read operation. Another technique described hereinafter overlaps read and write accesses associated with restoration operations performed with respect to multiple parity stripes.

[0024] Presented hereinafter are a number of embodiments of a disk array environment implementing the aforementioned techniques. However, prior to discussing such embodiments, a brief background on RAID-6 is provided, followed by a description of an exemplary hardware environment within which the aforementioned techniques may be implemented.

General RAID-6 Background

[0025] The nomenclature used herein to describe RAID-6 storage systems conf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

In a disk array environment such as a RAID-6 environment, the overall performance overhead associated with exposed mode operations such as resynchronization, rebuild and exposed mode read operations is reduced through increased parallelism. By selecting only subsets of the possible disks required to solve a parity stripe equation for a particular parity stripe, accesses to one or more disks in a disk array may be omitted, thus freeing the omitted disks to perform other disk accesses. In addition, disk accesses associated with different parity stripes may be overlapped such that the retrieval of data necessary for restoring data for one parity stripe is performed concurrently with the storage of restored data for another parity stripe.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS [0001] This application is related to the following U.S. patent applications all filed on even date herewith by Carl Edward Forhan, Robert Edward Galbraith and Adrian Cuenin Gerhard: Ser. No. ______, entitled “METHOD AND SYSTEM FOR ENHANCED ERROR IDENTIFICATION WITH DISK ARRAY PARITY CHECKING,” Ser. No. ______, entitled “RAID ENVIRONMENT INCORPORATING HARDWARE-BASED FINITE FIELD MULTIPLIER FOR ON-THE-FLY XOR,” Ser. No. ______, entitled “METHOD AND SYSTEM FOR IMPROVED BUFFER UTILIZATION FOR DISK ARRAY PARITY UPDATES,” and Ser. No. ______, entitled “METHOD AND SYSTEM FOR RECOVERING FROM ABNORMAL INTERRUPTION OF A PARITY UPDATE OPERATION IN A DISK ARRAY SYSTEM.” Each of these applications is incorporated by reference herein.FIELD OF THE INVENTION [0002] The present invention relates to data protection methods for data storage and, more particularly, to systems implementing RAID-6 and similar data protection and recovery strategies. BACKGROUND OF ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): H03M13/00
CPCG06F11/1092G06F2211/1052G06F2211/1057
Inventor FORHAN, CARL EDWARDGALBRAITH, ROBERT EDWARDGERHARD, ADRIAN CUENIN
Owner IBM CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products