Method, apparatus and system for data reconstruction in distributed storage system

A technology of distributed storage and data reconstruction, applied in the IT field, can solve problems such as reconstruction, and achieve the effect of reducing consumption, avoiding computing resource consumption, and avoiding network resource consumption

Active Publication Date: 2017-05-10
HUAWEI CLOUD COMPUTING TECH CO LTD
View PDF4 Cites 78 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This application describes a method, device, and system for data reconstruction in a distributed storage system, which solves the problem of rebuilding data on a faulty node after the faulty node is restored, and does not need to use EC back-coding to restore data, reducing computing power. Resource and network resource consumption

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, apparatus and system for data reconstruction in distributed storage system
  • Method, apparatus and system for data reconstruction in distributed storage system
  • Method, apparatus and system for data reconstruction in distributed storage system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0044] The network architecture and business scenarios described in the embodiments of the present invention are to more clearly illustrate the technical solutions of the embodiments of the present invention, and do not constitute limitations on the technical solutions provided by the embodiments of the present invention. The evolution of the architecture and the emergence of new business scenarios, the technical solutions provided by the embodiments of the present invention are also ap...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method, apparatus and system for data reconstruction in a distributed storage system. A main storage node in the distributed storage system performs EC coding on the data to be written to generate an EC stripe; each EC block on the EC stripe is stored on each storage node; when part of the storage nodes cause failure of writing of EC blocks because of faults, the main storage node stores the EC blocks distributed to the storage nodes which fails to write in the local, and generates metadata information required by data reconstruction; and when the faults of the storage nodes recover, the main storage node retransmit the stored EC blocks distributed to the storage nodes and the metadata information corresponding to the EC blocks to the storage nodes so as to enable the storage nodes after recovering the faults to complete data reconstruction. The scheme for data reconstruction in the distributed storage system has the advantages that when part of the storage nodes malfunction, EC reversal coding is not required to recover the data on the fault nodes, but the buffer of main storage node is distributed to the EC blocks with fault nodes; after the fault nodes recover, the buffered EC blocks are retransmitted to the fault nodes to perform data reconstruction, thus avoiding computing resource consumption because of performing EC reversal coding when the storage nodes recover fault to perform data reconstruction, and avoiding consumption of network resource because large amount of data is transmitted during the EC reversal coding process at the same time.

Description

technical field [0001] The present invention relates to the field of IT technology, in particular to a data reconstruction method, device and system in a distributed storage system. Background technique [0002] In a storage system, in order to ensure data security, a multi-copy storage technology is usually used to implement data redundancy backup. Multi-copy redundancy technology is to store multiple identical copies of a piece of data at the same time. When one piece of data is lost, the lost data can be recovered through the data of other copies, thereby reducing the probability of data loss. The increase in the number of copies will greatly increase the consumption of system storage space and network bandwidth, thereby increasing the cost of data storage. For example, in the case of two copies, the real available space for users is 50% of the total storage space of the entire system, while in the case of three copies, the real available space for users is only 33%. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06
CPCG06F3/0608G06F3/064G06F3/067H03M13/3761H03M13/1515H03M13/154G06F11/1076G06F3/0619G06F3/0647G06F11/1088G06F11/2094
Inventor 曾永强
Owner HUAWEI CLOUD COMPUTING TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products