Distributed storage method, device and system

A distributed storage, physical storage node technology, applied in the field of distributed storage, can solve problems such as unrecoverable, data loss, and the unavailability of the entire file

Inactive Publication Date: 2014-08-13
HUAWEI TECH CO LTD
View PDF5 Cites 84 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the randomness of DHT, it is unavoidable to deploy multiple data blocks of the same data slice on the same physical storage node, so it is unavoidable that a single physical storage node (for example, when a rack, server, or hard disk) fails to cause data risk of loss
For example, using M+N erasure coding technology, where M is the number of data blocks and N is the number of check blocks, when more than N+1 data blocks or check blocks are deployed on the same hard disk, The hard disk failure will result in the loss of the above M d...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed storage method, device and system
  • Distributed storage method, device and system
  • Distributed storage method, device and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0055] In a distributed storage system, multiple hard disks can be constructed as a storage array, and the storage space of each hard disk can be divided into multiple virtual nodes (Virtual Node, VN), also called partitions (Partition). Each virtual node has an independent number, and virtual nodes usually have the same storage space. For example, virtual nodes with logical numbers are randomly deployed on hard disks with physical numbers.

[0056] Wh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Provided are a distributed data storage method, device and system. The method comprises: splitting a data file, generating K data slices, splitting each data slice of the K data slices, generating M data blocks for each data slice, performing check encoding on the M data blocks using a redundancy algorithm, and generating N check blocks (110); determining a first physical storage node corresponding to a block of the M data blocks and the N check blocks using a random algorithm, and determining at least M+1 different physical storage nodes according to a first ranking manner conforming to a rule, based on the first determined physical storage node (120); and respectively storing at least M+1 blocks of the M data blocks and the N check blocks on the at least M+1 different storage nodes, with K, M and N being integers (130). The method, device and system can improve the reliability of distributed storage systems.

Description

technical field [0001] The present invention relates to the field of data storage, in particular to a distributed storage method, device and system. Background technique [0002] In a distributed storage system, in order to achieve reliability, file-level multi-copy redundancy technology or data block-level redundancy coding technology, such as erasure coding (Erasure Coding) technology, is used. In multi-copy redundancy technology, the probability of data loss can be reduced by storing multiple identical copies of a data file, while in redundant coding technology, a check block can be added to any local data in the file to improve reliability. [0003] Usually, a distributed hash table (Distributed Hash Table, DHT) can be used to store data blocks and check blocks. However, due to the randomness of DHT, it is unavoidable to deploy multiple data blocks of the same data slice on the same physical storage node, so it is unavoidable that a single physical storage node (for ex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/14G06F12/08
CPCG06F12/02G06F11/1088
Inventor 冯斌黄澄龚学文
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products