Method and device for recovering data in distributed block storage system

A distributed block storage technology, applied in the fields of data error detection, transmission systems, and electrical digital data processing. It addresses service unavailability, front-end application hangs, and the restricted application scenarios of distributed block storage, with the effect of increasing data recovery speed and avoiding data access delays.

Pending Publication Date: 2020-08-25
SHANGHAI DRAGONNET TECH +1

AI Technical Summary

Problems solved by technology

When the cluster state changes (for example, a node goes offline or comes back online, a disk fails, or capacity is expanded) and the storage location of some copies of some storage objects changes, the data recovery module generally either pushes data from the primary copy to the other copies or pulls data from other available copies. If a front-end application accesses a data object whose copy has not yet been restored, the write generally has to wait until the data object is restored. This severely degrades front-end data access performance and may cause the front-end application to hang and the service to become unavailable.
The front-end application may write only 4K, while the recovery granularity of data objects is generally 1M or more. The resulting write amplification is severe: when the cluster is abnormal, write performance may drop to one-tenth of that of a normal cluster.
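
The size mismatch between a 4K write and a 1M recovery unit can be made concrete with a little arithmetic. The sketch below assumes binary units (4 KiB and 1 MiB) and that a whole recovery unit is copied before the small write can proceed. The function name is illustrative and does not come from the patent.

```python
def recovery_write_amplification(write_bytes: int, recovery_unit_bytes: int) -> float:
    """Bytes moved by recovery per byte the application asked to write.

    Assumption: the whole recovery unit is copied before the write proceeds.
    """
    if write_bytes <= 0 or recovery_unit_bytes <= 0:
        raise ValueError("sizes must be positive")
    return recovery_unit_bytes / write_bytes


KIB = 1024
MIB = 1024 * KIB

# A 4 KiB application write that first triggers recovery of a 1 MiB object.
amplification = recovery_write_amplification(4 * KIB, 1 * MIB)
print(amplification)  # 256.0
```

This ratio only describes data volume per write. It is not a model of the one-tenth throughput figure quoted above, which depends on factors the text does not spell out.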
At the same time, a storage node that goes offline for a short time does not record incremental writes, so each recovery has to restore a huge amount of data, much of which was never modified while the node was offline.
[0003] In existing public techniques, to reduce the amount of recovered data, incremental writes are recorded in a Journal Log while the cluster is abnormal, or even while it is normal, and a write is performed only after the Journal Log has been persisted to the storage medium. This also has a large impact on access performance.
To balance data recovery against front-end application data access, a recovery QoS is generally configured to limit the data recovery rate and so reduce the impact on front-end access. This setting is generally static, however, and recovery efficiency could be improved by adjusting it dynamically.
[0004] Many of the above problems are unacceptable in performance-sensitive application scenarios such as databases. This limits where distributed block storage can be used, confining it to peripheral business scenarios.

Embodiment Construction

[0041] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments. This embodiment is carried out on the premise of the technical solution of the present invention, and detailed implementation and specific operation process are given, but the protection scope of the present invention is not limited to the following embodiments.

[0042] This embodiment provides a method for data recovery in a distributed block storage system. When events in the cluster such as a node going offline or a disk failing leave too few available data copies, the available copies are restored to the normal level at a reasonable rate chosen according to the latest status of the storage cluster.
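
The excerpt does not say how a "reasonable rate" is derived from the cluster status. As one possible illustration only, the sketch below scales a recovery rate limit linearly with front-end load. The function, its parameters, and the linear policy are all assumptions made for this example, not the patented method.

```python
def recovery_rate_limit(max_rate_mbps: float, min_rate_mbps: float,
                        frontend_load: float) -> float:
    """Hypothetical dynamic recovery limit.

    frontend_load is the fraction of cluster capacity used by front-end I/O
    (0.0 = idle, 1.0 = saturated). An idle cluster recovers at max_rate_mbps,
    a saturated one at min_rate_mbps, with linear interpolation in between.
    """
    if not 0 <= min_rate_mbps <= max_rate_mbps:
        raise ValueError("need 0 <= min_rate_mbps <= max_rate_mbps")
    load = min(max(frontend_load, 0.0), 1.0)  # clamp out-of-range readings
    return max_rate_mbps - (max_rate_mbps - min_rate_mbps) * load


for load in (0.0, 0.5, 1.0):
    print(load, recovery_rate_limit(400.0, 50.0, load))
```

A static recovery QoS corresponds to calling this once and never updating the result. Re-evaluating it as the load changes is the simplest form of the dynamic adjustment the background section asks for.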

[0043] As shown in Figure 1, the distributed block storage system to which this method applies includes a block storage access client, a storage service cluster and a metadata service cluster. The storage service cluster is composed of multiple storag...

Abstract

The invention relates to a method and a device for recovering data in a distributed block storage system. The distributed block storage system comprises a block storage access client, a storage service cluster and a metadata service cluster; the storage service cluster comprises a plurality of storage service nodes. The method comprises the following steps: 1) monitoring the cluster state in real time, and executing step 2) when an abnormality in the cluster is detected; 2) judging whether to delay reconstruction; if so, delaying for a set time and then executing step 3), and if not, executing step 3) directly; 3) establishing a list of data objects to be recovered; and 4) recovering data according to that list, with each storage service node executing the data recovery step in parallel. Compared with the prior art, the method can increase data recovery speed and reduce the impact of data recovery on front-end application performance, among other advantages.
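
The four steps in the abstract can be sketched as a small in-memory simulation. Everything here is an assumption made for illustration: the `StorageNode` class, the rule that any offline node makes the cluster abnormal, the `expect_quick_return` flag that drives the delay decision, and the use of threads to stand in for storage service nodes working in parallel. It shows the control flow only, not the patented implementation.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass
class StorageNode:
    """Hypothetical node: maps object id -> True if the local copy is current."""
    name: str
    online: bool = True
    copies: dict = field(default_factory=dict)


def cluster_is_abnormal(nodes):
    """Step 1 (a single monitoring pass): any offline node counts as abnormal."""
    return any(not n.online for n in nodes)


def build_recovery_list(nodes):
    """Step 3: for each online node, list the objects whose local copy is stale."""
    return {n.name: sorted(oid for oid, ok in n.copies.items() if not ok)
            for n in nodes if n.online}


def recover_on_node(node, object_ids):
    """Step 4 worker: mark stale copies as restored (stand-in for copying data)."""
    for oid in object_ids:
        node.copies[oid] = True
    return len(object_ids)


def run_recovery(nodes, expect_quick_return=False, delay_seconds=0.0):
    """Run steps 1 to 4 once and return the number of restored copies."""
    if not cluster_is_abnormal(nodes):
        return 0
    if expect_quick_return:            # Step 2: assumed delay policy
        time.sleep(delay_seconds)
    todo = build_recovery_list(nodes)
    by_name = {n.name: n for n in nodes}
    with ThreadPoolExecutor(max_workers=max(1, len(todo))) as pool:
        futures = [pool.submit(recover_on_node, by_name[name], ids)
                   for name, ids in todo.items()]
        return sum(f.result() for f in futures)


nodes = [
    StorageNode("a", copies={"obj1": True, "obj2": False}),
    StorageNode("b", copies={"obj1": False}),
    StorageNode("c", online=False),
]
restored = run_recovery(nodes)
print(restored)  # 2
```

Each worker touches only its own node's state, so the parallel step needs no locking in this toy version. A real system would also have to re-check the cluster state after the delay, which the sketch omits.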

Description

Technical field

[0001] The invention relates to the field of computer distributed block storage software systems, in particular to a data recovery method and device in a distributed block storage system.

Background technique

[0002] In a distributed block storage system, a copy mechanism and DHT hashing are generally used to calculate the data storage location. When the cluster state changes (for example, a node goes offline or comes back online, a disk fails, or capacity is expanded) and the storage location of some copies of some storage objects changes, the data recovery module generally either pushes data from the primary copy to the other copies or pulls data from other available copies. If a front-end application accesses a data object whose copy has not yet been restored, the write generally has to wait until the data object is restored, which has a huge impact on the performance of the front-end application d...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/14, H04L29/08
CPC: G06F11/1448, G06F11/1464, H04L67/1097
Inventor: 童文飞, 康亮, 苏玉军, 叶磊, 孙洪标
Owner: SHANGHAI DRAGONNET TECH