Data recovery method, device and system based on distributed system

A data recovery technique for distributed systems, applied in the computer field. It addresses the inability to take a full backup of all data, the difficulty of performing data recovery in a distributed system, and the unsuitability of traditional data recovery methods for distributed systems, so that data in a distributed system can be recovered.

Active Publication Date: 2020-12-25
BEIJING QIHOO TECH CO LTD

AI Technical Summary

Problems solved by technology

Because a complete piece of data in a distributed system is spread across multiple data sharding clusters, it is currently impossible to perform a single full backup covering all data stored in every data sharding cluster. Traditional data recovery methods are therefore unsuitable for distributed systems, and data recovery in a distributed system is difficult.

Examples

Embodiment 1

[0022] Figure 1 shows a flow chart of a data recovery method based on a distributed system provided by Embodiment 1 of the present invention. As shown in Figure 1, the method includes the following steps:

[0023] Step S110: Determine, separately for each data sharding cluster, whether the backup time point of its data backup is consistent with the recovery time point of the data to be restored.

[0024] The distributed system in this embodiment includes at least a plurality of distributed data sharding clusters. Specifically, in a distributed system, in order to increase the storage capacity, the data to be stored can be divided into multiple shards, and each shard is stored in a different data sharding cluster. The number of data sharding clusters can therefore be set or adjusted according to the amount of data to be stored, and is not limited in the present invention.
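The check in step S110 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function name `split_by_consistency`, the cluster names, and the choice to model "consistent" as exact timestamp equality are all hypothetical.

```python
from datetime import datetime


def split_by_consistency(backup_times, recovery_time):
    """Partition clusters by whether their backup time equals the recovery time.

    backup_times: dict mapping a cluster name to the datetime of its backup.
    Returns (consistent, inconsistent) lists of cluster names, sorted by name.
    Assumption: "consistent" is modelled as exact equality of timestamps.
    """
    consistent, inconsistent = [], []
    for name in sorted(backup_times):
        if backup_times[name] == recovery_time:
            consistent.append(name)
        else:
            inconsistent.append(name)
    return consistent, inconsistent


# Hypothetical example: three sharding clusters backed up at different times.
recovery_time = datetime(2020, 1, 1, 12, 0, 0)
backup_times = {
    "shard-a": datetime(2020, 1, 1, 12, 0, 0),
    "shard-b": datetime(2020, 1, 1, 11, 58, 30),
    "shard-c": datetime(2020, 1, 1, 11, 59, 10),
}
consistent, inconsistent = split_by_consistency(backup_times, recovery_time)
```

In this example only `shard-a` was backed up exactly at the recovery time point, so `shard-b` and `shard-c` are the clusters that would need further processing.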

[0025] Specifically, in this embodiment, each data ...

Embodiment 2

[0035] Before introducing this embodiment, the distributed system used in it is briefly described. As shown in Figure 2a and Figure 2b, the distributed system in this embodiment includes at least a plurality of distributed data sharding clusters. Figure 2a shows a schematic structural diagram of the distributed system 200 in the present invention; Figure 2b shows a schematic diagram of the data nodes included in each data sharding cluster in the distributed system 200.

[0036] Specifically, the distributed system 200 includes a routing node 210, a configuration node (also called a config node) 220, and multiple data sharding clusters 230 connected to the routing node 210. The distributed system 200 illustrated in Figure 2a includes one routing node 210 and one configuration node 220. However, in a specific implementation, in order to improve the disaster recovery capability, there may be multiple routing nodes 210 and configuration nodes 220 ...

Embodiment 3

[0067] Figure 4 shows a structural block diagram of a data recovery device based on a distributed system provided by Embodiment 3 of the present invention. As shown in Figure 4, the device includes: a consistency determination module 41, a query module 42, an operation record determination module 43, a data recovery module 44, a backup module 45 and a balanced log acquisition module 46.

[0068] The distributed system in this embodiment is the same as the distributed system in Embodiment 1 and includes at least a plurality of distributed data sharding clusters. Each module in this embodiment is described in detail below.

[0069] The consistency determination module 41 is adapted to determine, separately for each data sharding cluster, whether the backup time point of its data backup is consistent with the recovery time point of the data to be restored.

[0070] Specifically, for the determination of the backup time ...

Abstract

The present invention discloses a data recovery method, apparatus and system based on a distributed system. The method comprises: determining whether a backup time point corresponding to each data segment cluster during data backup is consistent with a recovery time point corresponding to to-be-recovered data; for the data segment cluster whose backup time point is inconsistent with a recovery time point, searching for a configuration log used for recording metadata information corresponding to each data segment cluster; according to the configuration log, determining a data operation record corresponding to the data segment cluster whose backup time point is inconsistent with the recovery time point between the backup time point and the recovery time point; and performing data recovery according to the data operation record and backup data corresponding to the data segment cluster whose backup time point is inconsistent with the recovery time point. According to the method provided by the present invention, data backup can be performed for each data segment cluster, so that data recovery of the distributed system is realized accurately and efficiently.
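The flow described in the abstract can be sketched end to end as below. This is only an illustration under stated assumptions: the source does not specify the format of the configuration log, so it is modelled here as a flat list of hypothetical `OpRecord` entries, backups are plain dictionaries, timestamps are integers, and every backup is assumed to have been taken at or before the recovery time point.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OpRecord:
    """One data operation taken from the (hypothetical) configuration log."""
    cluster: str   # data segment cluster the operation applies to
    ts: int        # time of the operation, as an integer timestamp
    key: str
    value: object  # None is used here to mean "delete the key"


def recover(backups, backup_times, config_log, recovery_time):
    """Rebuild each cluster's data as of recovery_time.

    backups: cluster -> dict snapshot taken at backup_times[cluster].
    config_log: iterable of OpRecord, in any order.
    Assumption: every backup was taken at or before recovery_time.
    """
    restored = {}
    for cluster, snapshot in backups.items():
        data = dict(snapshot)  # copy, so the stored backup is never mutated
        t_backup = backup_times[cluster]
        if t_backup != recovery_time:
            # Operation records between the backup and recovery time points.
            ops = sorted(
                (op for op in config_log
                 if op.cluster == cluster and t_backup < op.ts <= recovery_time),
                key=lambda op: op.ts,
            )
            for op in ops:
                if op.value is None:
                    data.pop(op.key, None)
                else:
                    data[op.key] = op.value
        restored[cluster] = data
    return restored


# Hypothetical example: shard-a is already consistent, shard-b lags behind.
backups = {"shard-a": {"k1": 1}, "shard-b": {"k2": 2}}
backup_times = {"shard-a": 100, "shard-b": 90}
config_log = [
    OpRecord("shard-b", 95, "k2", 20),
    OpRecord("shard-b", 98, "k3", 3),
    OpRecord("shard-b", 120, "k3", 99),  # after the recovery point, ignored
    OpRecord("shard-a", 99, "k1", 0),    # before shard-a's backup, ignored
]
restored = recover(backups, backup_times, config_log, recovery_time=100)
```

Clusters whose backup time point already matches the recovery time point are returned from their backup unchanged; only the lagging cluster has operation records replayed onto it.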

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a data recovery method, device and system based on a distributed system.

Background

[0002] Data storage and recovery play a vital role in Internet business. Whether after a misoperation or a storage system crash, data recovery is required. Traditional data recovery methods are based on data backup. For example, in a traditional non-distributed system, all data in the storage system is backed up to obtain the full data of the storage system as of a certain point in time; then, starting from that full data and following the log file, all data in the storage system is restored to the storage state corresponding to the specified time point.

[0003] In the process of implementing the embodiments of the present invention, the inventors found at least the following problems in the prior art: With the continuous development ...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F11/14
CPC: G06F11/1448, G06F11/1464
Inventor: 李丹
Owner: BEIJING QIHOO TECH CO LTD