A backup cluster based on two levels of data deduplication two-level data de-duplication

A technology of data backup and data, which is applied in the direction of data error detection and response error generation, which can solve the problem that duplicate data blocks cannot be eliminated.

Active Publication Date: 2019-02-19
苏州爱洛克信息技术有限公司
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This technology can cluster similar data to the same n...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A backup cluster based on two levels of data deduplication two-level data de-duplication
  • A backup cluster based on two levels of data deduplication two-level data de-duplication
  • A backup cluster based on two levels of data deduplication two-level data de-duplication

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0175] The invention discloses a backup cluster based on two-level data deduplication, such as figure 1 As shown, it includes multiple storage nodes interconnected through the storage network, and each storage node can receive data sent by the client and back up the data to the cluster, or restore specified data from the cluster.

[0176] As shown in Figure 2, a software system based on two-level data deduplication and a disk device are installed on the storage node; the software system based on two-level data deduplication is divided into the upper layer responsible for the first level of data deduplication, namely the First-level data deduplication storage system, the bottom layer responsible for second-level data deduplication, that is, the Delta compression storage component; the disk device is equipped with a data block sub-index, a container index, and a container storage pool; the data block sub-index is composed of block-level The data deduplication storage system is m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a backup cluster based on two-level data de-duplication. The backup cluster comprises a plurality of storage nodes, which are interconnected by a storage network and cooperatewith each other to complete data backup and recovery. The first level of two-level data de-duplication adopts block compression technology, which not only eliminates the duplicate data blocks within and between nodes in the whole backup cluster, but also clusters new data blocks with similar contents to the same storage node. The second level is completed by each storage node separately. Delta compression technology based on block-level data de-duplication is used to compress the similar data blocks in this storage node to eliminate the byte-level duplicate data. The invention adopts an effective fingerprint query algorithm and a data buffer technology, has good data backup and recovery performance, simultaneously supports online data migration, and has strong scalability of backup clusters.

Description

technical field [0001] The invention belongs to the technical field of computer storage backup, in particular to a backup cluster based on two-level data deduplication. Background technique [0002] With the explosive growth of data, data disaster recovery and backup are facing unprecedented challenges. On the one hand, traditional data protection technologies such as periodic backups, snapshots, continuous data protection, and versioned file systems generate a large amount of duplicate data, which accelerates data growth and forces the storage capacity of the backup system to expand continuously, causing enterprises to face huge costs Stress and data management challenges. On the other hand, due to the increasingly stringent requirements of applications for data protection, the backup window is gradually shortened, and a large amount of data needs to be backed up online and recovered immediately after failure, which places extremely high requirements on system throughput a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F11/14
Inventor 杨天明周书臣杨志强吴海涛黄平樊宜和杨奕
Owner 苏州爱洛克信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products