Method for quickly merging backup points in data deduplication system

A technology of data deduplication and backup, applied to the redundancy in computing for data error detection, file system, digital data processing, etc., to increase self and overall performance, improve performance, and simplify the process of data merging. Effect

Pending Publication Date: 2022-02-22
CHINA TELECOM DIGITAL INTELLIGENCE TECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] Aiming at the deficiencies in the prior art, the present invention provides a method for fast backup point merging in a deduplication system, which solves the main drawback of the backup point merging in the above-mentioned traditional deduplication system: data blocks must be merged from the deduplication system to the application and back to the deduplication system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for quickly merging backup points in data deduplication system
  • Method for quickly merging backup points in data deduplication system
  • Method for quickly merging backup points in data deduplication system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention is described in further detail now in conjunction with accompanying drawing.

[0034] Before and after the source backup and the target backup are merged, the data blocks in the source backup and the target backup will not change at all. The most important difference here is that the target backup's index entries are updated to point to the new data blocks, not the old ones. To achieve the same result by improving performance, the key idea of ​​this application is to exchange index entries between source and target backups in a data deduplication system. The detailed process is described as follows:

[0035] 1. In the deduplication system, keep the data blocks unchanged, and only exchange index entries between the two backups (refer to the figure image 3 ).

[0036] 2. After the swap, the new indexes are already in the target backup, but they still point to the new references and data blocks. Additionally, the old index also points to old refe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for quickly merging backup points in a data deduplication system. The method comprises the following steps that: 1, a data backup application starts a backup point merging task module in a data deduplication system; 2, the backup point merging task module receives an instruction and obtains a source backup and a target backup which need to be subjected to content merging; 3, the backup point merging task module detects whether data blocks with the same content exist in the source backup and the target backup or whether data blocks needing to be updated and replaced by the target backup exist in the source backup; and if yes, the index entries corresponding to the data blocks meeting the conditions in the source backup and the index entries corresponding to the data blocks meeting the conditions in the target backup file are exchanged; and 4, the backup point merging task module marks and deletes the exchanged indexes in the source backup, the reference corresponding to the indexes and the data blocks. The problem that a data block must be transmitted to the application from the data deduplication system and then returned to the data deduplication system is solved. And meanwhile, the bandwidth can be reduced in a cloud scene.

Description

technical field [0001] The invention relates to the technical field of data storage, in particular to a method for quickly merging backup points in a data deduplication system. Background technique [0002] A typical data deduplication system saves the source data of a backup session as three types of files, namely index files, reference files, and data files. Data files are used to save data blocks, and reference files are used to record the metadata of these data blocks. Therefore, each data file must have one and only one corresponding reference file. Index files, on the other hand, are considered representations of the source data, they do not contain real data, but they can tell where the real data can be found by pointing to reference files. Obviously, entries in the index file can point to multiple reference files, so that subsequent backups can refer to the backed up data blocks, thereby achieving deduplication (see figure 1 ). [0003] Suppose backup 2 has 3 uni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/174G06F16/16G06F16/13G06F11/14
CPCG06F16/1748G06F16/162G06F16/13G06F11/1453
Inventor 魏小进陈世亮丁涛朱庭俊苏莉莉叶萌卓祖金
Owner CHINA TELECOM DIGITAL INTELLIGENCE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products