Unlock instant, AI-driven research and patent intelligence for your innovation.

A data layout method to improve the recovery performance of deduplication backup system

A technology of data deduplication and backup system, which is applied to data error detection, electronic digital data processing, and response error generation in the direction of redundancy in computing, and can solve the problem of irrespective of the specific storage address of valid data blocks and recovery performance. Without effective improvement, unable to accurately locate data fragments and other problems, achieve high deduplication rate and recovery performance, improve recovery performance, and high deduplication rate

Active Publication Date: 2019-05-17
CHONGQING UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although data recovery performance can be improved to a certain extent by rewriting such fragments, this fragmentation identification method only focuses on the total amount of valid data blocks in the container, without considering the specific storage address of each valid data block, and cannot accurately locate the data Fragmentation, resulting in excessive data rewriting, and recovery performance cannot be effectively improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data layout method to improve the recovery performance of deduplication backup system
  • A data layout method to improve the recovery performance of deduplication backup system
  • A data layout method to improve the recovery performance of deduplication backup system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] figure 1 It is a schematic diagram of fragment identification in the present invention. The basic unit of fragment identification is the data group. Each data group consists of a certain amount of data blocks with adjacent storage addresses. If in a group, when restoring or reading a data set object (the data object can refer to a backup file or a backup data stream), the transmission speed of valid data blocks in this group is lower than the transmission speed expected by the user, Then the valid data blocks in the group are identified as data fragments, otherwise, the valid data blocks in the group are not data fragments.figure 1 The sum of the shaded parts in the middle is the total size x of valid data blocks in the group, including x1, x2, x3, x4, x=x1+x2+x3+x4. y represents the total amount of data required to read the valid data x, that is, the total amount of data stored between the minimum storage address and the maximum storage address of the valid data bloc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a data block storage address based data layout method to improve the recovery performance of a data de-duplication backup system. This method takes full account of the specific storage location of each data block and combines the bandwidth of a disk and the path seeking time to calculate the recovery speed of the data at the time of backup. If the recovery speed meets a user's requirement, the corresponding data does not belong to a data fragment. And corresponding data belongs to a data fragment the other way around. Different from the existing methods, this method is a data layout method based on data block storage addresses, and uses a more fine-grained fragmentation recognition approach to accurately identify each fragment. In this way, a higher recovery rate and data recovery performance than from other methods can be obtained.

Description

technical field [0001] The invention belongs to the technical field of computer information storage, and relates to a data layout method based on data block storage addresses to improve the recovery performance of a duplicate data deletion backup system. Background technique [0002] With the advent of the information age, data has grown explosively, and IDC predicts that 44ZB of data will be generated by 2020. The backup system needs to store more and more backup data. How to use limited storage resources to efficiently store PB-level or even EB-level data is an urgent problem to be solved. Data deduplication technology is an important technology to reduce data storage costs by eliminating redundant data on a large scale. Data deduplication technology is often used in data backup systems to delete duplicate stored data blocks in the backup system to save storage space. However, although this technology can save storage costs, after the duplicate data blocks are deleted, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/14
CPCG06F11/1469
Inventor 谭玉娟文舰晏志超
Owner CHONGQING UNIV