A Fragment Rewriting Method Used in Data Deduplication System

A fragmentation and data technology, applied in the field of computer information storage, can solve the problems of reducing read performance, reducing, and fragmenting rewriting and deduplication rate.

Active Publication Date: 2016-08-31
HUAZHONG UNIV OF SCI & TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The object of the present invention is to propose an optimized fragment rewriting method for the above defects or improvement needs of the prior art, which selectively rewrites data blocks determined to be fragments, thereby reducing unnecessary fragment data. Rewriting, to solve the technical problems of deduplication rate reduction and reading performance reduction caused by a large number of fragment rewriting existing in the current data deduplication system, compared with the existing fragment rewriting algorithm, it has higher read performance and Deduplication rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Fragment Rewriting Method Used in Data Deduplication System
  • A Fragment Rewriting Method Used in Data Deduplication System
  • A Fragment Rewriting Method Used in Data Deduplication System

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. The description herein is only used to explain the present invention when referring to specific examples, and does not limit the present invention.

[0032] The method of the present invention can be applied to a system applying data deduplication technology, such as a backup storage system based on data deduplication technology, an archive storage system, a file system, and the like. For the convenience of description, in this embodiment, the method of the present invention is preferably described with a backup storage system applying data deduplication technology, but the method in the present invention is not limited to the above-mentioned backup storage system, and is also applicable to such as archive storage systems, In systems and methods such as a file syste...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fragment rewriting method for a data deduplication system, which adds a cache as a rewrite-aware cache in the data deduplication, and a data item in the rewrite-aware cache is a container identifier referenced by a data block (Container ID), for a data block determined to be a data fragment, if the container ID (Container ID) referenced by it has been stored in the rewrite-aware cache, then the data does not need to be rewritten; otherwise, the data fragment is rewritten. The method of the invention adds a cache with the same size and the same cache strategy as the data read in the process of data deduplication, avoids unnecessary rewriting of repeated data blocks, and overcomes the defects of existing algorithms. Compared with the existing de-fragmentation algorithm, one is to improve the impact of data fragmentation on read performance and improve read performance by rewriting data fragmentation; the other is to ensure less deduplication rate while improving read performance loss.

Description

technical field [0001] The invention belongs to the field of computer information storage, and in particular relates to a fragment rewriting method based on a data deduplication technology system. Background technique [0002] Data deduplication technology (data deduplication technology), as a reduction technology that can identify and eliminate redundant data and store only a single copy of data, is widely used in backup storage systems, archive storage systems, and even file systems. For example, using data deduplication technology can eliminate 80% to 90% of redundant data in backup storage systems and archive storage systems, up to 80% of redundant data can be eliminated in virtual machine backup, and 3 / 4 file space overhead and 87% backup image overhead. [0003] However, in the system based on data deduplication, the data blocks of the subsequently stored files share the data blocks of the previously stored files, so that the data blocks are scattered rather than con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F11/14G06F3/06
Inventor 刘景宁冯丹周鹏举许蔚付忞
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products