Fragment rewriting method for data repetition removing system

A data and fragmentation technology, which is applied to the redundancy in the operation for data error detection, response error generation, input/output to the record carrier, etc. question

Active Publication Date: 2013-12-25
HUAZHONG UNIV OF SCI & TECH
View PDF3 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The object of the present invention is to propose an optimized fragment rewriting method for the above defects or improvement needs of the prior art, which selectively rewrites data blocks determined to be fragments, thereby reducing unnecessary fragment data. Rewriting, to solve the technical problems of deduplication rate reduction and reading performance reduction caused by a large number of fragment rewriting existing in the current data deduplication system, compared with the existing fragment rewriting algorithm, it has higher read performance and Deduplication rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fragment rewriting method for data repetition removing system
  • Fragment rewriting method for data repetition removing system
  • Fragment rewriting method for data repetition removing system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. The description herein is only used to explain the present invention when referring to specific examples, and does not limit the present invention.

[0032] The method of the present invention can be applied to a system applying data deduplication technology, such as a backup storage system based on data deduplication technology, an archive storage system, a file system, and the like. For the convenience of description, in this embodiment, the method of the present invention is preferably described with a backup storage system applying data deduplication technology, but the method in the present invention is not limited to the above-mentioned backup storage system, and is also applicable to such as archive storage systems, In systems and methods such as a file syste...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fragment rewriting method for a data repetition removing system. A cache as a rewriting sensing cache is added in data removing; data items in the rewriting sensing cache are container identifications referenced by a data block; and if the container identifications referenced by the data block which is confirmed as a data fragment are stored in the rewriting sensing cache, data are not required to be rewritten, or else, the data fragment is required to be rewritten. According to the fragment rewriting method, the cache of which the time is the same with the reading time of the data and the cache strategies are the same is added in the data repetition removing process, unnecessary repeated data block rewriting is avoided, and the shortcoming of the existing arithmetic is overcome. Compared with the existing fragment removing arithmetic, the fragment rewriting method has the advantages that influence on reading performance due to the data fragment is reduced, and the reading performance is improved by rewriting the data fragment; and less repetition rate loss is guaranteed while the reading performance is improved.

Description

technical field [0001] The invention belongs to the field of computer information storage, and in particular relates to a fragment rewriting method based on a data deduplication technology system. Background technique [0002] Data deduplication technology (data deduplication technology), as a reduction technology that can identify and eliminate redundant data and store only a single copy of data, is widely used in backup storage systems, archive storage systems, and even file systems. For example, using data deduplication technology can eliminate 80% to 90% of redundant data in backup storage systems and archive storage systems, up to 80% of redundant data can be eliminated in virtual machine backup, and 3 / 4 file space overhead and 87% backup image overhead. [0003] However, in the system based on data deduplication, the data blocks of the subsequently stored files share the data blocks of the previously stored files, so that the data blocks are scattered rather than con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/14G06F3/06
Inventor 刘景宁冯丹周鹏举许蔚付忞
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products