Check patentability & draft patents in minutes with Patsnap Eureka AI!

File repeat data delete method and device

A technology for deduplication and file, which is applied in electrical digital data processing, special data processing applications, instruments, etc. It can solve the problem of difficulty in determining the size of data blocks and low speed of deduplication, so as to eliminate the impact and speed up the speed. Effect

Active Publication Date: 2016-03-30
IBM CORP
View PDF7 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] Although the content-based segmentation method and the sliding block segmentation method can solve this problem, it is difficult to determine the size of the data block, resulting in slower data deduplication than the fixed-length segmentation method.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • File repeat data delete method and device
  • File repeat data delete method and device
  • File repeat data delete method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Preferred embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although preferred embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0023] figure 1 A block diagram of an exemplary computer system / server 12 suitable for use in implementing embodiments of the invention is shown. figure 1 The computer system / server 12 shown is only an example and should not impose any limitation on the functions and scope of use of the embodiments of the present invention.

[0024] Such as figure 1 As shown, computer system / server 12 takes the form of a general purpose computing device. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a file repeat data delete method and device; the method comprises the following steps: firstly cutting a file into at least one combination data block, wherein the combination data block comprises a fixed length block and a variable length block, and the variable length block is determined according to the file; carrying out repeat data delete operations for the at least one combination data block. The method can cut the file into a plurality of combination data blocks containing fixed and variable length blocks, thus fast executing repeat data delete operation with high efficiency.

Description

technical field [0001] The present invention relates to deduplication technology, and more specifically, to a method and device for deduplication of files. Background technique [0002] Data deduplication is a data reduction technique widely used in the data backup and archiving process, which eliminates redundant data by removing duplicate data in data sets (for example, files), thereby reducing the storage capacity used in the storage space . [0003] Generally, redundant data can be divided into three types: file-level redundant data, data block-level redundant data, and byte-level redundant data. In file-level types, redundant data is the entire file, which means that the file is duplicated with other files. In the block-level type, redundant data is certain data blocks within a file, which means that there are identical data blocks between different files. In byte-level types, redundant data is finer-grained data represented by bytes. [0004] Corresponding data ded...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/1752
Inventor 朱国峰
Owner IBM CORP
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More