Repeating data deleting method and device

A technology of data deduplication and data block, which is applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., and can solve problems such as duplicate data deletion by mistake

Active Publication Date: 2012-08-08
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF3 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The invention provides a method and device for deduplication data, whic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Repeating data deleting method and device
  • Repeating data deleting method and device
  • Repeating data deleting method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In the existing data deduplication technology, problems such as accidental deletion and heavy system computing pressure are prone to occur. In order to solve the above-mentioned problem, an embodiment of the present invention provides a data deduplication method. Hereinafter, the embodiments of the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the application and the features in the embodiments can be combined with each other arbitrarily if there is no conflict.

[0037] First, the first embodiment of the present invention will be described.

[0038] The embodiment of the present invention provides an efficient and safe data de-duplication method, using the method to complete the de-duplication process such as figure 1 Shown, including:

[0039] Step 101: Divide the stored data into data blocks of variable sizes;

[0040] figure 2 In order to improve the flow chart of the hash algorithm, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a repeating date deleting method and a device, and belongs to the field of application of computers. The utility model solves the problem of error detection to repeating date during the deleting process. The method comprises the followings: generating finger prints of all data blocks, when a plurality of data blocks with the same finger prints exist, a comparison is made to the data blocks with the same finger prints, which are called out from backend storage; when the data blocks with the same finger prints are the same completely, repeating data deletion is performed on the data blocks. The invention provides the technical scheme which is suitable for large scale data storage, and accurate data deletion is achieved.

Description

Technical field [0001] The present invention relates to the field of computer applications, in particular to a method and device for deleting duplicate data. Background technique [0002] With the rapid growth of terabytes or even petabytes of data storage, the concept of green storage based on deduplication has been proposed. Data deduplication can reduce storage system procurement costs, save power, and reduce heat dissipation. With the development of storage virtualization, block-level data deduplication technology has become the mainstream technology. The process of deduplication requires the use of a hash algorithm to generate the fingerprint of the data block, and then compare the fingerprints to determine whether it is a duplicate data block. The data block with the same fingerprint is regarded as the same data block, and the same data block is established An index points a new data block to the same data block that already exists; different data blocks need to be stored...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 张砚波周龙飞
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products