A method to improve the deduplication performance of large data blocks

Status: Inactive
Publication Date: 2018-10-19
EISOO SOFTWARE

AI Technical Summary

Problems solved by technology

[0005] In view of the shortcomings of the prior art described above, the purpose of the present invention is to provide a method for improving the deduplication performance of large data blocks, which solves the problems of poor performance and high cache usage in the prior art.


Examples

Embodiment Construction

[0012] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.

[0013] It should be noted that the diagrams provided in the following embodiments only schematically illustrate the basic ideas of the present invention, and only the components related to the present invention are shown in the diagrams rather than the number, shape and size of the compo...


Abstract

The invention provides a method for improving the deduplication performance of large data blocks. The method comprises the following steps: (1) a data block is obtained; (2) the data block is divided into data block fragments of the same fixed length; (3) a fingerprint is calculated for each data block fragment to obtain the corresponding fragment fingerprints; (4) the fragment fingerprints are compared with the existing fragment fingerprints to determine whether matching fingerprints exist; if not, a result indicating that the data block fingerprint was not found is returned and the query ends; if so, step 5 is performed; (5) a fingerprint is calculated for the whole data block to obtain the corresponding data block fingerprint; (6) the data block fingerprint is compared with the existing data block fingerprints to determine whether a matching fingerprint exists; if not, a result indicating that the data block fingerprint was not found is returned and the query ends; if so, the cache information of the data block fingerprint is saved and the query ends. Computing performance is improved by reducing the amount of fingerprint calculation, which in turn improves data block deduplication performance.
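The six steps of the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not name a fingerprint algorithm, a fragment length, or the exact matching rule, so SHA-256, the tiny `FRAGMENT_SIZE`, the `DedupIndex` class, and the rule that every fragment fingerprint must already be known are all hypothetical choices. The sketch stops at the first unknown fragment, which is one plausible reading of how the fingerprint workload is reduced for new blocks, and it returns the cached information on a hit rather than persisting it.

```python
import hashlib

FRAGMENT_SIZE = 4  # hypothetical fixed fragment length, tiny for demonstration


def fingerprint(data: bytes) -> str:
    """Fingerprint a byte string (SHA-256 is an assumption, not from the source)."""
    return hashlib.sha256(data).hexdigest()


def split_fragments(block: bytes, size: int = FRAGMENT_SIZE):
    """Step 2: yield fixed-length fragments (the last one may be shorter)."""
    for start in range(0, len(block), size):
        yield block[start:start + size]


class DedupIndex:
    """Hypothetical in-memory store of known fragment and block fingerprints."""

    def __init__(self):
        self.fragment_fps = set()  # existing fragment fingerprints
        self.block_fps = {}        # block fingerprint -> cached info

    def add(self, block: bytes, info: str) -> None:
        """Register a block so that later queries can find it."""
        for fragment in split_fragments(block):
            self.fragment_fps.add(fingerprint(fragment))
        self.block_fps[fingerprint(block)] = info

    def query(self, block: bytes):
        """Steps 3 to 6: return cached info for a duplicate block, else None."""
        # Steps 3-4: fingerprint fragments one by one; an unknown fragment
        # means the block cannot be a duplicate, so stop early.
        for fragment in split_fragments(block):
            if fingerprint(fragment) not in self.fragment_fps:
                return None
        # Steps 5-6: only now fingerprint the whole block and look it up.
        return self.block_fps.get(fingerprint(block))


index = DedupIndex()
index.add(b"abcdefgh", "location-1")
duplicate_hit = index.query(b"abcdefgh")   # all fragments and the block match
new_block = index.query(b"abcdXXXX")       # second fragment is unknown
reordered = index.query(b"efghabcd")       # fragments known, block is not
```

In the last query the fragment check passes but the whole-block lookup fails, which is why the block-level comparison in step 6 is still needed after the fragment-level filter.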

Description

Technical field

[0001] The invention relates to the field of data deduplication, in particular to a method for improving the deduplication performance of large data blocks.

Background

[0002] With the continuous development of computers, more and more data is stored on user computers, and protecting this data has become a difficult problem for users. In response, many vendors have introduced data deduplication solutions. The deduplication principles of different vendors are broadly similar, but their performance differs greatly. Performance is the key factor that determines how much data users can protect and for how long. Good performance allows users to better solve data protection problems.

[0003] Data block deduplication is a deduplication scheme that operates at the data block level. Its deduplication granularity is usually large, mostly several megabytes to ten...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/1752
Inventor: 吴植民
Owner: EISOO SOFTWARE