Repeated data processing method, device, storage controller, and storage node

A storage controller and data duplication technology, applied in the storage field, which can solve the problems of high correlation uncertainty and lower deduplication rate.

Active Publication Date: 2015-12-30
HUAWEI TECH CO LTD
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the inventors found that when the data streams received by the physical nodes are relatively scattered and the data I / O is small, the data correlation between each data stream has a high uncertainty, and the method of the prior art , will reduce the deduplication rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Repeated data processing method, device, storage controller, and storage node
  • Repeated data processing method, device, storage controller, and storage node
  • Repeated data processing method, device, storage controller, and storage node

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0101] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0102] figure 1 A storage system provided by an embodiment of the present invention, the storage system includes a storage controller, a storage device, and the storage device stores the corresponding relationship between the fingerprint representative value SID of the data block and the fingerprint value chunkID of the data block, The fingerprin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Disclosed is a repeated data processing method. In a corresponding relationship between representative values of fingerprints of data blocks and fingerprint values of the data blocks, the corresponding fingerprint values of the data blocks belonging to the same data stream are stored together, and are continuously stored in the corresponding relationship according to the sequence in the data stream; and in all-fingerprint comparison of the data blocks, the continuously stored fingerprint values are loaded into a memory and compared, so that the searching efficiency of repeating data is effectively improved.

Description

technical field [0001] Embodiments of the present invention relate to storage technology, and in particular to a method and device for processing repetitive data, a storage controller, and a storage node. Background technique [0002] Data deduplication, also known as intelligent compression or single instance storage, is a method that can automatically search for duplicate data, keep only one copy of the same data, and replace other duplicate copies with pointers to the single copy to eliminate redundancy Data storage technology that reduces storage capacity requirements. [0003] In the prior art, in order to improve the efficiency of deduplication, in the prior art, the data is usually aggregated to improve the interrelationship between data. When deduplication is performed, the physical node receiving the data stream usually performs Block to obtain several data blocks, group the obtained data blocks, and for each group, sample a part of the metadata information from th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/24568
Inventor 刘强
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products