Duplicate data processing method and device, storage controller and storage node

A storage controller and data deduplication technology, applied in the storage field, that can solve problems such as highly uncertain data correlation and a reduced deduplication rate.

Active Publication Date: 2014-06-11
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, the inventors found that when the data streams received by the physical nodes are relatively scattered and the data I/O is small, the data correlation between data streams is highly uncertain, and the prior-art method therefore reduces the deduplication rate.

Examples

Embodiment Construction

[0101] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings in the embodiments. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.

[0102] Figure 1 shows a storage system provided by an embodiment of the present invention. The storage system includes a storage controller and a storage device, and the storage device stores the correspondence between the fingerprint representative value (SID) of a data block and the fingerprint value (chunk ID) of the data block, the fingerpr...

Abstract

According to the duplicate data processing methods provided by the embodiments of the invention, in the correspondence between the fingerprint representative values of data blocks and the fingerprint values of the data blocks, the fingerprint values of data blocks belonging to the same data stream are stored together, contiguously and in the order in which the blocks appear in the data stream. During full fingerprint comparison of data blocks, the contiguously stored fingerprint values are loaded into memory for comparison, which effectively improves the efficiency of searching for duplicate data.
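The storage layout described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `FingerprintIndex`, its method names, and the rule of using the smallest fingerprint in a stream as its representative value are all hypothetical, since the visible text does not say how the representative value is chosen.

```python
from typing import Dict, List


class FingerprintIndex:
    """Maps a fingerprint representative value (SID) to the chunk
    fingerprints of a data stream, kept together in stream order.

    Hypothetical sketch: names and the sampling rule are assumptions.
    """

    def __init__(self) -> None:
        self._by_sid: Dict[str, List[str]] = {}

    @staticmethod
    def representative(fingerprints: List[str]) -> str:
        # Assumed sampling rule: the smallest fingerprint represents the stream.
        return min(fingerprints)

    def store_stream(self, fingerprints: List[str]) -> str:
        """Store one stream's fingerprints contiguously, preserving order."""
        sid = self.representative(fingerprints)
        self._by_sid.setdefault(sid, []).extend(fingerprints)
        return sid

    def find_duplicates(self, fingerprints: List[str]) -> List[bool]:
        """Load the contiguous run for the matching SID into memory once,
        then compare every incoming fingerprint against it."""
        if not fingerprints:
            return []
        sid = self.representative(fingerprints)
        loaded = set(self._by_sid.get(sid, []))  # single in-memory load
        return [fp in loaded for fp in fingerprints]


index = FingerprintIndex()
index.store_stream(["b", "a", "c"])            # stored in stream order under SID "a"
flags = index.find_duplicates(["a", "c", "z"])  # "z" is new data
```

Because all fingerprints of one stream sit under one key, a single lookup brings the whole run into memory, instead of one lookup per chunk.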

Description

Technical field

[0001] Embodiments of the present invention relate to storage technology, and in particular to a method and device for processing duplicate data, a storage controller, and a storage node.

Background

[0002] Data deduplication, also known as intelligent compression or single-instance storage, is a data storage technology that automatically searches for duplicate data, keeps only one copy of identical data, and replaces the other duplicate copies with pointers to that single copy, thereby eliminating redundancy and reducing storage capacity requirements.

[0003] In the prior art, to improve deduplication efficiency, data is usually aggregated to strengthen the correlation between data. When deduplication is performed, the physical node receiving the data stream usually divides it into several data blocks, groups the obtained data blocks, and, for each group, samples a part of the metadata information from th...
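The general idea of deduplication described in the background (keep one copy, replace repeats with pointers) can be sketched as below. This is a generic illustration, not the prior-art grouping and sampling scheme, whose details are truncated in the text. The function names and the use of SHA-256 as the fingerprint are assumptions.

```python
import hashlib
from typing import Dict, List, Tuple


def deduplicate(blocks: List[bytes]) -> Tuple[Dict[str, bytes], List[str]]:
    """Keep a single copy of each distinct block and describe the original
    sequence as a list of pointers (fingerprints) into that store."""
    store: Dict[str, bytes] = {}
    pointers: List[str] = []
    for block in blocks:
        fp = hashlib.sha256(block).hexdigest()  # assumed fingerprint function
        if fp not in store:
            store[fp] = block                   # first occurrence: keep the data
        pointers.append(fp)                     # every occurrence: keep a pointer
    return store, pointers


def restore(store: Dict[str, bytes], pointers: List[str]) -> List[bytes]:
    """Rebuild the original block sequence by following the pointers."""
    return [store[fp] for fp in pointers]


blocks = [b"A", b"B", b"A", b"A"]
store, pointers = deduplicate(blocks)
```

Here four incoming blocks are reduced to two stored copies, while the pointer list still allows the original sequence to be rebuilt.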

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/24556, G06F16/24568
Inventor: 刘强
Owner: HUAWEI TECH CO LTD