Fast distributed-storage multi-replica consistency checking method and storage architecture

A distributed storage and storage architecture technology, applied in the field of distributed storage multi-copy fast check consistency, can solve problems such as consuming large CPU and storage host bandwidth, wasting resources, affecting system performance, etc., to achieve concurrent computing, reduce The consumption of bandwidth and the effect of improving the speed of verification

Inactive Publication Date: 2018-01-26
深圳市联云港科技有限公司
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] But for large files, calculating the hash value will consume a lot of CPU and storage host bandwidth, seriously affecting the performance of the system
In addition, there are often few inconsistencies in the file, but calculating the hash value of the file requires reading the contents of the entire file, resulting in a huge waste of resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fast distributed-storage multi-replica consistency checking method and storage architecture
  • Fast distributed-storage multi-replica consistency checking method and storage architecture
  • Fast distributed-storage multi-replica consistency checking method and storage architecture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to make the object, technical solution and advantages of the present invention more clear and definite, the present invention will be further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0035] The embodiment of the present invention provides a method for quickly verifying the consistency of multiple copies of distributed storage, please refer to Figure 1-4 , as shown in the figure, by adopting a control host-storage host processing architecture.

[0036] Specifically include the following steps:

[0037] S100. Divide the stored file evenly into several data segments in advance, and each data segment is respectively provided with a corresponding first hash value, and is provided with a flag bit for indicating whether the corresponding first hash value is expired.

[0...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a fast distributed-storage multi-replica consistency checking method and a storage architecture. The method adopts a control host-storage host processing architecture, and includes: evenly dividing a stored file into a plurality of data segments in advance, and respectively setting an individually-corresponding first hash value and a mark bit, which is used for indicating whether the corresponding first hash value expires, for each data segment; when a writing request is received, calculating the corresponding mark bit according to offset and length of the writing request, and setting the mark bit to expiry; and screening out the expiry mark bit, updating the first hash value corresponding to the mark bit, and then calculating a second hash value of the entire fileaccording to the first hash value of each data segment. According to the method, the large file is divided into the plurality of data segments, the hash values of the file are calculated in a segmented manner, then the hash value of the entire file is calculated through the hash value of each segment, reading data of the entire file is avoided, thus a consistency detection speed is increased, andbandwidth consumption of storage hosts is reduced.

Description

technical field [0001] The present invention relates to the technical field of information storage, in particular to a method and a storage architecture for quickly verifying the consistency of multiple copies of distributed storage. Background technique [0002] With the advent of the information age, the amount of global data is in an explosive growth trend. Improving the reliability of storage systems and ensuring data availability have become the research focus of enterprises. Most of the existing distributed storage systems use multi-copy technology to improve system reliability, availability, performance and scalability. However, the distributed storage system communicates through the network, and the instability of the network may easily cause inconsistency of the back-end data, and the distributed storage system generally includes a large number of server hosts and disks, and the probability of hardware damage is relatively high. [0003] If the consistency of the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06G06F11/10
Inventor 陈仲涛
Owner 深圳市联云港科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products