Data archiving method for load sensing in erasure code storage

A data archiving and load-aware technology, applied to redundancy in computing for data error detection, response error generation, etc., can solve performance bottlenecks, single-strip archiving performance bottlenecks, etc. Filing efficiency, light load effect

Active Publication Date: 2015-03-04
HUAZHONG UNIV OF SCI & TECH
View PDF1 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Aiming at the above defects or improvement needs of the prior art, the present invention provides a load-aware data archiving method in erasure code storage. It becomes a performance bottleneck in the process of single-strip archiving, and all archiving encoding tasks are completed by the archiving management node. The management node becomes the performance bottleneck of the entire archiving process.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data archiving method for load sensing in erasure code storage
  • Data archiving method for load sensing in erasure code storage
  • Data archiving method for load sensing in erasure code storage

Examples

Experimental program
Comparison scheme
Effect test

example

[0074] The load-aware archiving method in the erasure code storage of the method of the present invention is applied in a three-copy storage cluster. The following takes the parameter configuration "number of production nodes = 18, K = 6, N = 9" as an example to describe the above-mentioned archiving process in detail. It mainly includes the following steps:

[0075] (1) According to the difference between the network IO processing capacity and the network IO load of the nodes on the production cluster in the past T1 time period, initialize the load weight value of the production node, assuming that the load weight value of each node is .

[0076] (2) setting counter i=1;

[0077] (3) For the i-th stripe, initialize its archive node set ANS={};

[0078] (4) Read the data distribution bitmap from the management node in the erasure code storage cluster, and obtain the data distribution set DDS of the first strip as: {N 1 (D 2 ,D 5 ), N 2 (D 1 ,D 2 ,D 3 ), N 3 (D 3 ,D ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data archiving method for load sensing in erasure code storage. The data archiving method comprises the steps: acquiring a load weight value of each node of a production cluster in an erasure code storage cluster, storing the load weight values into an array and according to the load weight values of the nodes, determining archiving node sets of different bands in the production cluster; initializing the archiving node set of an ith band into an empty set, selecting a node with the highest load weight value from a data distribution set of the ith band, judging whether the load weight value of the node is greater than the number of data blocks of the node, and if yes, filling the data blocks corresponding to the node into the archiving node set of the band and updating the load weight values of the related nodes; if no, selecting a node with the second highest load weight value from the data distribution set of the band. The data archiving method solves the problem of performance bottleneck caused by the case that in an existing method, code nodes do not consider node weight values and all data blocks on bands are only acquired from fixed nodes.

Description

technical field [0001] The invention belongs to the technical field of computer storage, and more specifically relates to a load-aware data archiving method in erasure code storage. Background technique [0002] Distributed storage clusters use redundant data provided by unreliable single nodes to ensure data reliability. In distributed storage environments such as GFS, HDFS, and amazon S3, a three-copy redundancy method is adopted. In the case of three copies, data reliability and load balancing characteristics can be well obtained. In a large-scale cluster, the defect brought by three copies is that the storage efficiency is low, and the storage cost is too high. For the cluster environment where data is written once and read many times, erasure code data archiving can effectively improve storage efficiency. In a large-scale cluster environment, nodes in the cluster are heterogeneous due to different I / O service requests. During the archiving process, the node with too ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/14
Inventor 黄建忠曹强谢长生代尔卫罗海兵
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products