Unlock instant, AI-driven research and patent intelligence for your innovation.

An Adaptive Coding Storage Error Tolerance Method Based on Simple Regenerating Codes

A simple regeneration code, adaptive coding technology, applied in special data processing applications, instruments, computing and other directions, to achieve the effect of strong versatility and scalability, improve storage efficiency, and reduce repair costs

Inactive Publication Date: 2019-09-17
NANJING UNIV +3
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For cold data, since it is not accessed very frequently, it is less likely to cause degenerate reads due to data unavailability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Adaptive Coding Storage Error Tolerance Method Based on Simple Regenerating Codes
  • An Adaptive Coding Storage Error Tolerance Method Based on Simple Regenerating Codes
  • An Adaptive Coding Storage Error Tolerance Method Based on Simple Regenerating Codes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The present invention will be further described below in conjunction with the accompanying drawings.

[0038] The present invention introduces an adaptive coding storage fault-tolerant method based on simple regenerative codes based on HDFS (Hadoop Distributed File System, Hadoop distributed file system). The present invention comprises the steps:

[0039] Step 1: Set the access frequency of data files in the distributed file system HDFS to 10, the overall storage overhead of the system is B, the first time period is 24h, and the second time period is 24h; the time period can be determined according to the actual situation Depending on the situation, it can be set to one day, one week or longer. The access frequency threshold A can be set according to the length of the time period. If the time period is one day, the access frequency threshold can be set to 10.

[0040] Step 2: In the initial stage of file writing, the probability of the file being accessed later is ve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a self-adaptive encoding storage fault-tolerant method based on a simple regenerating code. The method comprises the following four main steps: 1) saving a file stored in the HDFS in a three-copy mode; 2) after one time slot, converting the file into a simple regenerating code fast code encoding; 3) after the file access is reduced, converting the file into the simple regenerating code compact code encoding; 4) possibly generating the condition that the file encoding states in the step 2) and step 3) are mutually converted along the dynamic attribute of the file in the running. Through the simultaneous adoption of the two encoding states of simple regenerating code fast code compact code and the fast code in the distributed file system HDFS, the saving expenditure and the repair cost are compromised; the self-adaptive encoding method based on the simple regenerating code is introduced into the HDFS so as to establish the self-adaptive encoding mechanism in combination with the file state and the system state; the integral saving efficiency of the distributed storage system is improved, and the repair cost is lowered.

Description

technical field [0001] The invention relates to the field of distributed storage, in particular to an adaptive encoding and storage fault-tolerant method based on simple regeneration codes. Background technique [0002] Since the use of erasure codes can achieve high reliability with low storage overhead, large-scale distributed storage systems have begun to use erasure codes. However, most online storage systems use only one erasure code, and the erasure code actually used in the system may be optimized in terms of reducing repair bandwidth overhead or reducing storage overhead. For example, RS coding is a widely used type of erasure code, which has been used in many storage systems such as Google's ColossusFS and FaceBook's HDFS-Raid. The RS code used in ColossusFS has a storage overhead of 1.5x, requiring 6 disk reads and network transfers to repair a data block. The storage overhead of the RS code used in HDFS-Raid is 1.4x, but repairing a data block requires reading o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/182
Inventor 朱定亚叶保留陆桑璐
Owner NANJING UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More