Migration and erasure code-based reconstruction coupled rapid prediction repair method and implementation
A repair method and erasure code technology, applied in the field of fast predictive repair, can solve problems such as the inability to achieve fast data repair, and achieve the effect of ensuring reliable data storage and fast data repair
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] like figure 1 As shown, a fast predictive repair method coupled with migration and reconstruction based on erasure codes, including:
[0057] S1. Use node failure prediction algorithm to monitor node status;
[0058] S2. Organize the blocks on the node about to fail into multiple reconstruction sets;
[0059] S3. According to the number of blocks included in the reconstruction set, determine a coupling strategy for migration and erasure code-based reconstruction.
[0060] The reconstruction method based on erasure code needs to read k blocks from other k healthy nodes when repairing a block. In actual storage system deployment, in order to reduce the network overhead required for repair, the value of k is generally selected to be small (for example, in the Facebook F4 storage system, the value of k is 10).
[0061] Existing large-scale storage systems are usually built on hundreds or even thousands of storage servers (referred to as "nodes"). Therefore, in this embod...
Embodiment 2
[0064] On the basis of Embodiment 1 of the present invention, the use of node failure prediction algorithm to monitor node status includes:
[0065] Step 1.1. Monitor and collect the attribute values returned by the SMART module;
[0066] Step 1.2. Input the relevant attribute values, and use the node failure prediction algorithm to calculate the node failure probability. If the node failure probability exceeds a given threshold, it is judged that the node is about to fail.
Embodiment 3
[0068] On the basis of Embodiment 1 of the present invention or Embodiment 2 of the present invention, the organization of the blocks on the about-to-be-failure node into multiple reconstruction sets includes:
[0069] Obtain the metadata information of the system, mainly including the storage node ID of all blocks in each stripe, and the logical block number of each block in the stripe to which it belongs. The logical block number is used to determine the reconstruction time based on the erasure code. Decoding parameters to be used;
[0070] Determine all repaired blocks stored on the node about to fail and the stripes they belong to;
[0071] Organize the repaired block into a plurality of reconstruction sets, and determine the ID of the healthy node selected to read data when each repaired block is reconstructed by using the erasure code.
[0072] In a specific embodiment, the specific steps of organizing the blocks on the node about to fail into multiple reconstruction se...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com