Forest type storage structure and method for distributed erasure code hybrid storage based on multiple storage media

A technology of distributed storage and storage media, which is applied in the field of forest-type storage structure of distributed erasure code hybrid storage, can solve the problems of unaffordable cost of three copies, high unit storage price, high HDD price, etc., and achieve the solution of storage media The effect of shortened life, easy expansion, and improved fault tolerance

Active Publication Date: 2019-12-03
XI AN JIAOTONG UNIV
View PDF11 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Currently, SSDs are more expensive than HDDs due to high material costs and complex manufacturing.
The pure SSD system has excellent performance but the unit storage price is expensive, and the traditional three-copy storage technology leads to only 1 / 3 of the actual data storage utilization rate, and the high price of SSD cannot bear the cost of three copies at all; distributed storage The storage utilization rate of the erasure code storage method is relatively high, which can adapt to the expensive cost of SSD
However, due to the inherent coding properties of erasure codes, erasure codes will generate large write amplification during the use of distributed storage, resulting in excessive wear and tear of SSDs, reduced life expectancy, and reduced reliability.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Forest type storage structure and method for distributed erasure code hybrid storage based on multiple storage media
  • Forest type storage structure and method for distributed erasure code hybrid storage based on multiple storage media

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0043] In this embodiment, a distributed erasure code hybrid storage method based on multiple storage media includes the following steps:

[0044] Step 1: The client node sends the data to be stored to the storage node, and the storage node performs erasure coding on the data to be stored according to the selected erasure coding coding rules to generate erasure coding data. According to the erasure code encoding rules and the selected encoding matrix, the storage node classifies and divides the encoded data into data block data and check block data, and stores the classified data block data in the solid-state hard disk , and store the check block data to the mechanical hard disk to realize the hybrid storage process of the erasure code based on multiple storage media. Specific steps are as follows:

[0045] 1) In step 1, the storage node encodes the client data by using the existing erasure code encoding method, such as RS erasure code and other existing systematic erasure co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a forest type storage structure and a method for a distributed erasure code hybrid storage based on multiple storage media, which are characterized in that data block data of erasure codes are placed in a solid state disk for storage in a distributed storage system, and data check block data of the erasure codes are placed in a mechanical hard disk for storage. The method comprises the following steps: (1) classifying data storage media in a distributed storage system, and establishing a forest type hybrid storage structure; (2) classifying erasure code data in the distributed storage system into data block data and check block data, and marking the data block data and the check block data; and (3) placing the classified erasure code data on a specific tree of a forest type storage structure for distribution and disk falling. In this way, hybrid architecture storage of erasure code data on distributed storage based on multiple storage media is achieved. According to the invention, excessive wear of SSD caused by erasure code write amplification can be solved, the system performance is improved with lower cost, the service life is prolonged, and the reliability is enhanced.

Description

technical field [0001] The invention relates to the field of distributed storage, in particular to a forest-type storage structure and method based on multiple storage media for distributed erasure code hybrid storage. Background technique [0002] Large-capacity, low-cost, high-performance storage system design has always been a hotspot in the field of storage research. As humans have entered the era of big data, the explosive growth of data volume has put forward higher requirements for storage systems. Traditional data management The method has also encountered great challenges, and big data technology has gradually emerged. A very important point in big data technology is how to store and manage big data quickly and efficiently. On the one hand, the storage system must be able to achieve large-capacity storage at low cost, and on the other hand, the performance gap between storage and computing is constantly expanding, which requires high-performance data that matches t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06
CPCG06F3/061G06F3/0631G06F3/067
Inventor 董小社李征张兴军王宇菲
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products