High-efficiency and high-reliability big data storage system and method and computer program

A big data storage and reliability technology, applied in fields such as redundancy-based data error detection, computing, and electrical digital data processing. It addresses problems such as restrictions on the number of stripes or blocks, storage efficiency below 20%, and the low space utilization of existing fault-tolerance techniques.

Pending Publication Date: 2019-11-15
CHENGDU UNIV OF INFORMATION TECH
2 Cites, 20 Cited by

AI Technical Summary

Problems solved by technology

Although this type of array code improves fault tolerance, it faces several problems when applied to a distributed storage system. First, the number of stripes or blocks is restricted: the code can be constructed only when that number is prime or has a linear relationship with a prime. This greatly restricts expansion and conflicts with the easy scalability expected of a distributed storage system.
Second, storage efficiency decreases as the number of tolerated faults increases.
For example, a Weaver code designed to tolerate 10 errors has a storage efficiency of less than 20%.
As a result, array codes have low practicality in distributed storage systems.
[0007] To sum up, the problems with the existing technology are that fault-tolerance techniques in current distributed storage systems have low space utilization, low storage efficiency, poor reliability, poor scalability, and limited practicality.
[0009] First, the number of stripes or blocks is restricted: the code can be constructed only when that number is prime or has a linear relationship with a prime. This greatly restricts the expansion of the distributed storage system and conflicts with its expected ease of expansion.
Second, storage efficiency decreases as the number of tolerated faults increases.
For example, a Weaver code designed to tolerate 10 errors has a storage efficiency of less than 20%.
When a large-scale distributed storage system is expanded according to prime numbers, more physical nodes or virtual nodes must be added. More physical nodes raise the cost of each expansion, while more virtual nodes waste computation and reduce efficiency.
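
The cost of prime-based expansion can be illustrated with a small calculation. The sketch below is not taken from the patent: it assumes, as a simplification, that the code requires exactly a prime number of nodes, and the helper names `is_prime` and `padding_to_next_prime` are hypothetical.

```python
def is_prime(n: int) -> bool:
    """Trial-division primality test; adequate for realistic node counts."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def padding_to_next_prime(nodes: int) -> int:
    """Return how many extra (physical or virtual) nodes are needed to
    reach the smallest prime that is greater than or equal to `nodes`.

    Assumption: the array code needs exactly a prime number of nodes.
    """
    if nodes < 0:
        raise ValueError("nodes must be non-negative")
    target = nodes
    while not is_prime(target):
        target += 1
    return target - nodes


if __name__ == "__main__":
    for n in (13, 24, 90):
        print(n, "nodes need", padding_to_next_prime(n), "padding node(s)")
```

Under this assumption, a cluster that already has a prime number of nodes needs no padding, while other sizes must be padded up to the next prime with either real hardware or virtual nodes.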


Examples

Embodiment Construction

[0123] To make the object, technical solution, and advantages of the present invention clearer, the present invention is described in further detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit it.

[0124] The present invention addresses the low space utilization, low efficiency, poor reliability, and poor scalability of fault-tolerance techniques in current distributed storage systems. It provides a big data storage solution with optimal storage efficiency (that is, having the MDS property), high efficiency, high reliability, and easy expansion.
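
To make "optimal storage efficiency" concrete: a maximum distance separable (MDS) code with k data blocks and m parity blocks tolerates the loss of any m blocks while storing k + m blocks in total, so its efficiency is k / (k + m). The sketch below compares that with plain replication. The parameter values and function names are illustrative assumptions and do not come from the patent.

```python
from fractions import Fraction


def mds_efficiency(k: int, m: int) -> Fraction:
    """Storage efficiency of an MDS code with k data blocks and m parity
    blocks. Such a code tolerates the loss of any m blocks."""
    if k <= 0 or m < 0:
        raise ValueError("need k > 0 and m >= 0")
    return Fraction(k, k + m)


def replication_efficiency(t: int) -> Fraction:
    """Storage efficiency of replication that tolerates t lost copies,
    which requires t + 1 full copies of the data."""
    if t < 0:
        raise ValueError("need t >= 0")
    return Fraction(1, t + 1)


if __name__ == "__main__":
    t = 10  # number of failures to tolerate (illustrative value)
    for k in (10, 40):
        eff = mds_efficiency(k, t)
        print(f"MDS, k={k}, m={t}: {float(eff):.1%}")
    print(f"Replication, t={t}: {float(replication_efficiency(t)):.1%}")
```

For a fixed fault tolerance, the MDS efficiency rises as k grows, whereas replication stays at 1 / (t + 1).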

[0125] As shown in Figure 1, the efficient and highly reliable big data storage system provided by the embodiment of the present invention includes: a data upload module 101, an array module 102, an encoding module 103, a hear...


Abstract

The invention belongs to the technical field of information data processing, and discloses a high-efficiency and high-reliability big data storage system and method, and a computer program. The method includes the following steps: a configuration center is set up to obtain the data uploaded by a user; an array module calculates the coding array; the array calculated by the array module is read, and encoding is performed according to it; heartbeat information is sent between the master node and the slave nodes, and the heartbeat delay time is read from the configuration center; failed-node information is analyzed; data recovery is carried out; the position of each original data block is queried according to the ID of the data to be downloaded; the configuration center settings are read to allocate storage nodes for the coded blocks and to select new nodes during data recovery; the configuration center manages the various parameters of the system; and the coded blocks are stored. The invention provides a big data storage solution with optimal storage efficiency, high efficiency, high reliability, and easy expansion.
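
The heartbeat step can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it assumes the master keeps the timestamp of the last heartbeat received from each slave node and treats a node as failed once that timestamp is older than the heartbeat delay read from the configuration center. The function name `find_failed_nodes`, the node IDs, and the numeric values are hypothetical.

```python
from typing import Dict, List


def find_failed_nodes(
    last_heartbeat: Dict[str, float],
    now: float,
    heartbeat_delay: float,
) -> List[str]:
    """Return the IDs of nodes whose last heartbeat is older than
    `heartbeat_delay` seconds, sorted for deterministic output.

    Assumption: `last_heartbeat` maps node ID to the time (in seconds)
    at which the master last received a heartbeat from that node.
    """
    return sorted(
        node
        for node, seen_at in last_heartbeat.items()
        if now - seen_at > heartbeat_delay
    )


if __name__ == "__main__":
    # Hypothetical state held by the master node.
    seen = {"slave-1": 100.0, "slave-2": 91.0, "slave-3": 80.0}
    failed = find_failed_nodes(seen, now=101.0, heartbeat_delay=10.0)
    print("nodes to recover:", failed)
```

The list returned here would be the input to the failure analysis and data recovery steps; how the patent actually schedules recovery is not shown in this excerpt.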

Description

Technical field

[0001] The invention belongs to the technical field of information data processing, and in particular relates to an efficient and highly reliable big data storage system, method, and computer program.

Background technique

[0002] Currently, the closest prior art is as follows.

[0003] As information technology develops across industries and fields, data volume is growing exponentially, and this rapid growth raises problems of storage reliability and concurrent access efficiency. A generally effective approach is to build a distributed storage system composed of multiple data nodes (a data node can be a PC, a server, or any other device that can store data). The continuous increase in stored data across industries has led to ever larger distributed storage systems with ever more nodes. Some existing enterprises already have multiple storage systems w...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/10, G06F11/14, G06F16/182
CPC: G06F11/1004, G06F11/1446, G06F16/182
Inventor: 唐聃, 袁炜, 蔡红亮, 高燕, 刘善政, 曾琼
Owner: CHENGDU UNIV OF INFORMATION TECH