
A distributed storage system and method for storing backup data

A technology for distributed storage and backup of data, applied in the fields of input/output processes for data processing, redundant-data error detection in operation, and digital data processing. It addresses problems such as wasted disk space, shortens the recovery window, resolves low storage utilization, and improves indexing speed.

Inactive Publication Date: 2019-01-29
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
5 Cites, 16 Cited by

AI Technical Summary

Problems solved by technology

[0003] However, current backup systems have the following problems in supporting disk devices. Mainstream backup software manages disks as if they were tapes, so disk space is reclaimed only after all the data stored on the disk has expired, which wastes a great deal of disk space.
For file system backup, if a file index is used for management, a large number of files makes the backup software's index difficult to manage. After many files have been backed up over a long period, the index space of the backup system grows sharply, accompanied by significant performance degradation.
When restoring incremental backups of massive numbers of files, the increments must be restored one at a time, which consumes a lot of time. Browsing and restoring files within a cycle requires cumbersome index queries. With a file index, index efficiency is very low and the index structure is bloated.
If a small open source database is used to store the index, the stability of the database decreases when there are too many files, which makes maintenance harder. If a large commercial database is used to store the index, high additional costs are incurred.
Backup and restoration of data are supported by only a single storage device. When the backup domain is large, the backup device is likely to become a performance bottleneck, especially in throughput, so multiple parallel backup jobs may fail to complete within the specified backup window.
The storage expansion capability of a single storage device is limited, and vertical expansion of its capacity reduces the processing performance of the backup device. The backup device is also a single point of failure: if its hardware is damaged, system backup jobs have to be interrupted until the hardware is repaired.
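
The disk-space problem in the first item can be illustrated with a small sketch. It is not taken from the patent: the names `Backup`, `reclaim_tape_style`, and `reclaim_per_object`, as well as the sizes and expiry days, are hypothetical. It compares reclaiming a volume only when every backup on it has expired with reclaiming each expired backup individually.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backup:
    name: str
    size_gb: int
    expires_day: int  # day on which the backup's retention ends (hypothetical unit)


def reclaim_tape_style(volume: list[Backup], today: int) -> int:
    """Tape-style management: space is freed only if ALL backups have expired."""
    if all(b.expires_day <= today for b in volume):
        return sum(b.size_gb for b in volume)
    return 0


def reclaim_per_object(volume: list[Backup], today: int) -> int:
    """Per-object management: each expired backup is freed on its own."""
    return sum(b.size_gb for b in volume if b.expires_day <= today)


# Hypothetical volume holding two short-lived backups and one long-lived backup.
volume = [
    Backup("daily-01", 100, expires_day=7),
    Backup("daily-02", 100, expires_day=8),
    Backup("monthly-01", 50, expires_day=30),
]

print(reclaim_tape_style(volume, today=10))  # 0
print(reclaim_per_object(volume, today=10))  # 200
```

In this example a single long-lived backup pins 200 GB of already expired data under tape-style management, which is the kind of waste the paragraph above describes.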



Examples


Embodiment 1

[0052] As shown in Figure 1, Embodiment 1 of the present invention provides an overall structure diagram of a distributed storage system for storing backup data.

[0053] A distributed storage system for storing backup data includes cluster management nodes, metadata management nodes and storage nodes. In Embodiment 1, there are 3 cluster management nodes, 2 metadata management nodes (a primary metadata management node and a backup metadata management node), and 4 storage nodes. However, the structure of the distributed storage system for storing backup data protected by the present invention is not limited to this embodiment and may take other forms.

[0054] The cluster management node is used to provide cluster management services, as well as the election and arbitration of metadata management nodes. When a cluster management node goes down, the remaining nodes in the cluster will elect a cluster management node again.
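
As a rough illustration of the election and arbitration role described in [0054], the sketch below models the management and metadata nodes of Embodiment 1. The excerpt does not specify the election algorithm, so the majority-quorum rule, the lowest-name tie-break, the node names, and the identifiers `Cluster`, `elect_manager`, and `arbitrate_primary_metadata` are all assumptions made for illustration only. The four storage nodes are omitted because they take no part in the election.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Cluster:
    managers: list[str]   # cluster management nodes
    metadata: list[str]   # metadata management nodes, in order of preference
    down: set[str] = field(default_factory=set)  # nodes currently failed

    def live(self, nodes: list[str]) -> list[str]:
        return [n for n in nodes if n not in self.down]

    def has_quorum(self) -> bool:
        # Assumption: decisions need a strict majority of management nodes.
        return len(self.live(self.managers)) > len(self.managers) // 2

    def elect_manager(self) -> Optional[str]:
        # Assumption: the live management node with the lowest name wins.
        if not self.has_quorum():
            return None
        return min(self.live(self.managers))

    def arbitrate_primary_metadata(self) -> Optional[str]:
        # Assumption: the first live metadata node in preference order is primary.
        if not self.has_quorum():
            return None
        live = self.live(self.metadata)
        return live[0] if live else None


# Node counts follow Embodiment 1; the node names are hypothetical.
cluster = Cluster(
    managers=["mgmt-1", "mgmt-2", "mgmt-3"],
    metadata=["meta-primary", "meta-backup"],
)
cluster.down.add("mgmt-1")        # one cluster management node goes down
cluster.down.add("meta-primary")  # the primary metadata node goes down

print(cluster.elect_manager())               # mgmt-2
print(cluster.arbitrate_primary_metadata())  # meta-backup
```

With three management nodes, a strict-majority rule tolerates the loss of one node. If a second one fails, both methods in this sketch return `None` rather than risk two conflicting decisions.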

[0055] The metadata management node is used to m...



Abstract

The invention provides a distributed storage system and a method for storing backup data, which manage backup data through a distributed storage system instead of an index. The distributed storage method includes a method for storing backup data, a method for data recovery, and a method for garbage collection of backup data. The data storage method makes full use of the high IOPS of disk devices and the high concurrency of the cluster. The system is designed with redundancy, so it does not collapse easily, and it eliminates much of the work of optimizing the backup server's own database, which makes system management easier for administrators and saves database licensing costs. Unlike the index management of traditional backup systems, it does not need to simulate tapes to manage file system space, so space can be reclaimed promptly once data becomes invalid, which improves the utilization of storage space.

Description

Technical field

[0001] The invention relates to the technical field of data backup, in particular to distributed storage and file system technologies, and specifically to a distributed storage system and method for storing backup data.

Background

[0002] With the development of new technologies such as cloud computing and big data, the amount of data generated by business systems is increasing exponentially, so the RPO and RTO requirements on backup systems are becoming ever stricter. At the same time, disk technology is developing far faster than tape media, and more and more deployments use disk devices instead of tape devices as the preferred backup medium.

[0003] However, current backup systems' support for disk devices has the following problems: mainstream backup software manages disks as if they were tapes, and disk space is reclaimed only after all data stored on the disk has expired, which greatly wastes disk space. For...


Application Information

IPC(8): G06F3/06, G06F11/14
CPC: G06F3/0604, G06F3/0638, G06F3/067, G06F11/1402
Inventor: 靖尧, 王承龙
Owner: ZHENGZHOU YUNHAI INFORMATION TECH CO LTD