Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mass content storage system and method based on distributed big data blocks

A distributed storage and large data block technology, applied in the input/output process of data processing, digital data processing, structured data retrieval, etc., can solve problems such as inconvenient management, many file directories, and many file system files , to achieve the effect of supporting long-term storage, avoiding quantity restrictions, and reducing the number of files

Pending Publication Date: 2021-06-11
兴业数字金融服务(上海)股份有限公司
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] 1. A large number of small file metadata will occupy a large number of records in the relational database, resulting in a decline in the access performance of the relational database, data backup, and inconvenient management, especially for files with a long storage period
[0004] 2. A large number of small files are stored on the file system, resulting in too many file system files and too many file directories. To solve this problem, most products use hash to divide file directories, resulting in complex file structures and excessive management of file metadata by the operating system. many
[0005] 3. In order to ensure data reliability, SAN architecture storage is generally used to achieve unified storage and storage reliability, resulting in high cost, corresponding to massive data storage will become larger and larger, and long-term storage cannot be realized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mass content storage system and method based on distributed big data blocks
  • Mass content storage system and method based on distributed big data blocks
  • Mass content storage system and method based on distributed big data blocks

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0054] Such as figure 1 According to the massive content storage system based on distributed large data blocks provided by the present invention, it includes the following modules:

[0055] Distributed storage module: This module is responsible for providing distributed storage functions, integrating the local storage disks on the entire distributed cluster nodes into a unified distributed storage medium, and providing copy technology. The underlying layer is implemented by some distributed storage frameworks. It is implemented by distributed storage systems such as HDFS, GPFS, and Ceph. This module is responsible for formatting the integrated distributed storage and dividing it into fixed-size storage blocks, such as a 256M storage block. The storage block is a unit of distributed storage. , with multiple copies, see image 3 , Figure 4 .

[0056] File block management module: This module is responsible for managing the storage blocks that have been formatted from the dis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a mass content storage system and method based on distributed big data blocks, and the system comprises a distributed storage module which integrates local storage disks on a whole distributed cluster node into a uniform distributed storage medium, formats the integrated distributed storage medium, and divides the integrated distributed storage medium into storage blocks with fixed sizes; a file block management module which is used for managing the formatted storage blocks and distributing the storage blocks to the file read-write module to read and write specific file contents; a file metadata module which is used for storing file metadata; a file reading and writing module which is responsible for reading and writing file contents; and a file block fragment arrangement module which is used for monitoring the fragment condition of the file storage block, arranging fragments of the file storage block, secondarily integrating effective storage into a complete storage block, and then releasing the original fragment storage block. According to the method, the small files are merged and stored by adopting the storage blocks, so that the number of the files is reduced, and the inode number limitation of a traditional file system is avoided.

Description

technical field [0001] The present invention relates to the technical field of data storage, in particular to a massive content storage system and method based on distributed large data blocks. Background technique [0002] Most of the existing technologies use content storage to realize enterprise content storage. Content storage generally stores file metadata in a relational database and stores files on a file system, such as IBM's CM and FileNet CE. However, the objective disadvantages of these content storages are: [0003] 1. A large number of small file metadata will occupy a large number of records in the relational database, resulting in the decline of relational database access performance, data backup, and inconvenient management, especially for files with a long storage period. [0004] 2. A large number of small files are stored on the file system, resulting in too many file system files and too many file directories. To solve this problem, most products use hash...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/182G06F16/172G06F3/06G06F16/27
CPCG06F16/182G06F16/172G06F3/064G06F16/27
Inventor 吝晓军
Owner 兴业数字金融服务(上海)股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products