A lightweight approach to autonomous block management for data-intensive file systems

A data-intensive, file system technology, applied in the direction of electrical digital data processing, data processing input/output process, instruments, etc., can solve the problem that the management method is not suitable for data-intensive file system management, increase address and other metadata maintenance cost, reduce the scalability of the master node, etc., to achieve the effect of improving recoverability and scalability, reducing storage space overhead, and improving scalability

Active Publication Date: 2019-11-12
SHANGHAI MARITIME UNIVERSITY
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] It can be seen from the above analysis that the traditional file system management methods are not suitable for the management of data-intensive file systems. The main reasons are: 1) With the continuous increase of data volume, the storage of file data block address tables will occupy a large amount of storage space ; 2) The master node is responsible for the maintenance of the file data block address table. With the continuous increase of the file data block address table, the processing capacity of the master node is greatly reduced; 3) The continuous increase of the data volume not only takes up a large amount of storage space of the master node , which increases the maintenance cost of metadata such as addresses, and also reduces the scalability of the master node; 4) Each data storage node must first consult the master node when storing and querying, which increases the addressing time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A lightweight approach to autonomous block management for data-intensive file systems
  • A lightweight approach to autonomous block management for data-intensive file systems
  • A lightweight approach to autonomous block management for data-intensive file systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the following will further explain the autonomous block management of a light-weight data-intensive file system proposed by the present invention in combination with diagrams and specific embodiments method.

[0040] A lightweight autonomous block management method for data-intensive file systems, which implements the mapping from data blocks to data nodes and from data nodes to data blocks through a set of reversible mathematical functions. Such as figure 2 As shown, the division of the specific functions of each node in the present invention: the master node is only responsible for the maintenance of the system namespace, the distribution of data blocks to data storage nodes, and the management of each data storage node; each data storage node is responsible for the consistency check of data blocks, Data block recovery and mapping informat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an autonomous block management method for a lightweight data-intensive file system. The method is characterized in that the mapping of a data block to a data storage node, quick search of the data block in the data storage node, fast recovery of the data block when the data storage node is in failure, fast redistribution of the data block in the process of newly adding the data storage node and the like are realized by means of ISD (Intersected Shifted Declustering), so that a main node is only in charge of the storage and maintenance of file namespace in the data-intensive file system, but the storage and maintenance of mapping relationship information between the data block and the data storage node, the replacement of the data block when the data storage node is in failure, the redistribution of the data block during the process of adding a new data storage node and the like are all autonomously completed by the data storage node. The method disclosed by the invention has the advantages that memory space of the main node in the data-intensive file system is saved, the processing capability of the main node is improved, and the data block management efficiency of the data-intensive file system under a big-data environment can be dramatically improved.

Description

technical field [0001] The invention relates to computer security technology, in particular to a light-weight autonomous block management method of a data-intensive file system. Background technique [0002] Data-intensive file system DiFS, such as Google File System GFS, Hadoop Distributed File System HDFS, etc., has become the main file system for big data storage management. The current data-intensive file system DiFS adopts a master-slave architecture. The master node (metadata server) manages all metadata, and the slave node (data storage node) is only responsible for data storage. In order to maintain high availability, these storage systems usually divide data files into blocks of fixed size, each data block usually has 3 copies, and distribute them to different data storage nodes of the cluster. The master node must record the addresses of hundreds or thousands of data storage nodes, as well as record the mapping information from the data blocks of all data files to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F3/06
CPCG06F3/0604G06F3/064G06F3/0643
Inventor 陈付梅韩德志毕坤王军
Owner SHANGHAI MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products