Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for storing mass of small files on basis of master-slave distributed file system

A distributed file, massive small file technology, applied in the field of massive small file storage based on master-slave distributed file system, can solve the problem of consuming metadata node memory, only a single, etc.

Active Publication Date: 2013-04-03
NAT UNIV OF DEFENSE TECH
View PDF4 Cites 53 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] The technical problem to be solved by the present invention is: the general master-slave distributed file system can store files with a large data scale, and has high fault tolerance and high scalability, but it will cause some problems when used to store a large number of small files: (1) Centralized metadata service The master-slave distributed file system has only a single metadata node, and the number of files determines the size of the metadata. A large number of small files will consume the memory of the metadata node, and its metadata will exhaust the memory of the metadata node. memory beyond the limits of computer hardware

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for storing mass of small files on basis of master-slave distributed file system
  • Method for storing mass of small files on basis of master-slave distributed file system
  • Method for storing mass of small files on basis of master-slave distributed file system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] The specific embodiment of the present invention will be described with reference to the accompanying drawings.

[0066] figure 1 It is a structural diagram of a master-slave distributed file system.

[0067] figure 2 It is an overall structural diagram of the massive small file storage system constructed in the first step of the present invention. The massive small file storage system consists of a master-slave distributed file system and software for processing massive small files on each node of the master-slave distributed file system. These software include the index location maintenance module on the metadata node, the small file index module on the data node, the client cache module and the SmallFileAPI, a special interface for the client to operate small files. The index position maintenance module allocates data nodes (identified by IP address and port number) for each directory, sorts the mapping relationship between directories and data nodes, and returns...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for storing a mass of small files on the basis of a master-slave distributed file system, and aims to solve the problem about storing the mass small files in the master-slave distributed file system. The technical scheme includes that the method includes that a mass small file storage system is deployed and initialized, and then small files are created and read by a SmallFileAPI (small file-application program interface) of a client side according to a command received from a keyboard. When the small files are created by a system, the SmallFileAPI creates data files of the small files according to small file paths obtained from the client side, writes in small file data and simultaneously creates small file indexes at data nodes; when the small files are read by the system, data node information corresponding to a parent directory is acquired according to the small file paths, an index request is transmitted to any of the data nodes, and finally the small file data are read from the data files according to index information. By the aid of the method, the problem about mass of storage metadata of the mass of small files is solved, writing efficiency of the mass small file storage system is improved, and reliability of the system is guaranteed.

Description

technical field [0001] The invention relates to a method for storing a large number of small files on a master-slave distributed file system oriented to storage of a large number of large files. Background technique [0002] With the development of new computing technologies, both business and personal data began to grow rapidly. The growth of massive data brings not only the problem of storage capacity, but also brings challenges to data management and storage performance, which has become the core problem that needs to be solved in the era of cloud computing. In order to ensure high availability, high reliability and economy of data, cloud computing uses distributed storage to store data, and redundant storage to ensure data reliability. In order to meet the needs of a large number of users, the storage technology of cloud computing must have high throughput and high transfer rate. For the data storage problem of cloud computing, industry and academia have proposed a var...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
Inventor 王蕾何连跃徐叶李姗姗戴华东吴庆波丁滟黄辰林付松龄
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products