Optimization method based on HDFS small file storage

An optimization method for small file storage, applied in the fields of file systems, file access structures, and special data processing applications. It addresses the problem that storing many small files causes the NameNode to use too much memory and reduces performance. Its stated effects are a simple system structure, easy promotion and use, and improved performance.

Status: Inactive; Publication Date: 2016-06-01
UESTC COMSYS INFORMATION
4 Cites, 6 Cited by

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to overcome the problem in the prior art that storing a large number of small files in HDFS causes the NameNode to use too much memory and degrades performance. It provides an optimization method based on HDFS small file storage that operates on small files through the correspondence between a metadata structure and each user's small files, and that can improve the performance of HDFS when storing and processing small files.


Examples

Embodiment Construction

[0025] Cloud storage is derived from cloud computing. The term generally has two meanings. On the one hand, it refers to the storage part of cloud computing, that is, the storage of the resources and information required while cloud computing runs. On the other hand, it refers to a form of service: cloud storage service providers offer equipment or storage space, and users access the service through browsers or other clients to avoid local storage costs. The cloud storage described in this application belongs to the latter; to be precise, it is a cloud storage service.

[0026] The technical solution of the present invention will be further described below in conjunction with the accompanying drawings.

[0027] As shown in Figure 1, the optimization method based on HDFS small file storage of the present invention comprises the following steps:

[0028] S1. On the basis of the original HDFS architecture,...

Abstract

The invention discloses an optimization method based on HDFS small file storage. The method includes the following steps: S1, a metadata server is established to store the metadata information of the user space; S2, a user file is established for each user to store all of that user's small files; S3, a metadata structure is defined to record detailed metadata for all the small files of each user, including the offset of each small file within the user file and the size of each small file; S4, each user's small files are managed according to the correspondence between the metadata structure and that user's small files. By introducing a user space metadata server that stores the metadata information of the user space, and by operating on small files through the correspondence between the metadata structure and the users' small files, the method can improve the performance of HDFS in storing and processing small files.
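Steps S1 to S4 can be illustrated with a minimal in-memory sketch. It is not the patented implementation: the names `SmallFileMeta`, `UserSpace`, and `MetadataServer` are hypothetical, and a `bytearray` stands in for the per-user file that would actually live in HDFS. The sketch only shows how recording the position (offset) and size of each small file within the user file is enough to read that file back.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class SmallFileMeta:
    """S3: metadata for one small file (hypothetical layout)."""
    offset: int  # byte position of the small file inside the user file
    size: int    # length of the small file in bytes


@dataclass
class UserSpace:
    """S2: one merged user file plus the metadata records for its small files.

    Assumption: a bytearray stands in for the user file stored in HDFS.
    """
    user_file: bytearray = field(default_factory=bytearray)
    index: Dict[str, SmallFileMeta] = field(default_factory=dict)

    def put(self, name: str, data: bytes) -> SmallFileMeta:
        """Append a small file to the user file and record where it landed."""
        if name in self.index:
            raise KeyError(f"small file already exists: {name}")
        meta = SmallFileMeta(offset=len(self.user_file), size=len(data))
        self.user_file.extend(data)
        self.index[name] = meta
        return meta

    def get(self, name: str) -> bytes:
        """S4: use the metadata record to slice the small file back out."""
        meta = self.index[name]
        return bytes(self.user_file[meta.offset:meta.offset + meta.size])


class MetadataServer:
    """S1: keeps the user-space metadata, one UserSpace per user."""

    def __init__(self) -> None:
        self._spaces: Dict[str, UserSpace] = {}

    def space_for(self, user: str) -> UserSpace:
        # Create the user's space on first use, then reuse it.
        return self._spaces.setdefault(user, UserSpace())


if __name__ == "__main__":
    server = MetadataServer()
    space = server.space_for("alice")
    space.put("a.txt", b"hello")
    space.put("b.txt", b"world!")
    print(space.index["b.txt"], space.get("b.txt"))
```

In this sketch many small files share one underlying user file, so the storage layer sees a single file per user, while lookups by small file name go through the metadata records.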

Description

Technical field

[0001] The invention belongs to the field of performance optimization of distributed file systems, and in particular relates to an optimization method based on HDFS small file storage.

Background technique

[0002] With the large-scale growth of information volume, enterprises invest more and more in data storage. There is an urgent need for new storage solutions that change the status quo, save storage costs, and reduce storage investment, and cloud storage has emerged in response. The prototype of cloud computing was designed by Google to turn waste into treasure. The cloud storage architecture therefore has unique advantages that traditional centralized storage cannot replace. In the current situation of large-scale data growth, the advantages of cloud storage over traditional storage models include cost reduction, on-demand allocation, strong scalability, strong flexibility, strong fault tolerance, and convenient data migration.

[0003] Hadoop is an op...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/1824, G06F16/13
Inventor: 唐雪飞, 陈科, 马晨曦, 吴亚骏
Owner: UESTC COMSYS INFORMATION