Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Read-write solution for tens of millions of small file data

A solution method and technology for file data, applied in the field of computer applications, can solve the problems of reducing the read rate of small files, low disk I/O performance, file fragmentation disk space, etc., so as to improve file transmission performance, improve cache utilization, Reduce the effect of frequent I/O operations

Inactive Publication Date: 2015-02-25
LANGCHAO ELECTRONIC INFORMATION IND CO LTD
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) Due to the high access frequency of small files and the need to access the disk multiple times, the performance of disk I / O is low;
[0005] (2) Because the file is relatively small, it is easy to form file fragments and cause waste of disk space;
[0006] (3) It is easy to generate network delay when establishing a connection for each small file request, which reduces the reading rate of small files

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] A reading and writing solution for tens of millions of small file data of the present invention will be described in detail below.

[0019] A kind of reading and writing solution of tens of millions of small file data of the present invention, the method is as follows:

[0020] The storage structure layout design method for small files on the disk array is as follows:

[0021] When storing small files, the present invention stores a large number of small files by opening up a large continuous disk space, that is, stores logically continuous data on the continuous space of the disk array as much as possible, that is, the data of the same file Or store the file data under the same folder on continuous disk array blocks as much as possible; the disk space is divided into multiple blocks, and the size of each block is 64KB. The basic idea is: each small file can only be stored in In a single block, it cannot be stored across 2 blocks. Each folder will have one or more bloc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a read-write solution for tens of millions of small file data. According to the solution, the mode of creating large-block continuous disk space is used for storing a large number of small files when the small files are stored, namely, logically continuous data are stored on the continuous space of a disk array as far as possible; the disk space is divided into a plurality of blocks, and the size of each block is 64 KB. According to the basic thought, each small file can only be stored in the single block and cannot be stored by crossing two blocks, each folder can be provided with one or more blocks, the blocks are only used for storing the data of the corresponding folder, and each piece of file data is stored on the continuous disk space. Compared with the prior art, the logically continuous data are stored on the continuous space of each physical disk as far as possible, the cache technology is used for playing the role of a metadata server, the cache utilization rate is improved through simplified file information nodes, and thus the access performance of the small files is improved.

Description

technical field [0001] The invention relates to the technical field of computer applications, in particular to a reading and writing solution for tens of millions of small file data. Background technique [0002] At the current stage of reading and storage, small files are the most common data form for data access and use. Compared with the striping technology of large files, slicing is used to improve the concurrency of user access to files. Small files (≤64KB) are not conducive to striping. The traditional method is to store a single file in a single data server. However, when the number of small files reaches a certain level, a large number of repeated accesses to small files will bring performance burdens and I / O bottlenecks to the data server. The form of small files with high frequency is manifested, and in the information reading and storage of general users, there are more reading and storage of small files, so the research on the performance of high-frequency small...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F3/06G06F12/02
CPCG06F3/061G06F3/0631G06F3/0644
Inventor 张砚波吴丙涛
Owner LANGCHAO ELECTRONIC INFORMATION IND CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products