Cache method for file system with changeable data block length

A file system and data block technology, applied in memory systems, electrical digital data processing, and special data processing applications, that addresses problems such as poor stability, low efficiency, and the inability to adapt effectively to the characteristics of data operations.

Inactive Publication Date: 2011-02-09
TSINGHUA UNIV
4 Cites, 33 Cited by

AI Technical Summary

Problems solved by technology

[0005] Clearly, a file system cache management method based on fixed-length blocks cannot adapt effectively to the data operation characteristics of content-based variable-length block segmentation described above: when modified data in the cache is re-segmented by content, the data to be split must be placed in a contiguous memory space. A file system cache built on fixed-length blocks therefore requires a large number of memory allocation and data movement operations, which makes it inefficient, complicated to manage, and poor in stability.
[0006] Because of the inherent defects of the fixed-length block file system caching method in supporting data deduplication, many systems that use deduplication to reduce network bandwidth overhead, such as the "A Low-Bandwidth Network File System" system and the WAN data backup and synchronization system Dropbox, need to keep a complete copy of each operated file on the client. Whether the file copy is reconstructed from data blocks or the modified file copy is re-divided into data blocks, this consumes substantial storage and computing resources and is an inefficient implementation.
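To make the notion of content-based variable-length segmentation concrete, the following is a minimal sketch, not the method of the invention. The function name `split_by_content`, its parameters, and the windowed byte sum used as a fingerprint are all assumptions for illustration; systems such as the Low-Bandwidth Network File System use a Rabin fingerprint instead of a plain sum.

```python
def split_by_content(data: bytes, window: int = 4, mask: int = 0x0F,
                     min_len: int = 2, max_len: int = 64) -> list[bytes]:
    """Cut `data` into variable-length chunks at content-defined boundaries.

    Hypothetical illustration: a boundary is declared where the sum of the
    last `window` bytes has its low bits (selected by `mask`) all zero.
    """
    chunks = []
    start = 0  # index where the current chunk begins
    for i in range(len(data)):
        length = i - start + 1
        # Wait until the chunk is long enough to hold a full window.
        if length < max(min_len, window):
            continue
        fingerprint = sum(data[i - window + 1 : i + 1])
        # Cut on a fingerprint match, or force a cut at the maximum length.
        if (fingerprint & mask) == 0 or length >= max_len:
            chunks.append(data[start : i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


sample = bytes(range(256)) * 4
pieces = split_by_content(sample)
```

Note that the function receives `data` as one contiguous `bytes` object. This mirrors the requirement stated above: if the same bytes were scattered across separate fixed-length cache blocks, they would first have to be copied into one buffer before they could be split.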



Examples


Embodiment Construction

[0080] The file system caching method with variable data block length of the present invention is described in detail below with reference to the accompanying drawings:

[0081] (1) Set up a global cache index in the computer memory, using a hash table as its data structure. Each record in the hash table corresponds to a file currently opened by the computer operating system. The key of each record is the file identifier of the opened file, that is, the inode number of the opened file in the file system, and the value of each record is the index tree of the data blocks contained in the opened file. With this global cache index, the data block index tree of an opened file can be found with O(1) complexity.
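The global cache index described in this step can be sketched as follows. This is a minimal illustration only: the class names `GlobalCacheIndex` and `BlockIndexTree` and their methods are hypothetical, and a Python `dict` stands in for the hash table.

```python
class BlockIndexTree:
    """Hypothetical placeholder for the per-file data block index tree."""

    def __init__(self) -> None:
        self.blocks = []


class GlobalCacheIndex:
    """Hash table mapping the inode number of an opened file to its index tree."""

    def __init__(self) -> None:
        self._by_inode: dict[int, BlockIndexTree] = {}

    def open_file(self, inode: int) -> BlockIndexTree:
        # Create the record on first open; reuse it if the file is already open.
        return self._by_inode.setdefault(inode, BlockIndexTree())

    def lookup(self, inode: int):
        # Average-case O(1) lookup, as for any hash table.
        return self._by_inode.get(inode)

    def close_file(self, inode: int) -> None:
        # Drop the record when the file is closed.
        self._by_inode.pop(inode, None)


index = GlobalCacheIndex()
tree = index.open_file(42)  # 42 is an arbitrary example inode number
```

On a POSIX system the key could be obtained from an open file descriptor with `os.fstat(fd).st_ino`.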

[0082] The data block index tree adopts an array-based balanced binary search tree structure, and the elements in the index tree are arranged in ascending order of the starting offset of the data block c...
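Because the paragraph above is truncated in the source, the following is only a sketch of how an offset-ordered index of variable-length blocks might be searched. It swaps in a sorted Python list with `bisect` as a stand-in for the array-based balanced binary search tree, and the names `CachedBlock` and `BlockIndex`, as well as the assumption that cached blocks do not overlap, are not taken from the source.

```python
import bisect
from dataclasses import dataclass


@dataclass
class CachedBlock:
    offset: int   # starting position of the block within the file
    data: bytes   # block contents; the length varies from block to block

    @property
    def end(self) -> int:
        return self.offset + len(self.data)


class BlockIndex:
    """Blocks kept in ascending order of starting offset (assumed non-overlapping)."""

    def __init__(self) -> None:
        self._offsets: list[int] = []
        self._blocks: list[CachedBlock] = []

    def insert(self, block: CachedBlock) -> None:
        pos = bisect.bisect_left(self._offsets, block.offset)
        self._offsets.insert(pos, block.offset)
        self._blocks.insert(pos, block)

    def find(self, file_offset: int):
        """Return the cached block covering `file_offset`, or None on a miss."""
        # Locate the last block whose starting offset is <= file_offset.
        pos = bisect.bisect_right(self._offsets, file_offset) - 1
        if pos < 0:
            return None
        block = self._blocks[pos]
        return block if file_offset < block.end else None


idx = BlockIndex()
idx.insert(CachedBlock(100, b"abcde"))
idx.insert(CachedBlock(0, b"xyz"))
```

The binary search makes each lookup O(log n) in the number of cached blocks, which is the property the ordering by starting offset is meant to provide.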


Abstract

The invention relates to a caching method for a file system with variable data block length, belonging to the technical field of computer data storage. The method comprises the following steps: a driver module of a user-space file system intercepts operation commands from a computer application program, where the operation commands comprise opening an existing file, reading data from an opened file, writing data into an opened file, modifying file attributes, and closing an opened file; a cache management module runs as a daemon process; and part of the recently read and written data of each currently opened file is temporarily kept in the computer's memory in data blocks of arbitrary length, thereby reducing how often the file system accesses external storage devices and improving the efficiency and throughput of data access in the file system. The method makes the data reconstruction and data segmentation processes transparent to the user and to computer application programs, so that the file system can support data deduplication without affecting compatibility, effectively improving the utilization of physical storage space.
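The effect described in the abstract, fewer accesses to external storage because recently used data stays in memory in blocks of arbitrary length, can be illustrated with a small read-through cache. This is a sketch under stated assumptions: the class `VariableBlockCache`, its `backing_reads` counter, and the in-memory `bytes` object standing in for the external storage device are all hypothetical.

```python
class VariableBlockCache:
    """Read-through cache whose blocks are whatever byte ranges were requested."""

    def __init__(self, backing: bytes) -> None:
        self._backing = backing               # stands in for external storage
        self._blocks: dict[int, bytes] = {}   # start offset -> cached bytes
        self.backing_reads = 0                # number of simulated device accesses

    def read(self, offset: int, length: int) -> bytes:
        # Serve the request from a cached block if one fully covers it.
        for start, data in self._blocks.items():
            if start <= offset and offset + length <= start + len(data):
                rel = offset - start
                return data[rel : rel + length]
        # Miss: go to the backing store and keep the result as a new block.
        self.backing_reads += 1
        data = self._backing[offset : offset + length]
        self._blocks[offset] = data
        return data


cache = VariableBlockCache(b"0123456789")
first = cache.read(2, 5)    # miss, fetched from the backing store
second = cache.read(3, 2)   # hit, served from the cached 5-byte block
```

The linear scan over cached blocks keeps the sketch short; an offset-ordered index would replace it in any realistic design.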

Description

Technical field

[0001] The invention relates to a file system caching method with variable data block length, which belongs to the technical field of computer data storage.

Background technique

[0002] The file system is an important part of the computer system. It is responsible for organizing, managing, and storing files on external storage devices (such as hard disks and USB flash drives) to achieve persistent storage of data. However, storing data directly on external devices is not an efficient choice. First, external storage devices cannot support random reads and writes at byte granularity as internal memory (RAM) does; instead, they use a block as the basic operation unit for reading and writing data, and a block usually contains thousands or even hundreds of thousands of bytes. Moreover, the data access (I/O) speed of external storage devices is generally several orders of magnitude slower than that of internal memory. It is far behind the speed of CPU processing data and has b...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F12/08, G06F17/30, G06F12/0886
Inventors: 唐力, 汪东升
Owner: TSINGHUA UNIV