Performance optimization method for remote file data access based on client-side high-efficiency cache

A file data and cache data technology, applied in the computer field, can solve problems such as difficult to cope with IO load, achieve the effect of improving cache utilization, improving access performance, and reducing access volume

Active Publication Date: 2021-12-17
BEIHANG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The client uses a data block-based structure to organize and manage the cache, which is difficult to cope with the increasingly intensive IO load from the application. Different data blocks of small files in the LRU queue are isolated from data blocks of other files. Compression and deduplication are used in the cache. space utilization in such a way that applications and caching systems compete for computing resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Performance optimization method for remote file data access based on client-side high-efficiency cache
  • Performance optimization method for remote file data access based on client-side high-efficiency cache
  • Performance optimization method for remote file data access based on client-side high-efficiency cache

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0064] Such as figure 1 Shown is the implementation flowchart of the present invention. A method for optimizing the performance of remote file data access based on high-efficiency caching on the client side, comprising the following steps:

[0065] 1) When accessing file data, the client first accesses the management node of the wide-area file system to obtain global metadata (including user data space information, such as space name, location, etc.), caches global metadata according to spatial locality, and caches global metadata according to remote The feedback information of the access dynamically updates the global metadata cache;

[0066] 2) Dynamically identify the IO load characteristics of the application, and adaptively select a reasonable data caching strategy;

[0067] 3) Set up two-level caches, cache remote file data based on spatial locality,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention proposes a remote file data access performance optimization method based on high-efficiency caching on the client side. The global metadata and file data are cached on the client side based on the principle of locality; the file is cached using a hybrid caching strategy based on data blocks and files. Data is cached; the cache is adaptively compressed according to the usage status of computing resources; a copy file is generated locally for frequently accessed data; the size of the cache is dynamically adjusted based on the IO load characteristics of the application; the expired cache data is fine-grained Update; based on the user's access characteristics and application load characteristics, the associated files are mined, and the index nodes of the associated files are aggregated into a large index node and cached on the client. This method can significantly improve the performance of remote file data access.

Description

Technical field: [0001] The invention discloses a remote file data access performance optimization method based on high-efficiency caching of a client, relates to challenges faced by wide-area high-performance computing, and belongs to the field of computer technology. Background technique: [0002] In a wide-area high-performance computing environment, when accessing remote file data, it is necessary to send a data request to the remote server through the client and access it. Client-side caching is a common method for improving IO performance in remote file data access. The main idea is to cache recently accessed data on the client based on the principle of data locality. When the cached data is accessed again, it can avoid network transmission and thereby Improve access performance. Most of the research mainly optimizes cache performance from two aspects: cache data organization structure and cache data management. [0003] In terms of the organization structure of the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/172G06F16/174G06F16/13
CPCG06F16/172G06F16/1744G06F16/13
Inventor 肖利民韦冰秦广军霍志胜宋尧周汉杰
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products