Unlock instant, AI-driven research and patent intelligence for your innovation.

A Method for Predicting User Hot Data Access for Massive Small Files

A massive small file and file access technology is applied in the field of user hot data access prediction for massive small files, which can solve the problems of low reading efficiency in distributed storage systems, reduce I/O times and improve reading efficiency Effect

Active Publication Date: 2019-11-05
HARBIN INST OF TECH AT WEIHAI +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the problems existing in the above-mentioned prior art, the present invention provides a user hotspot data access prediction method for massive small files, which solves the problem of low reading efficiency of the distributed storage system in the environment of massive small files

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Predicting User Hot Data Access for Massive Small Files
  • A Method for Predicting User Hot Data Access for Massive Small Files
  • A Method for Predicting User Hot Data Access for Massive Small Files

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below with reference to the accompanying drawings and preferred embodiments. However, it should be noted that many of the details listed in the specification are only for readers to have a thorough understanding of one or more aspects of the present invention, and these aspects of the present invention can be implemented even without these specific details.

[0041] A user hotspot data access prediction method for massive small files provided by this embodiment, its flow chart is as follows figure 1 As shown, the method includes the following steps:

[0042] (1) Read the file access log generated by the distributed massive small file storage system to obtain the file access history sequence. In this embodiment, the file logs generated by the distributed massive small file storage system are stored in the proxy node i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a user hot data access prediction method for massive small files, starting from the characteristics of user access data, according to the correlation characteristics of file access, using user-related distributed massive small file storage system file access logs, training The Skip-Gram model extracts the contextual access features of files, uses the K-means algorithm to cluster file features, conducts centralized analysis of files with high access similarity, trains the GRU model, analyzes the correlation between files, and According to the category sequence of the user's current access file, it will prefetch all the files in the file category that the user may access in the future to the cache, reducing the number of I / O of the system and improving the reading efficiency of the distributed massive small file storage system as a whole. .

Description

technical field [0001] The invention relates to the field of computers, in particular to a user hotspot data access prediction method for massive small files. Background technique [0002] The rapid development of smart devices and e-commerce has brought about a sharp increase in the number of small files. According to the report of the International Data Center, the world has entered the ZB era, and the global data volume has doubled within two years. A small file refers to a file size between 10KB and 512KB. In most cases, these massive small files are stored in a distributed storage system so that users can use any device that can access the network to access these files. The cloud storage system reduces the user's demand for local storage capacity and ensures that the files accessed by the user are all the latest copies. However, in the storage environment of massive small files, the user's file access operation presents high concurrency characteristics. [0003] Tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/172G06F16/182G06K9/62
CPCG06F16/172G06F16/182G06F18/23213G06F18/214
Inventor 朱东杰杜海文李晓芳刘海青章江山王玉华孙云栋张凯
Owner HARBIN INST OF TECH AT WEIHAI