Unlock instant, AI-driven research and patent intelligence for your innovation.

I/O requests area forecasting method based on sequence-degree clustering algorithm and time series

A time series and region technology, applied in the field of storage, can solve the problems of not knowing semantic information, pre-fetching data, etc.

Inactive Publication Date: 2009-12-23
HUAZHONG UNIV OF SCI & TECH
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the existing file system, the storage system at the device layer does not know any semantic information of I / O access, so it cannot make full use of the semantic information of I / O access to prefetch the data to be accessed at the next moment. Use simpler methods such as locality of I / O access, sequential access, and iterative access to achieve simple predictions
At the same time, we know that in I / O-intensive applications, I / O accesses have bursty characteristics, and we can prefetch the next read request-intensive time at a time when I / O access is less (system idle time) All the data to improve the performance of the system, the large-capacity cache in the existing storage system also makes this method possible

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • I/O requests area forecasting method based on sequence-degree clustering algorithm and time series
  • I/O requests area forecasting method based on sequence-degree clustering algorithm and time series
  • I/O requests area forecasting method based on sequence-degree clustering algorithm and time series

Examples

Experimental program
Comparison scheme
Effect test

example

[0103] In the test of this invention, a section of trace file of HP Company was utilized, which recorded the situation of 3 disk volumes (volume numbers 21, 23 and 35) of the client access server under the typical office environment, and the time span was 191.12 hours , with 230370 requests. to a 1 、a 2 and a 3 , the present invention adopts the method of fuzzy quantification, and assigns them as 1, 2 and 3 respectively.

[0104] image 3 Shown are the I / O request points and the volume core results for volume 21 with a time span of 191.12 hours. It can be seen from the figure that five effective clusters, namely C0-C4, are obtained by the continuity-based algorithm. During the whole monitoring process, the number of clearing requests for actual clustering is 45523, accounting for 91.16% of the number of requests in the whole process.

[0105] Figure 4 It reflects the clustering area prefetch hit rate process in the whole test process. It can be seen that the prefetch ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an I / O area prefetching method based on continuity clustering and time series. According to the feature of locality of the I / O request, the invention predicts the area of ​​intensive I / O access when the system is idle, and prefetches the predicted area. It uses a clustering algorithm based on continuity, which can efficiently and reliably discover areas of intensive read requests; secondly, it uses the ARMA time series model to predict the areas and access times that future intensive read requests may visit. Under the same test environment, the storage system of the present invention and the existing RAID system have been compared and tested, and the load test using 3 disk volumes shows that: the clustering algorithm of the present invention can correctly cluster read requests; simultaneously based on AMRA Accurate prefetch-intensive I / O areas for dynamic forecasting algorithms for time series forecasting models. The invention can practically and efficiently discover the areas accessed by intensive read requests, accurately prefetch the areas that may be accessed by intensive read requests, and greatly improve the performance of the storage system.

Description

technical field [0001] The invention relates to the field of storage, in particular to an I / O area prefetching method based on continuity clustering and time series. Background technique [0002] Although the parallel I / O system (disk array) can greatly improve the performance of the storage system, there is still a large performance gap between them due to the inherent defects of the disk (longer seek delay and rotation delay). . As a basic implementation technology, caching can make up the performance gap between them very well, but simply increasing the capacity of the cache is not an effective solution to the problem. The prefetching technology predicts future data requests and stores The data blocks in the device are fetched into the cache before they are used, so that the request can always be hit in the cache to reduce the CPU pause time, and the contention for the disk or channel is eliminated by predicting the cache miss and overlapping I / O technology, reducing the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F3/06G06F17/30
Inventor 谢长生李怀阳刘艳黄建忠蔡斌
Owner HUAZHONG UNIV OF SCI & TECH