Transcript placement method based on file heat analysis and K-means

A file and heat technology, applied in the field of cloud computing, can solve the problems of not taking into account topology, data distribution network bandwidth, node storage capacity, and the impact of file size and network bandwidth access delay, etc., to reduce IO congestion. , reduce the response time, improve the effect of rationality

Inactive Publication Date: 2016-05-11
NANJING UNIV OF INFORMATION SCI & TECH
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These strategies can reduce access latency in most cases, but the waterfall strategy, caching waterfall strategy, and rapid expansion strategy are only applicable to data grids where data is stored on the top-level nodes, and the best client strategy and common cache strategy do not take into account Topologica

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Transcript placement method based on file heat analysis and K-means
  • Transcript placement method based on file heat analysis and K-means
  • Transcript placement method based on file heat analysis and K-means

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Below in conjunction with accompanying drawing, the implementation of technical scheme is described in further detail:

[0025] A copy placement method based on file heat analysis and K-means according to the present invention will be further described in detail in conjunction with the flow chart and the implementation case.

[0026] This implementation case uses file heat analysis and K-means algorithm to adjust and place copies in distributed systems or cloud environments. Such as figure 1 As shown, this method includes the following steps:

[0027] Step 1), according to the execution time of the task, select the minimum value as the time period of heat analysis, and analyze the access frequency of the file in this time period;

[0028] Step 101), in a distributed system or cloud environment, the execution time of different tasks is different. When performing file heat analysis, when a task is completed, a copy adjustment can be performed, and the last task executio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a transcript placement method based on file heat analysis and K-means. At first, The access frequency of a file in a given period of time is analyzed to calculate the access heat of the file. A possible file with high access heat is predicated by the access heat of the file, in combination with the K-means algorithm, and the number and placement positions of file transcripts are dynamically adjusted on demand by comprehensively considering a statistical cycle, a file size, a working environment and other factors. The transcript placement method provided by the invention can be used for effectively shortening the average response time of file access and improving the data service performance.

Description

technical field [0001] The invention belongs to the field of cloud computing, and specifically relates to a method for dynamically adjusting and placing copies of high-hot files in a cloud environment by using heat statistical analysis and K-means algorithm. Background technique [0002] With the development of society and the improvement of computer storage and data processing capabilities, the explosive growth of data has become an important feature of today's era. According to the estimate of data growth by International Data Corporation (IDC), 40ZB will be generated by 2020 (1ZB=1.1805916207174113×10 21 The data of B) is equivalent to 5247GB per capita on the earth (http: / / datacenter.watchstor.com / infra-143421.htm). Facing the ever-increasing mass of data, the subsequent storage and management of massive data has also received more and more attention. [0003] In order to improve the reliability and access efficiency of the system, the commonly used copy technology cop...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/128
Inventor 马廷淮李坚田伟金子龙
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products