Cache method and system oriented to distributed sequence list

A distributed sequential table and cache system technology, applied in database distribution/replication, special data processing applications, instruments, etc., can solve problems such as low data read throughput, low cache hit rate, and difficulty in accurately predicting the future.

Active Publication Date: 2015-11-11
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The various algorithms mentioned above determine the swap-in and swap-out strategy based on the past access history of the data, but it is difficult to accurately predict the future based on the access history of the data alone, resulting in a low cache hit rate and poor cache performance. The data reading throughput is not high. Therefore, to further improve the cache hit rate, some other information needs to be considered. In the specific implementation of the distributed sequence table, the access mode characteristics of the query operation can be used to improve the cache hit rate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cache method and system oriented to distributed sequence list
  • Cache method and system oriented to distributed sequence list
  • Cache method and system oriented to distributed sequence list

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Aiming at the deficiencies of the prior art, the present invention proposes a caching method and system oriented to a distributed sequence table. The present invention obtains the data range information involved in each query through the unique storage structure of the distributed sequence table, and then uses the information to And combined with the data being accessed to construct a real-time dynamic score for each data, the score represents the number of times the corresponding data will be accessed in the future, the more access times, the more it should be kept in the cache, and finally use the score of each data To guide the data swap-in and swap-out strategy, the following are the detailed steps of the method of the present invention, as follows:

[0034] The inventive method comprises:

[0035] Step 1, set the basic score for all data blocks (this basic score is not limited, it can be any integer), when the data query operation is performed, the data range invol...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of distributed information processing, in particular to a cache method and system oriented to a distributed sequence list. The method comprises the following steps: setting basic scores for all data blocks; obtaining a data range related to a data query operation through the distributed sequence list when the data query operation is carried out; carrying out to-be-accessed access frequency statistics on each data block in the data range and accumulating the access frequency to corresponding data blocks; when the data blocks are targeted by a cache, subtracting a fixed value from the scores of the targeted data blocks; when a cache space is full, triggering a swap-out operation of the cache; and when the swap-out operation starts, sorting according to the scores of the data blocks and weeding out from the data block with the lowest score until data blocks can be increased to the cache space.

Description

technical field [0001] The invention relates to the technical field of distributed information processing, in particular to a caching method and system for a distributed sequence table. Background technique [0002] The existing NoSQL cache system generally adopts the LRU cache algorithm or an algorithm improved from the LRU cache algorithm, as follows: [0003] LRU algorithm: The LRU algorithm is based on the historical access records of the data to determine the strategy of swapping in and swapping out data. The basic idea is: the data has been accessed recently, and the probability of being accessed in the future is higher. In the specific implementation, Generally, LinkedList is used as the storage container. After being accessed, the cached object will be put back on the top of the cache. When the object is swapped out, it will start from the bottom of the cache. The specific process of the algorithm is as follows: (1) The new cached object is always placed on the top o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/24552G06F16/2471G06F16/27
Inventor 查礼郑忠诚程学旗
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products