A key-value data indexing method for workload-adaptive single-layer lsmt

A data indexing and workload technology, applied in the field of database systems, can solve problems such as high efficiency, few researchers, and no sense of workload, and achieve low read and write amplification factors, reduce system delay, and high read and write throughput Effect

Active Publication Date: 2022-06-21
ZHEJIANG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The problem of workload insensitivity refers to the fact that the existing LSMT system cannot make a more appropriate optimization of the storage structure according to the distribution of reads and writes in the workload
[0003] In order to solve the problem of read and write amplification, researchers have proposed many methods, but these methods generally sacrifice read amplification in exchange for reduced write amplification (such as WiscKey, PebblesDB), and cannot guarantee efficient reading and writing.
As for the problem of unawareness of workload, few researchers have studied and solved it

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A key-value data indexing method for workload-adaptive single-layer lsmt
  • A key-value data indexing method for workload-adaptive single-layer lsmt
  • A key-value data indexing method for workload-adaptive single-layer lsmt

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The technical solutions of the present invention will be further described below with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are only used to explain the present invention, and are not intended to limit the present invention.

[0024] like figure 1 It is a flowchart of the workload adaptive key-value data indexing method on a block storage device of the present invention, and the method specifically includes the following steps:

[0025] (1) Modify the design of the LSMT storage structure, including the following sub-steps:

[0026] (1.1) Remove the middle layer of the LSMT multi-layer structure, keep the last layer, and use the last layer as the storage layer L0; replace the original fixed-capacity memory table with a dynamic-capacity memory table, and other structures remain basically unchanged: retained The log file is used to restore the memory table after a crash. The capacity value of the dy...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a key-value data indexing method of a workload adaptive single-layer LSMT. This method optimizes the traditional Log-Structured-Merge Tree (LSMT), removes the multi-layer design and fixed memory table capacity design, and introduces the design of single-layer LSMT and dynamic capacity memory table. In this method, write operations are first written to the log file on the storage device in a sequential manner, and then the memory table is modified. When the size of the memory table reaches the capacity limit, it is converted into a read-only memory table, and the read-only memory table is merged into the single-layer LSMT structure on the storage device in a background thread. On this basis, the method can automatically optimize the storage structure according to the key-value read-write distribution in the workload. The indexing method can reduce the amplification of reading and writing to the storage device at the same time, and improve the system throughput and the service life of the storage device. At the same time, adaptive optimization is made for the workload to further improve system performance.

Description

technical field [0001] The invention belongs to the technical field of database systems, and in particular relates to a key-value data indexing method for workload adaptive single-layer LSMT. Background technique [0002] The key-value storage system based on Log-Structured-Merge Tree (LSMT) is widely used in data-intensive Internet applications due to its excellent processing-intensive write capability. However, the existing LSMT-based storage systems generally have problems of amplification and workload insensitivity. The amplification problem means that the amount of data that the user requests to read / write is much smaller than the amount of data that the system actually needs to read / write on the storage device, and the amplification factor can be used to quantify this problem. The problem of workload insensitivity refers to the fact that the existing LSMT system cannot optimize the storage structure more appropriately according to the read and write distribution in ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22G06K9/62G06N3/04G06N3/08
CPCG06F16/2228G06N3/08G06N3/044G06N3/045G06F18/23
Inventor 陈珂周信静寿黎但骆歆远伍赛江大伟陈刚
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products