Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Key value storage method and system oriented to LSM tree

An LSM tree and key-value technology, applied in file systems, digital data information retrieval, special data processing applications, etc., can solve problems such as write amplification, reduce SSD lifespan, and increase write delay, so as to reduce write pauses and maximize writes. Performance, the effect of reducing write amplification

Pending Publication Date: 2021-08-24
NANJING UNIV
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Excessive write amplification seriously affects the write performance of the LSM tree
However, LSM trees are often run in computers using SSDs, and frequent hard disk reads and writes will reduce the lifespan of SSDs.
In summary, for the LSM tree structure, write amplification is a very serious problem
On the other hand, when memory buffer and disk L 1 When the amount of data exceeds the threshold, the memory data cannot be flushed and must wait for L 1 Complete a compaction to free up a certain space for this layer before serving new write requests, resulting in write pauses, that is, a significant increase in phased write delays

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Key value storage method and system oriented to LSM tree
  • Key value storage method and system oriented to LSM tree
  • Key value storage method and system oriented to LSM tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0073] figure 1 is a schematic diagram of an LSM tree according to an embodiment of the present invention. As shown in the figure, there is a buffer in memory to receive user write requests. WAL is a disk write-ahead log set up to avoid buffer data loss when the program crashes. The data on the disk is divided into three levels (L 1 , L 2 , L 3 ). Each layer is further divided into three sub-layers. Each sub-level contains multiple SSTable files. The data in the sub-level is ordered, and the data between different sub-levels has no relationship. This is equivalent to relaxing the ordering of the original LSM tree, the data in each level of the original LSM tree is strictly ordered, and the data in each level of the LSM tree of the present invention is divided into multiple Smaller ordered groupings.

[0074] figure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a key value storage method and system oriented to an LSM tree. The method comprises the following steps: performing fine-grained division on disk levels, and setting a compaction strategy as follows: in a compaction task, all upper sub-levels participate in the task, and only one sub-level participates in the task at a lower layer, so as to reduce the proportion of lower-layer participation data to total participation data; divding the compaction task when the compaction task is executed, so that the number of files participating in the compaction task is reduced, and the degree of parallelism of the compaction is improved. The influence on the reading performance is reduced through a parallel reading algorithm, modeling is carried out through write amplification of the LSM tree, and a method for selecting parameters enabling write amplification to be minimized is provided.

Description

technical field [0001] The invention relates to computer storage technology, in particular to an LSM tree-oriented key-value storage method and storage system. Background technique [0002] Key-Value Store (Key-Value Store) stores data as a <key-value> collection, where the key acts as a unique identifier for the value. It does not support complex relational schemas like relational databases, but processes data through simple interfaces such as Put(k,v), Get(k), Update(k,v), and Delete(k). Due to its high performance, high scalability and other advantages, it plays an important role in today's network applications and distributed systems, and is widely used in graph databases, task queues, stream processing engines, application data caches, and event tracking systems. and other fields. [0003] LSM tree (Log-Structured Merge tree) is a storage engine widely used in key-value storage systems. It maintains a cache in memory. When a user writes a key-value pair, the da...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/13G06F16/172
CPCG06F16/13G06F16/172Y02D10/00
Inventor 王宏超叶保留唐斌陆桑璐
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products