Massive data storage method simultaneously applicable to disk and solid state disk reading and writing features

A technology of solid-state hard disk and mass data, which is applied in the direction of electrical digital data processing, input/output process of data processing, instruments, etc. The effect of enhanced read performance and reduced write amplification

Active Publication Date: 2017-05-24
硬石科技(武汉)有限公司
View PDF3 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The problem to be solved by the present invention is: the larger write amplification of the traditional tree makes the problem of low random write efficiency, and the larger write amplification in the solid-state hard disk also seriously affects the life of the solid-state hard disk

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Massive data storage method simultaneously applicable to disk and solid state disk reading and writing features
  • Massive data storage method simultaneously applicable to disk and solid state disk reading and writing features
  • Massive data storage method simultaneously applicable to disk and solid state disk reading and writing features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The core problem to be solved by the present invention is: the large write amplification of the traditional tree makes the writing performance or the mixed performance of reading and writing low. The large write amplification in the solid-state hard disk also seriously affects the life of the solid-state hard disk. The present invention solves the above-mentioned problems by changing the complete sorting of records in a block to partial partial ordering, and adding a Bloom filter at the end of each block to minimize the impact of the scheme on read performance.

[0064] figure 1 It is a basic architecture diagram of the storage method provided by the embodiment of the present invention, which is divided into a memory part and a disk part. The memory includes one mutable memory cache and one immutable memory cache, as well as the metadata information of the tree. The metadata information of the tree describes the metadata of each block in the tree. The meta informatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a massive data storage method simultaneously applicable to disk and solid state disk reading and writing features. Full sequencing of records in each block is changed into partial sequencing, a Bloom filter is added to the tail portion of each block, a Log-Structured Append-Tree is created, when the quantity of data stored in each block in the tree reaches a threshold and data in the block is directly added to corresponding child blocks, the data of the child blocks is composed of multiple collating sequences rather than full sequencing is achieved in the blocks in a merging sorting mode; each block in the tree stores one Bloom filter. According to the method, on the condition that no other properties are sacrificed, write amplification is greatly reduced, and the random writing efficiency is greatly improved. Besides, the service life of a solid state disk is better protected and prolonged. In read and write mixed scenes, the random read property is also enhanced, and the method has important market value.

Description

technical field [0001] The invention belongs to the field of mass data storage, and in particular relates to a storage tree. The method can simultaneously adapt to the reading and writing characteristics of disks and solid state hard disks. Background technique [0002] Commonly used index trees on existing hard disks include B-tree, LSM-tree, buffer-tree, and the like. Among them, B-tree is a traditional classic tree, but because it inevitably randomly writes to disk in random write scenarios, its performance is low when storing massive data, so its variant is often used when storing massive data, such as BigTable for A variant of B-tree is used in combination with LSM-tree. For the storage of massive data, LSM-tree or buffer-tree (also known as fractal-tree) is often used as the index tree. The common feature of the two is to postpone the writing of the records to be written, and wait until a certain amount is accumulated. Batch processing. This can better solve the pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/06
CPCG06F3/061G06F3/064G06F3/0685
Inventor 龚才鑫龚奕利
Owner 硬石科技(武汉)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products