A Construction Method of Bloom Filter for Multi-layer Database Based on Hierarchical Allocation

A technology of bloom filter and construction method, which is applied in the fields of electrical digital data processing, digital data information retrieval, special data processing applications, etc., can solve the problems of reducing storage space, that is, memory overhead, etc.

Active Publication Date: 2019-10-22
PINGCAP XINGCHEN (BEIJING) TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to propose a method for constructing a Bloom filter based on hierarchical allocation of multi-layer databases, so as to make up for the existing defect of assigning fixed bits to each key in each file in each layer, and to ensure that the distribution On the premise that the storage space overhead of the Bloom filter is the same as before, reduce the hard disk input / output (I / O) operations caused by the false positives of the Bloom filter in the query operation of LevelDB, improve the query operation speed of LevelDB, or in Ensure that the overall false positive rate of the LevelDB Bloom filter is basically the same as before, and reduce the storage space, that is, the memory overhead

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Construction Method of Bloom Filter for Multi-layer Database Based on Hierarchical Allocation
  • A Construction Method of Bloom Filter for Multi-layer Database Based on Hierarchical Allocation
  • A Construction Method of Bloom Filter for Multi-layer Database Based on Hierarchical Allocation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] The Bloom filter bit number allocation method based on layered allocation of the present invention will be further described in detail in conjunction with the accompanying drawings through specific embodiments.

[0032] Example 1:

[0033] In this embodiment, the method for constructing a Bloom filter based on a hierarchically allocated multi-layer database specifically includes the following steps.

[0034] The first step: storage structure design, and corresponding initialization, using:

[0035] L--Indicates the number of layers of the multi-layer key-value database (LevelDB),

[0036] B - the average number of Bloom filter bits assigned to each key,

[0037] Si—LevelDB storage space size of the i(0<=i

[0038] SL---the total size of LevelDB,

[0039] R—the growth multiple of the LevelDB interlayer size,

[0040] Bi—the number of Bloom filter bits assigned to the key at level i,

[0041] F(Bi)---The false alarm rate of the Bloom filter when the number...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a bloom filter construction method based on a hierarchical and distributive multi-layer database. The method is characterized by comprising the steps of designing and constructing a storage structure, calculating a combination probability sum, solving the digit distributed to each layer of a Bloom filter and maintaining the storage structure; when a union operation is carried out, more Bloom filter bit digit is distributed to each key at high layer located at a multi-layer key value database (LevelDB), and less Bloom filter bit digit is distributed to keys at lower layer, wherein the false alarm rate of the Bloom filter is reduced with the increasing of the bit digit. Compared with an LevelDB traditional Bloom filter construction method, when memory overhead is ensure to be consistent with traditional memory overhead, according to the method, hard disk input and output (I / O) operation caused by false alarm of the Bloom filter can be reduced, the times of I / O of the hard disk are reduced, and average delay of LevelDB query operation is reduced; when expected times of I / O of the hard disk are ensured to be basically equal to that of a traditional method in the process of query, the memory overhead brought about by the Bloom filter is reduced.

Description

technical field [0001] The invention belongs to the technical field of computer data storage, and in particular relates to a Bloom filter construction method for hierarchically allocating the number of bits of a Bloom filter of a multi-layer key-value database system using a log merging tree as a main body. Background technique [0002] The Bloom filter (Space / time trade-offs in hash) introduced by the American Association for Computing Machinery's "1970 ACM CACM International Journal" (Communications of the ACM, Volume 13, July 1970, Question 7, Pages 422-426) coding with allowable errors) is an extremely space-efficient probabilistic data structure for checking the presence or absence of an element in a collection. When the multi-layer key-value database system (LevelDB) based on the log merge tree deployed on the hard disk that is open sourced by Google implements and uses the Bloom filter technology, it stores the key-value pairs in the file (SSTable) for each layer. A ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/2453
Inventor 李永坤张月明张伟韬吕敏许胤龙
Owner PINGCAP XINGCHEN (BEIJING) TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products