Self-adaptive learning type indexing method for workload in memory database

An adaptive learning and workload technology, applied in the field of database systems, can solve the problems of limited learning index usage scenarios, weak insert and update support, and increased operation delay, to achieve efficient point query and range query performance, good insert and Update, reduce the effect of impact

Active Publication Date: 2021-06-25
ZHEJIANG UNIV
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the biggest problem with learning indexes is that they have weak support for insert and update, which greatly limits the usage scenarios of learning indexes.
Existing research work on inserting and updating learning-type indexes often requires a large amount of movement of key values ​​or a large-capacity buffer, which will bring large overhead and increase the delay of various operations, while the memory database is essentially for the pursuit of Designed for faster system response, reducing the cost of data insertion and update is very urgent for the application of learning index in memory database

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Self-adaptive learning type indexing method for workload in memory database
  • Self-adaptive learning type indexing method for workload in memory database
  • Self-adaptive learning type indexing method for workload in memory database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The technical solutions of the present invention will be further described below with reference to the accompanying drawings, and it is understood that the specific embodiments described herein are merely intended to illustrate the invention.

[0025] First, the formal definition of the segment linear interpolation model that satisfies the maximum error is given:

[0026] Give a set of ordered collection of M different keys u = {x 1 , x 2 , ..., x m }, The array of let a is a size n (n ≥ M) and an orderly store U medium key (allowed repetition). For each key X in the collection U i , Use P i = Rank (x i ) Indicates an orderless array A and less than X i Number of keys, ie x i The subscript of the first start position in the array, order D i = (X i , P i ), Construct training set D = {D 1 , D 2 , ..., D m }. Define S [i, j] = {d i , D i+1 , ..., D j }, Given the maximum error priority, the target is to divide D as little K segment s as possible in the premise of satisfying t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a self-adaptive learning type indexing method for workloads in a memory database. According to the method, a cardinal number tree and a piecewise linear model with a maximum error bound are combined, memory occupation of indexes is reduced by utilizing data distribution through a machine learning model, and meanwhile stable query performance is kept. On this basis, an efficient insertion buffer is used to reduce the cost of data insertion update, and in order to alleviate the influence of data insertion on index performance, two workload adaptive recombination optimization methods are used to optimize hotspot data involved in point query and range query in workloads in a targeted manner. The method has relatively high construction efficiency and relatively low memory occupation, relatively efficient query performance is also ensured, and insertion and updating can be well supported; And meanwhile, recombination optimization is performed in a targeted manner by sensing the query workload, so that the influence of insertion on the index performance is reduced at a relatively low cost.

Description

Technical field [0001] The present invention belongs to the technical field of database systems, and in particular to a high efficiency index method in a memory database. Background technique [0002] The index is used as a data structure for efficient retrieval, and has been widely used in the memory database system. However, due to the explosive growth of the amount of data, the number of memory consumptions in the index affects the performance and scaife of the database system. Reduce the memory of the index, while maintaining the efficient and stable query performance of the index has important practical significance and application value. [0003] The proposal of the learning index provides a new opportunity to solve the above problems, and the size of the index can be significantly reduced by utilizing data distribution. But the biggest problem with learning index is to weaken the insertion and update, which greatly limits the use scenario of the learning index. The existin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/22
CPCG06F16/2246G06F16/2228Y02D10/00
Inventor 寿黎但陈井爽陈珂陈刚骆歆远
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products