Method for improving query efficiency of multi-table join in online aggregation

A join query and multi-table query technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the low efficiency of multi-table join queries, reducing the number of iterations and improving execution efficiency.

Active Publication Date: 2018-12-21
SOUTHEAST UNIV
Cites: 3; Cited by: 6

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a method for improving the efficiency of multi-table join queries in online aggregation, solving the problem of inefficient multi-table join queries by looking up matching tuples in an index.



Examples


Embodiment Construction

[0036] The present invention will be further illustrated below in conjunction with specific embodiments, and it should be understood that the following specific embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention.

[0037] A method for improving the efficiency of multi-table join query in online aggregation, the method includes the following steps:

[0038] Step S1: Build an index module: use a mixed integer linear programming model to select appropriate join attributes from the multi-table queries in the historical records, and build an index on each selected join attribute;

[0039] Step S2: Based on the index created in Step S1, design a multi-table join query algorithm, Index Ripple Join;

[0040] Step S3: Use the central limit theorem to perform interval estimation on the collected samples, thereby obtaining a confidence interval for the result of the multi-table join query.
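Steps S2 and S3 can be illustrated with a minimal sketch. It is not the patent's Index Ripple Join, whose details are not given in this excerpt. It is a simpler index-assisted sampling join: tuples are drawn at random from one table, an index on the join attribute of the other table supplies the join partners, and the central limit theorem yields a confidence interval for a SUM aggregate. The table layout, the names `build_index` and `estimate_join_sum`, and the demo data are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict
from statistics import NormalDist, mean, stdev


def build_index(rows, key_pos):
    """Hash index: join-key value -> list of rows holding that value."""
    index = defaultdict(list)
    for row in rows:
        index[row[key_pos]].append(row)
    return index


def estimate_join_sum(r_rows, s_index, n_samples, confidence=0.95, seed=0):
    """Estimate SUM(r.value * s.weight) over R JOIN S ON r.key = s.key.

    Draws n_samples tuples from R uniformly with replacement, probes the
    index on S for join partners, and returns (estimate, half_width) so
    that the interval is estimate +/- half_width.
    """
    rng = random.Random(seed)
    contributions = []
    for _ in range(n_samples):
        r_key, r_value = rng.choice(r_rows)
        matches = s_index.get(r_key, [])  # index probe instead of a scan of S
        contributions.append(sum(r_value * s_weight for _, s_weight in matches))

    scale = len(r_rows)  # scale the per-tuple mean up to the whole table
    estimate = scale * mean(contributions)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    half_width = z * scale * stdev(contributions) / math.sqrt(n_samples)
    return estimate, half_width


# Hypothetical demo data: R(key, value) and S(key, weight).
R = [(i % 50, float(i % 7)) for i in range(10_000)]
S = [(k, 1.0 + (k % 3)) for k in range(0, 50, 2)]
s_index = build_index(S, key_pos=0)

exact = sum(v * w for k, v in R for _, w in s_index.get(k, []))
estimate, half_width = estimate_join_sum(R, s_index, n_samples=2_000)
print(f"exact={exact:.1f} estimate={estimate:.1f} +/- {half_width:.1f}")
```

The interval narrows in proportion to 1/sqrt(n) as more samples are collected, which is what allows an online aggregation system to report a running estimate before the full join has been computed.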

[0041] Further, the ...


Abstract

The invention discloses a method for improving the efficiency of multi-table join queries in online aggregation. The method comprises the following steps: Step 1, constructing an index module, selecting suitable join attributes from the multi-table queries in the historical records through a mixed integer linear programming model, and building an index on the selected join attributes; Step 2, based on the index created in Step 1, designing a multi-table join query algorithm, Index Ripple Join; Step 3, using the central limit theorem to perform interval estimation on the collected samples, so as to obtain a confidence interval for the multi-table join query. The invention can effectively improve the efficiency of multi-table join queries in online aggregation.
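The attribute selection in Step 1 can be pictured as a 0-1 selection problem, which is a special case of a mixed integer linear program. The excerpt does not state the patent's actual model, so everything in the sketch below is an assumption: each candidate join attribute has a hypothetical benefit (for example, how much historical query work an index would save) and a hypothetical storage cost, and the goal is to maximize total benefit within a storage budget. Exhaustive enumeration stands in for a real MILP solver and is only practical for a handful of candidates.

```python
from itertools import combinations


def select_index_attributes(candidates, budget):
    """Pick the subset of attributes with maximum total benefit within budget.

    candidates: dict mapping attribute name -> (benefit, cost).
    Returns (chosen_set, total_benefit). Brute force over all subsets,
    used here as a stand-in for a MILP solver (assumption, not the
    patent's formulation).
    """
    names = sorted(candidates)
    best_set, best_benefit = (), 0.0
    for size in range(1, len(names) + 1):
        for subset in combinations(names, size):
            cost = sum(candidates[a][1] for a in subset)
            if cost > budget:
                continue  # violates the storage constraint
            benefit = sum(candidates[a][0] for a in subset)
            if benefit > best_benefit:
                best_set, best_benefit = subset, benefit
    return set(best_set), best_benefit


# Hypothetical candidates derived from a historical query log:
# attribute -> (estimated benefit, index storage cost).
candidates = {
    "orders.customer_id": (120.0, 4.0),
    "lineitem.order_id": (200.0, 6.0),
    "customer.nation_id": (30.0, 1.0),
    "part.supplier_id": (80.0, 5.0),
}
chosen, benefit = select_index_attributes(candidates, budget=10.0)
print(sorted(chosen), benefit)
```

With these made-up numbers, the two highest-benefit attributes exactly fill the budget, so they are selected together.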

Description

Technical field: [0001] The invention relates to a method for improving the efficiency of multi-table join queries in online aggregation, and in particular to a method that improves this efficiency by looking up an index to obtain the tuples that satisfy the join conditions.

Background: [0002] With the spread of social networks, the Internet of Things, e-commerce, and similar applications, the volume of data generated today has grown explosively compared with ten years ago. Enterprises, government agencies, and scientific research institutions generate huge amounts of data every day: Taobao generates 7 TB of data every day, and Baidu needs to process 100 PB of data every day. How to process data at this scale and mine useful information from it is a problem that major companies and institutions need to solve. Online aggregation can improve the speed of SQL queries because it does not need to scan the...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventors: 宋爱波, 贡欢
Owner: SOUTHEAST UNIV