Batch data query method and device

A batch data and query method technology, applied in the field of data warehouse query optimization, can solve the problems of low efficiency of big data query and query

Active Publication Date: 2016-06-15
HUAWEI TECH CO LTD +1
View PDF5 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In sharp contrast to the lack of opportunities for intra-query optimization, there are abundant inter-query optimization opportunities in the data warehouse batch query application scenario. Inter-query optimization opportunities refer to the optimization opportunities between multiple query statements. In the batch query application scenario Therefore, there are a lot of opportunities for query optimization. However, in the prior art, query optimization is only performed for a single query statement, which makes the query efficiency of big data query low.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Batch data query method and device
  • Batch data query method and device
  • Batch data query method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments.

[0079]The method of each embodiment of the present invention is mainly applied in the scene of batch data query, for example, it can be applied in the data warehouse, there are a large number of batch data query opportunities in the data warehouse, there is a large amount of data in the data warehouse, these data are usually stored in in a distributed file system.

[0080] figure 1 It is a flow chart of the batch data query method provided in Embodiment 1 of the present invention. The method provided in this embodiment can be executed by a batch da...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a batch data query method and a device. The method comprises following steps: obtaining multiple logic inquiry plan trees corresponding to multiple query statements; diving multiple query groups by all operators of each logical query plan tree, wherein logical query plan trees of each query group are in the relationships as follows: inter-query write-read dependency relationships; overlapping relationships between operators or operand overlapping relationships; optimizing inter-query logical query plan trees of each query group and generating multiple corresponding physical query plan trees based on multiple optimized query plan trees; and finally, executing physical obtaining query results of multiple sentences based on multiple physical query plan trees. The method helps to increase optimization opportunities between logical plan trees in query groups by grouping the multiple logical query plan trees. By performing inter-query optimization on the logical query trees of query groups, efficiency of batch data query is increased.

Description

technical field [0001] The embodiment of the present invention relates to data warehouse query optimization technology, in particular to a batch data query method and device. Background technique [0002] At present, representative big data query systems (such as: Hive, Shark, Impala) all use a single query statement as the basic unit of parsing and optimization. The basic query processing flow is as follows: First, parse the read single query statement into a tree The logical query plan tree of the structure, and then, by selecting and implementing the algorithm for each operator of the logical query plan tree, and determining the execution order of these operators, the logical query plan tree is converted into a physical query plan, and finally, the query execution engine Execute the physical query plan and output the query result. [0003] A key performance of a big data query system is query efficiency. At present, the commonly used method to improve the query efficienc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2454G06F16/24542G06F16/24524
Inventor 李丰王蕾张赟冯晓兵
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products