Data query method and device for parallel database

A data query method and device, applied in the field of data query. It addresses problems such as increased network communication overhead and degraded query performance, for which no solution had previously been proposed, and achieves improved resource utilization, reduced network overhead, and improved query performance.

Inactive Publication Date: 2016-12-21
DAWNING INFORMATION IND BEIJING
5 Cites, 20 Cited by

AI Technical Summary

Problems solved by technology

To guarantee a correct result set, current parallel databases mainly execute aggregation queries by gathering the data onto a single node and aggregating it there, but this method also brings a problem that the data a...

Examples

Detailed Description of the Embodiments

[0037] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention belong to the protection scope of the present invention.

[0038] According to an embodiment of the present invention, a data query method for a parallel database is provided.

[0039] As shown in Figure 1, the data query method according to the embodiment of the present invention includes:

[0040] Step S101: on each database node, perform group aggregation of the target data in the target data table according to the corresponding associated fields between the target data table and the other data tables;

[0041] Step S103, on each database node, carry ...
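Step S101 can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the rows held by one node are `(join_key, value)` pairs and that the aggregate is a sum; the function name `group_aggregate_by_join_field` and the sample rows are hypothetical and do not come from the patent text.

```python
from collections import defaultdict

def group_aggregate_by_join_field(rows):
    """Sum the values of one node's rows, grouped by the associated (join) field."""
    partial = defaultdict(int)
    for join_key, value in rows:
        partial[join_key] += value
    return dict(partial)

# Hypothetical fragment of the target data table stored on a single node.
node_rows = [(1, 10), (1, 5), (2, 7), (1, 2)]
partial = group_aggregate_by_join_field(node_rows)

# Four input rows collapse into one partial aggregate per join key, so
# fewer rows have to leave the node in the later steps.
rows_before = len(node_rows)
rows_after = len(partial)
```

Grouping by the associated field, rather than by the final grouping column, keeps the key needed by the later join in every partial result.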

Abstract

The invention discloses a data query method and device for a parallel database. The method comprises the steps of: on each database node, performing group aggregation of the target data in a target data table according to the corresponding associated fields between the target data table and other data tables; on each database node, re-partitioning the corresponding group aggregation results and the corresponding other data tables by hashing on the corresponding associated fields; collecting the re-partitioned group aggregation results and the re-partitioned other data tables from each database node onto a target database node; and, on the target database node, performing join aggregation of the target data over the re-partitioned group aggregation results and the re-partitioned other data tables. The method realizes data aggregation queries while improving query parallelism, increasing the resource utilization of the cluster, reducing network overhead, and improving query performance.
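The four steps in the abstract can be sketched in Python as a single-process simulation. This is an illustration under stated assumptions, not the patented implementation: the tables `orders` (`customer_id`, `amount`) and `customers` (`customer_id`, `region`), the SUM aggregate, the use of Python's built-in `hash` as the partitioning function, and the final merge of per-node results are all hypothetical choices made for the example.

```python
from collections import defaultdict

def local_group_aggregate(orders):
    """Step 1: on one node, sum amounts per join key (customer_id)."""
    partial = defaultdict(int)
    for customer_id, amount in orders:
        partial[customer_id] += amount
    return list(partial.items())

def hash_repartition(rows_per_node, num_nodes):
    """Steps 2-3: route every row to the node chosen by hashing its join key."""
    buckets = [[] for _ in range(num_nodes)]
    for rows in rows_per_node:
        for row in rows:
            buckets[hash(row[0]) % num_nodes].append(row)
    return buckets

def join_aggregate(partials, customers):
    """Step 4: on one target node, join on customer_id and sum per region."""
    total_by_customer = defaultdict(int)
    for customer_id, subtotal in partials:
        total_by_customer[customer_id] += subtotal
    by_region = defaultdict(int)
    for customer_id, region in customers:
        if customer_id in total_by_customer:  # inner join
            by_region[region] += total_by_customer[customer_id]
    return dict(by_region)

def parallel_query(orders_per_node, customers_per_node):
    num_nodes = len(orders_per_node)
    partials = [local_group_aggregate(o) for o in orders_per_node]
    partial_buckets = hash_repartition(partials, num_nodes)
    customer_buckets = hash_repartition(customers_per_node, num_nodes)
    # Assumption: per-node join results are merged by summing per region.
    result = defaultdict(int)
    for p, c in zip(partial_buckets, customer_buckets):
        for region, total in join_aggregate(p, c).items():
            result[region] += total
    return dict(result)

# Hypothetical data spread over three simulated nodes.
orders_per_node = [
    [(1, 10), (1, 5), (2, 7)],
    [(1, 3), (3, 20)],
    [(2, 1), (3, 4), (3, 6)],
]
customers_per_node = [[(1, "north")], [(2, "south")], [(3, "north")]]
result = parallel_query(orders_per_node, customers_per_node)
```

Because both inputs are partitioned with the same hash of the join key, all partial sums for a given `customer_id` reach the same node as the matching `customers` row, so each node can join and aggregate its bucket independently.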

Description

Technical field

[0001] The invention relates to the field of parallel databases, and in particular to a data query method and device for parallel databases.

Background

[0002] With the advent of the big data era, data analysis faces greater challenges than before: on the one hand, data volume is growing explosively; on the other hand, the variety of data types is increasing. Hadoop (a distributed system infrastructure developed by the Apache Foundation) emerged to solve the problem of offline data analysis, but its own characteristics prevent it from meeting the requirements of real-time data analysis. Parallel databases therefore remain the main tool for real-time structured data analysis.

[0003] In a parallel database system, aggregation and association (join) queries are the main methods for data analysis, and most of the analysis will involve the connec...

Application Information

IPC(8): G06F17/30
CPC: G06F16/24556, G06F16/244, G06F16/2456, G06F16/278
Inventor 郭庆李晋钢张建磊惠润海宋怀明
Owner DAWNING INFORMATION IND BEIJING