Parallel query processing method for share-nothing database cluster in cloud computing environment

A technology of cloud computing environment and processing method, applied in the direction of electric digital data processing, special data processing application, computing, etc., to achieve the effect of improving execution speed

Inactive Publication Date: 2012-06-27
北京人大金仓信息技术股份有限公司
View PDF3 Cites 40 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, traditional single-node databases cannot fundamentally meet the above requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel query processing method for share-nothing database cluster in cloud computing environment
  • Parallel query processing method for share-nothing database cluster in cloud computing environment
  • Parallel query processing method for share-nothing database cluster in cloud computing environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In a cloud computing environment, data in a shared-nothing database cluster is stored in pieces, and each data node has only a part of the data. During the execution of some SQL statements, some data nodes need to operate on all data including other node data to complete the query plan. For this reason, the present invention inserts the node data flow operation at the appropriate place of the query plan tree of the data node, so that the data can flow among the nodes during the query execution process, so that each data node can obtain all the required data, Complete the concurrent execution query process. The following is a detailed description of this.

[0026] A shared-nothing database cluster as a specific embodiment of the present invention includes a control node and one or more data nodes. Each data node is responsible for storing data, accepting access plans and executing them, and then returning plan processing results. The user's data is firstly stored on e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a parallel query processing method for a share-nothing database cluster in a cloud computing environment. The method comprises the following steps of: firstly, segmenting a query plan to obtain sections which are executed by all data nodes; secondly, performing section processing on all data nodes to obtain Hash joins of all data nodes; and thirdly, performing Hash aggregation on control nodes by flowing of aggregated data between all data nodes to obtain a query result set. By the flowing operation of node data, the data can flow between all nodes in the query execution process, the parallel execution query process is finished, and an execution speed is greatly increased.

Description

technical field [0001] The invention relates to a database parallel query method, in particular to a shared-nothing database cluster parallel query processing method under a cloud computing environment, and belongs to the technical field of database cluster systems. Background technique [0002] With the continuous deepening of enterprise and government informatization, the complexity of database applications is increasing, and it is urgent to solve the problems of massive data processing, massive data storage and high scalability in order to provide good data storage and query services for business support systems . However, traditional single-node databases cannot fundamentally meet the above requirements. [0003] To this end, people draw on the technical concept of computer clusters (connecting a group of loosely integrated computer software and / or hardware to work closely together to complete computing work), and connect multiple databases to form a database cluster sy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 李睿峰王殿成冯玉李祥凯冷建全
Owner 北京人大金仓信息技术股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products