SQL to cloud environment mapping method and system

A mapping method and cloud environment technology, applied in the field of mapping from SQL to cloud environment, to achieve the effect of improving query performance and reducing the amount of actual input data

Inactive Publication Date: 2015-03-11
INST OF COMPUTING TECH CHINESE ACAD OF SCI +1
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

These existing optimization schemes can improve the efficiency of automatic conversion from SQL to Mapreduce to a certain extent, but there is still room for further improvement

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • SQL to cloud environment mapping method and system
  • SQL to cloud environment mapping method and system
  • SQL to cloud environment mapping method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The inventor's research has found that most of the existing automatic conversion schemes from SQL to Mapreduce focus on the optimization of the Mapreduce level, while ignoring the diversity of SQL statements under the same logical conditions (the same function can be realized by different SQL statements), and this The performance difference brought about by variety. In view of this problem, the present invention proposes a rule-based query rewriting optimization method to guide the conversion of SQL to HiveQL, generate multiple conversion schemes, and proposes a cost-based evaluation method to select the optimal conversion scheme, So as to achieve the purpose of reducing overhead. Among them, the HiveQL language can be regarded as a subset compatible with Hive in the SQL language.

[0043] In general, the present invention provides a kind of method that SQL statement is converted into efficient HiveQL query language, thereby realizes the mapping of SQL to cloud environ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an SQL to cloud environment mapping method. The method includes following steps: 1) rewriting the SQL statement to be transformed to obtain a plurality of rewritten HiveQL statement having the same function with the SQL statement to be transformed; 2) calculating the exercise price of each HiveQL statement and selecting the optimal statement for outputting. The SQL to cloud environment mapping system is also provided. The cloud environment statement (as HiveQL statement) can be quickly optimized while mapping the SQL statement to cloud environment for reducing actual input data size of the Mapper and Reducer and promoting the query performance.

Description

technical field [0001] The invention relates to the technical field of parallel database, query rewriting and query optimization, and in particular, the invention relates to a mapping method from SQL to cloud environment. Background technique [0002] MapReduce provides a highly abstract programming model, which allows users to quickly distribute and execute their programs in a cluster composed of multiple machines by only focusing on the logic implementation of the two functions (Mapper and Reducer). Among them, there is no need to pay attention to low-level system issues such as data placement and task scheduling. With the rise of cloud computing technology, Hive, which is used to automatically convert high-level abstract languages ​​into Mapreduce task flows, came into being, which greatly improves the reusability of data analysis codes. Both Hive and Mapreduce are widely used technologies in Internet companies. Many companies use them to create web page indexes, analyze...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30H04L29/08
CPCG06F16/24534
Inventor 徐英钟虎嵩林苏伊鹏陈建吴凯锋李祥珍王志强张春光裘炜浩肖政衡星辰
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products