Cross-platform unified big data SQL query method

A query method and cross-platform technology, applied in the field of big data SQL query and automatic scheduling, can solve problems such as poor usability, low performance, and high data migration overhead, and achieve the goal of improving efficiency, solving usability, and improving query performance. Effect

Active Publication Date: 2019-07-26
NANJING UNIV
View PDF2 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Purpose of the invention: Aiming at the current lack of an easy-to-use, unified and efficient cross-platform query method, the purpose of the present invention is to provide an efficient cross-platform unified big data SQL query method, provide users with a unified query language and shield the bottom layer from different The heterogeneity of the platform divides the cross-platform query into multiple sub-queries, and automatically completes the scheduling execution and data migration of sub-queries between multiple execution platforms, so that users only need to pay attention to the query statement itself, thereby greatly improving user query. The efficiency of analysis solves the problems of poor usability, low performance, and huge data migration overhead of existing cross-platform query methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-platform unified big data SQL query method
  • Cross-platform unified big data SQL query method
  • Cross-platform unified big data SQL query method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In the following, the present invention will be further clarified with reference to the accompanying drawings and specific embodiments. It should be understood that these embodiments are only used to illustrate the present invention and not to limit the scope of the present invention. After reading the present invention, those skilled in the art will understand various aspects of the present invention. Modifications in equivalent forms fall within the scope defined by the appended claims of this application.

[0025] The invention proposes a cross-platform unified big data SQL query method, which solves the problems of poor ease of use, low performance, and huge data migration overhead of the existing cross-platform query method, thereby greatly improving the efficiency of users performing cross-platform query. The complete process of the present invention is as figure 1 As shown, it includes 7 parts: SQL statement analysis and verification, rule-based optimization, cost-ba...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-platform unified big data SQL (Structured Query Language) query method, which comprises the following steps of: expanding part of SQL semantics, and providing a unifiedcross-platform SQL query language for a user; enabling a unified SQL parser to parse the query statement submitted by the user into a logic query plan, and verifying the legality of the query statement according to the meta information stored in the unified meta database; enabling a cross-platform optimizer to optimize the structure and the connection sequence of the logic query plan and convertthe logic query plan into an optimal physical execution plan composed of a plurality of sub-queries bound with an execution platform; enabling a cross-platform scheduler to convert the optimal physical execution plan into a task flow graph, automatically schedule the task flow graph according to the dependency relationship among the tasks and execute all the tasks; designing a unified platform layer interface meeting cross-platform SQL query requirements, and shielding operation differences between different execution platforms. The cross-platform unified big data SQL query method solves the problems that an existing cross-platform query method is poor in usability, low in performance, large in data migration cost and the like.

Description

Technical field [0001] The invention relates to the field of big data SQL query and automatic scheduling, in particular to a cross-platform unified big data SQL query method. Background technique [0002] SQL is currently the most widely used data query and analysis language. More and more big data systems provide support for SQL. Typical representatives are Apache Hive, Apache Spark SQL, and Apache Impala. In fact, big data SQL query analysis is still one of the most widely used technologies in the industry's big data analysis applications. However, in order to meet the different needs of various industries for big data analysis, many database query systems focusing on different fields have emerged. These systems often have great differences in query languages, data formats, computing models, system architectures, and underlying storage technologies, which greatly increase the learning and use costs of data analysts, and raise the threshold for big data query and analysis. Li...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/242G06F16/21
CPCG06F16/2433G06F16/214Y02D10/00
Inventor 黄宜华朱光辉尹良良
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products