Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Parallel SQL automatic generation method

A technology for automatically generating and virtual tables, applied in structured data retrieval, special data processing applications, instruments, etc., can solve problems such as time-consuming and energy-consuming, time-consuming table operations with large data volumes, and inability to achieve parallelization. Achieve the effect of breaking through its own limitations and improving execution efficiency

Active Publication Date: 2020-06-09
深圳市魔数智擎人工智能有限公司
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]Data extraction is used as a pre-data set for data processing and subsequent modeling. It usually requires data personnel to manually write a large amount of SQL, even if the SQL already has a concise and easy However, the process of writing SQL and correcting grammatical errors still consumes a lot of time and energy, and repeated work is difficult to reuse. At the same time, it is necessary to deal with the subtle differences between SQL languages ​​of different databases to achieve the goal of grammatical compatibility.
[0004]The current existing SQL generation system can realize syntax generation and error correction, and can optimize execution efficiency from the perspective of syntax, and can In terms of performance, the execution speed of SQL is improved, but the traditional table-to-table aggregation still depends on the order of execution between tables, so it cannot achieve true parallelization, and it cannot break through the database's limitation on the number of columns in a single table. The upper limit is limited; in addition, the current conventional multi-table data aggregation method only supports multi-table aggregation of the same source database. If you want to aggregate the tables of multiple types of databases, you need to move the data. This form is for big data. Quantitative table operations are very time-consuming

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel SQL automatic generation method
  • Parallel SQL automatic generation method
  • Parallel SQL automatic generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0038] To achieve the above object, the technical solution of the present invention is as follows.

[0039] refer to Figure 1-32 , the present invention is a kind of parallelized SQL automatic generation method, it is characterized in that, the concrete steps of this method are as follows:

[0040] refer to Figure 1-3 and Figure 32 , S1: According to the single table of more than one data set, and complete the configuration of the tree relationship structure between the tables, and obtain the table relationship tree, the single table includes a main table and more than one sub-table, and the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a parallelized SQL automatic generation method, which comprises the following steps of: obtaining a relationship tree of associated information among tables according to morethan one data table, carrying out breadth-first traversal, and pre-analyzing structural information of each layer of tree structure in the relationship tree of the tables, wherein each input data source table corresponds to one virtual table; aiming at the structure information corresponding to each virtual table, generating an association index table capable of expressing an association relationship between every two single tables; establishing indexes for the association relationship fields on all the association index tables; broadcasting the association index table to different types of data warehouses to realize transmission of constraints and association relationships among single tables in different libraries, and the virtual table corresponding to each of the other sub-tables can be directly associated with the virtual table corresponding to the main table through the association index table. By decoupling the dependency relationship between the tables, the execution efficiencyis effectively improved, the self limitation of the data warehouse is broken through, and the method has the characteristics of simplicity and easiness in use while being compatible with mainstream type data warehouses.

Description

technical field [0001] The invention belongs to the technical field of data processing and data extraction, and in particular relates to a parallelized SQL automatic generation method. Background technique [0002] Data storage in the form of databases has become an indispensable method in many applications. Commonly used databases include Oracle, Mysql, GreenPlum, Hive, etc., which are the mainstream sources of data sets. The operations on common mainstream databases basically involve the structured SQL language. In the execution of data extraction, SQL has efficient performance because it is more suitable for the interaction at the bottom of the database, and it is also simple and easy to use. [0003] Data extraction is used as a pre-data set for data processing and subsequent modeling, and usually requires data personnel to manually write a large amount of SQL. Even though SQL has the characteristics of simplicity and ease of use, the process of writing SQL and correctin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/2453G06F16/28G06F16/2455
CPCG06F16/24532G06F16/24542G06F16/283G06F16/24558G06F16/24556
Inventor 柴磊许靖温征
Owner 深圳市魔数智擎人工智能有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products