Sql automatic semantic analysis-based data lineage analysis system and method

An automatic parsing and analysis system technology, applied in the field of data lineage analysis, can solve problems such as data inapplicability, and achieve the effect of saving time and energy

Inactive Publication Date: 2017-09-15
GUANGDONG KINGPOINT DATA SCI & TECH CO LTD
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this kind of system can only solve the situation where data is processed by ETL using t...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sql automatic semantic analysis-based data lineage analysis system and method
  • Sql automatic semantic analysis-based data lineage analysis system and method
  • Sql automatic semantic analysis-based data lineage analysis system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The present invention will be described in further detail below by means of specific embodiments:

[0039] The reference signs in the accompanying drawings of the description include: sql preprocessing module 10, keyword rule base building unit 11, data model and data processing sql script extraction unit 12, script decomposition unit 13, lineage identification module 20, keyword discovery unit 21. Lineage extraction unit 22, lineage presentation module 30, lineage presentation unit 31.

[0040] Such as figure 1 As shown, the data pedigree analysis system based on the automatic analysis of sql script semantics in this embodiment is composed of a sql preprocessing module 10 , a pedigree identification module 20 and a pedigree presentation module 30 .

[0041] The sql preprocessing module 10 is responsible for setting up the keyword rule base, reads the data model structure to be detected and the sql script of data processing from the database where the data to be detect...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sql automatic semantic analysis-based data lineage analysis system. The system comprises a sql preprocessing module, a lineage recognition module and a lineage display module connected in sequence, wherein the sql preprocessing module is used for establishing a keyword rule library, reading a to-be-detected data model structure and a data processing sql script from a database where to-be-detected data is located, and decomposing the data processing sql script to form a script analysis table; the lineage recognition module is used for carrying recognizing a keyword of the data processing sql script read from the sql preprocessing module, extracting lineage information from the data processing sql script corresponding to the keyword and storing the lineage information into the script analysis table. The invention furthermore discloses an analysis method of the sql automatic semantic analysis-based data lineage analysis system. According to the system and the method, lineage analysis can be carried out on other data on the basis of an ETL processing process.

Description

technical field [0001] The invention relates to the field of data lineage analysis, in particular to a data lineage analysis system and method based on SQL semantics automatic analysis. Background technique [0002] Data lineage (Lineage, Provenance, Pedigree) can also be translated as (lineage, origin, lineage, pedigree). It is a research field developed with the development of databases and networks in recent years. Its content mainly includes the calculation of data lineage. , storage, transmission and query, etc. For database systems, it is sometimes necessary to trace the source of query results to measure the credibility and quality of data, etc. [0003] With the rapid development of information technology, enterprises have accumulated more and more data assets. In order to support management decisions and fully tap the value of data, enterprises need to process and analyze a large amount of source data. With the increase of data volume, As the complexity of busines...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/24522G06F16/24564
Inventor 陶波许飞月陈乐焱
Owner GUANGDONG KINGPOINT DATA SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products