Data lineage analysis method and device

An analysis method and technology of an analysis device, applied in the field of cloud computing, can solve the problems of difficulty in flexible expansion and analysis of the origin of distributed database and non-relational database data, etc.

Active Publication Date: 2018-01-30
CHINA TELECOM CORP LTD
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] With the development of various database technologies, the SQL syntax of distributed databases and non-relational databases is no longer limited by the previous standard SQL specifications, and there will be many keywords or syntax formats that have been expanded by themselves. Therefore, the existing The technology based on the complete definition of standard SQL grammar is difficult to flexibly expand and analyze the origin of data in these distributed databases and non-relational databases.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data lineage analysis method and device
  • Data lineage analysis method and device
  • Data lineage analysis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The present disclosure will be described below with reference to the accompanying drawings. It is to be noted that the following description is merely explanatory and exemplary in nature, and in no way serves as any limitation of the present disclosure, its application or uses. Relative arrangements of components and steps and numerical expressions and numerical values ​​set forth in the embodiments do not limit the scope of the present disclosure unless otherwise specifically stated. Additionally, techniques, methods and devices known to those skilled in the art may not be discussed in detail but are intended to be part of the description where appropriate.

[0052] In order to solve the above-mentioned problems in the prior art, the following embodiments of the present disclosure propose a data lineage analysis method of a general structured query statement that can be flexibly expanded. In this method, metadata is first obtained, for example, Extract system definiti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data linage analysis method and device. The method comprises analyzing query sentences based on mode configuration to recognize target tables, target fields, source tables and source fields in the query sentences; obtaining metadata defined by various database systems or users and performing accurate matching on fuzzy fields of the query sentences through the metadata; generating data lineage relationships of the query sentences according to the field tracing sequence of the recognized target fields and source fields; analyzing the data lineage relationships of a plurality of the query sentences through multi-layer sentence analysis. By means of the method and the device, data lineage of various general structured sentences can be analyzed flexibly.

Description

technical field [0001] The present disclosure relates to the field of cloud computing, and in particular, to a data lineage analysis method and device. Background technique [0002] Data lineage refers to the contextual relationship between data. Data lineage analysis is to trace the source of query results to the database system to measure the credibility and quality of data. Through data lineage tracking, data credibility, quality, version information, etc. can be solved when distributing data sharing, and these problems can also be solved for various exported data sets. Through data lineage tracking, the evolution process of data in the data stream can be obtained. [0003] The current automatic data lineage analysis technology is mainly aimed at the standard SQL (StructuredQuery Language) language analysis of mainstream relational data, and analyzes the origin of data in SQL scripts through lexical analysis, syntax analysis and other technologies. [0004] With the dev...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/2433G06F16/24573
Inventor 陈翀陈康向勇张青吴旭刘春高智衡陶彩霞关迎辉
Owner CHINA TELECOM CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products