Multi-source heterogeneous data quality detection method based on petri network

A multi-source heterogeneous data and quality detection method technology, applied in the direction of structured data retrieval, structured data browsing, digital data information retrieval, etc., to achieve the effect of enhancing the value of data

Active Publication Date: 2021-03-23
中科大数据研究院
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Aiming at the defects and problems existing in existing equipment, the present invention provides a multi-source heterogeneous data quality detection method based on petri net, which effectively solves the problem t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source heterogeneous data quality detection method based on petri network
  • Multi-source heterogeneous data quality detection method based on petri network
  • Multi-source heterogeneous data quality detection method based on petri network

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0031] Example 1: This embodiment is intended to provide a multi-source heteroader data detection method based on Petri nets, which is often only for a single data source, a single data format quality detection, and the result is lacking. It can be interpreted that the environment of the current big data cannot be consistent, and the present embodiment is based on the Petri web model, which is suitable for systems for multi-source heteroading data.

[0032]In this embodiment, a multi-source heterogeneous data detecting system based on Petri nets includes a data quality rule library module, a data quality task management module, a data quality job module, a scheduling plan management module, a data quality analysis module and a data quality knowledge base module. The data quality rule library module is a collection of data quality rules, providing quality rules for quality detection; data quality task management module as execution unit of data quality detection, first selecting an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a petri network-based multi-source heterogeneous data quality detection method, which can use a scheduling plan module to configure data quality jobs and then use a message feedback mechanism of a petri network to form a table and chart form quality analysis report for each time of quality jobs and finally accumulate data quality detection problems. A data quality problem knowledge base is formed, sustainable development opinions and suggestions are provided for subsequent data quality detection, it is guaranteed that all data sources are independent of one another, SQLforms are unified, and the problem that multi-source heterogeneous data needs a specific query method for different data sources is solved. A graphical and draggable quality task and job management method is provided for a user, a quality detection scheduling plan can be configured, a data quality analysis result is generated for the user to check, and a data quality knowledge base is establishedto improve the capability of solving quality problems. Effective support is provided for system data quality improvement, so that the data value is improved.

Description

technical field [0001] The invention belongs to the technical fields of big data, data governance and data analysis, and in particular relates to a petri-net-based multi-source heterogeneous data quality detection method. Background technique [0002] The information age has now transitioned to the era of big data, artificial intelligence, and the Internet of Things. In the fields of big data analysis, data warehouse construction, machine learning, neural networks, and Internet of Things data transmission, data is the cornerstone of all of these. Data quality is crucial in the use of data. Only with quality-assured data can algorithms get closer to the correct answer. [0003] In the context of big data, data governance has the problems of huge data volume, various data sources, and various data representation methods. With the introduction of the concept of Data Lake, big data governance does not require prior structured processing of data, and data is stored in relational...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/215G06F16/2455G06F16/26G06F16/28
CPCG06F16/215G06F16/2455G06F16/26G06F16/28G06F16/284
Inventor 贵恒冯凯王元卓王洪显
Owner 中科大数据研究院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products