
Data quality monitoring method and system and computer equipment

A data quality monitoring technology, applied in the Internet field, that addresses problems such as the inability to verify data quality in time, the inability to notify business personnel of problematic or erroneous data, and the invalidity of downstream data processing, so as to preserve data integrity, avoid information being buried, and guarantee the effect of data quality monitoring.

Pending Publication Date: 2021-08-27
上海淇馥信息技术有限公司
5 Cites, 1 Cited by

AI Technical Summary

Problems solved by technology

[0006] The invention aims to solve at least one of the following technical problems: data quality cannot be verified in a timely, effective and accurate manner; problematic or erroneous data cannot be reported to the corresponding business personnel in time; erroneous or problematic data invalidates subsequent data processing and may even waste cluster resources (or computing resources, etc.); and monitoring results are poor because of data quality issues (problematic or erroneous data).


Examples


Embodiment 1

[0045] Embodiments of the data quality monitoring method of the present invention are described below with reference to Figures 1 to 3.

[0046] Figure 1 is a flowchart of an example of the data quality monitoring method of the present invention. As shown in Figure 1, the method includes the following steps.

[0047] Step S101: create a data quality monitoring task and store the monitoring task information in a document database, the monitoring task information including monitoring configuration information and alarm configuration information.

[0048] Step S102: submit the monitoring task to the cluster-based data warehouse so as to monitor the data in the offline data warehouse.

[0049] Step S103: process the monitoring result obtained by executing the monitoring task, together with the corresponding monitoring task information, and store them in the relational database.

[0050] Step S104: detect the relational database and issue an alarm according to...
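
The flow of steps S101 to S104 can be illustrated with a minimal sketch. It is not the implementation disclosed in the patent: a Python dict stands in for the document database, two in-memory SQLite databases stand in for the cluster-based data warehouse and the relational database, and every name (`create_task`, `run_task`, `store_result`, `detect_and_alarm`, the `monitor_result` table, and the configuration fields `sql`, `max_value` and `receiver`) is a hypothetical choice made for illustration.

```python
import json
import sqlite3

# Hypothetical stand-in for the document database: task id -> JSON document.
document_store = {}


def create_task(task_id, monitor_config, alarm_config):
    """S101: store the monitoring task information as one document."""
    document_store[task_id] = json.dumps(
        {"monitor": monitor_config, "alarm": alarm_config}
    )


def run_task(task_id, warehouse):
    """S102: run the task's check against the stand-in warehouse."""
    task = json.loads(document_store[task_id])
    return warehouse.execute(task["monitor"]["sql"]).fetchone()[0]


def store_result(results_db, task_id, value):
    """S103: store the result together with the task's alarm configuration."""
    alarm = json.loads(document_store[task_id])["alarm"]
    results_db.execute(
        "INSERT INTO monitor_result (task_id, value, threshold, receiver) "
        "VALUES (?, ?, ?, ?)",
        (task_id, value, alarm["max_value"], alarm["receiver"]),
    )


def detect_and_alarm(results_db):
    """S104: scan stored results and build one alarm per threshold breach."""
    rows = results_db.execute(
        "SELECT task_id, value, threshold, receiver FROM monitor_result "
        "WHERE value > threshold"
    ).fetchall()
    return [
        f"notify {receiver}: task {task_id} value {value} exceeds {threshold}"
        for task_id, value, threshold, receiver in rows
    ]


# Stand-in warehouse table with one row that has a missing user_id.
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE orders (id INTEGER, user_id INTEGER)")
warehouse.executemany(
    "INSERT INTO orders VALUES (?, ?)", [(1, 10), (2, None), (3, 12)]
)

# Stand-in relational database for monitoring results.
results_db = sqlite3.connect(":memory:")
results_db.execute(
    "CREATE TABLE monitor_result "
    "(task_id TEXT, value REAL, threshold REAL, receiver TEXT)"
)

create_task(
    "orders_null_user",
    {"sql": "SELECT COUNT(*) FROM orders WHERE user_id IS NULL"},
    {"max_value": 0, "receiver": "data-owner"},
)
value = run_task("orders_null_user", warehouse)
store_result(results_db, "orders_null_user", value)
alarms = detect_and_alarm(results_db)
for message in alarms:
    print(message)
```

Storing the alarm threshold and receiver next to each result means the detection step only has to read the relational database, which matches the description of the detection result as comprising both the monitoring result and the alarm configuration information.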

Embodiment 2

[0114] The system embodiment of the present invention is described below, and the system can be used to implement the method embodiment of the present invention. The details described in the system embodiments of the present invention should be regarded as supplements to the above method embodiments; details not disclosed in the system embodiments of the present invention can be implemented by referring to the above method embodiments.

[0115] Referring to Figures 4, 5 and 6, the present invention also provides a data quality monitoring system 400 for monitoring the data quality of an offline data warehouse. The data quality monitoring system 400 includes: a creation module 401, used to create a data quality monitoring task and store the monitoring task information in the document database; a monitoring module 402, configured to submit the monitoring task to the cluster-based data warehouse to monitor the data in the offline data wareho...


Abstract

The invention provides a data quality monitoring method, system and computer equipment for monitoring the data quality of an offline data warehouse. The method comprises the following steps: creating a data quality monitoring task and storing monitoring task information in a document database, the monitoring task information comprising monitoring configuration information and alarm configuration information; submitting the monitoring task to a cluster-based data warehouse so as to monitor data in the offline data warehouse; processing the monitoring result obtained by executing the monitoring task and the corresponding monitoring task information, and then storing them in a relational database; and detecting the relational database and issuing an alarm according to the detection result, the detection result comprising the monitoring result and the alarm configuration information. By flexibly configuring the data quality monitoring task, data can be monitored more effectively and promptly, alarm information (or notifications) can be delivered in time, and data quality monitoring can be effectively ensured.

Description

Technical field

[0001] The present invention relates to the Internet field, in particular to a data quality monitoring method, system and computer equipment.

Background art

[0002] With the advent of the big data era, more and more applications and services are built on data, and the importance of data is self-evident. Guaranteeing data quality is the basis of all data analysis and data mining. Data quality directly determines the accuracy of information and also affects the survival and competitiveness of enterprises. Ensuring data availability and security is therefore an important link that cannot be ignored.

[0003] The existing open source data quality monitoring component Apache Griffin is a model-driven solution. Based on the target data set, users can choose from different dimensions (such as checking whether the data quantity and data accuracy of the source end and the target end are consistent after the execution of the offline task, ...
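
The source-versus-target checks mentioned in paragraph [0003] can be pictured with a small sketch. It does not use or reproduce Apache Griffin's API; the function names `count_consistent` and `accuracy`, and the definition of accuracy as the fraction of source rows with a matching target row, are assumptions made for illustration.

```python
from collections import Counter


def count_consistent(source_rows, target_rows):
    """Data quantity check: both ends hold the same number of rows."""
    return len(source_rows) == len(target_rows)


def accuracy(source_rows, target_rows):
    """Fraction of source rows that have a matching row in the target."""
    if not source_rows:
        return 1.0
    remaining = Counter(target_rows)  # each target row can match only once
    matched = 0
    for row in source_rows:
        if remaining[row] > 0:
            remaining[row] -= 1
            matched += 1
    return matched / len(source_rows)


# Hypothetical rows: the third row was altered on its way to the target.
source = [(1, "a"), (2, "b"), (3, "c"), (4, "d")]
target = [(1, "a"), (2, "b"), (3, "x"), (4, "d")]

print(count_consistent(source, target))
print(accuracy(source, target))
```

In this example the row counts agree while one row differs, which shows why a quantity check alone does not establish that an offline task copied the data correctly.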


Application Information

IPC(8): G06F16/215, G06F16/22, G06F16/28
CPC: G06F16/215, G06F16/2291, G06F16/283
Inventor: 吴江龙, 田继龙
Owner: 上海淇馥信息技术有限公司