Data quality monitoring method and apparatus, and big data computing platform
A data quality and computing platform technology, applied in the field of data processing, can solve problems such as insufficient evaluation of service quality, false alarms, and inaccurate alarms, etc., to achieve the optimization of big data computing services, reduce false alarm rates, and reduce workload Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0046] This embodiment adopts the method of statistical prediction, predicts the current data of the parameter according to the historical data of the parameter, collects the current data of the parameter and compares it with the predicted result, and performs alarm processing according to the comparison result. This embodiment relates to data quality monitoring of data in a data warehouse, and the data warehouse is set in a computing cluster that provides big data computing services. But the present invention is not limited thereto, and can also be used for data quality monitoring of data in other systems and other nodes.
[0047] Such as figure 1 As shown, the data quality monitoring method of this embodiment includes:
[0048] Step 110, collect historical data related to the parameters to be monitored, and use the historical data as a sample to establish a prediction model for the parameters;
[0049] Statistical forecasting belongs to the research category of forecasting...
Embodiment 2
[0094]In order to solve the problem that existing big data computing services cannot provide users with data quality monitoring, this embodiment provides a user data quality monitoring method for big data computing services, such as image 3 As shown, the method includes:
[0095] Step 210, the big data computing platform collects historical data related to the parameters to be monitored from the saved user data, predicts the current data of the parameters according to the historical data, and obtains a prediction result;
[0096] The big data computing platform may be a cloud computing platform, etc., and the user data may be personal user data or enterprise user data. The content of the data can be various types of data such as data generated by the user based on the big data computing platform for business processing, log data, or data generated by accessing the big data computing platform.
[0097] The user data alarm of the big data computing service can be used as a val...
example 1
[0130] The data monitored in this example is the data in the offline data warehouse in the cloud computing system, and the data in the offline warehouse needs to be regularly imported from the front-end database such as mysql database or oracle business database. This part of imported data exists in the form of data tables, which can be called source data or source tables. Often, some summary tables can be generated based on these source data. The monitored data can be the data related to the source table, or the data related to the summary table generated based on the source table.
[0131] In this example, when monitoring data quality, an appropriate prediction model can be selected according to the characteristics of the data table. For example, when monitoring the number of records in the list of registered users, because after the user registers, even if the logout does not delete the record, but only marks its status as logout, the number of records in the list of regis...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com