Method and system for validating data

a data validation and data technology, applied in multi-dimensional databases, databases, instruments, etc., can solve the problems of not meeting performance requirements, transaction processing type relational database systems cannot meet all such requirements, and user queries become increasingly complex, so as to ensure data accuracy, reduce workload for checking data problems, and validate easily

Inactive Publication Date: 2012-07-10
KYNDRYL INC
View PDF11 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0021]By using the method and system according to the embodiments of the present invention, it is possible to validate easily whether data presented to users is problematic, and further determine, if a problem exists in the data, where the problem exists in a Business Intelligence (BI) solution. Therefore, data accuracy is ensured on the one hand, but on the other hand the workload for checking data problems is greatly reduced.

Problems solved by technology

Meanwhile, query requirements from users also become increasingly complex, which involves not only querying or manipulating one or more pieces of records in a relational table but also performing data analysis and information syntheses on tens of millions of pieces of recorded data in a plurality of tables.
However, a transaction processing type relational database system cannot meet all such requirements.
For operation and analytical type applications, they cannot meet performance requirements; thus, people always release the restriction on redundancy in a relational database and introduces statistical and integrated data.
However, the application logics of such statistical and integrated data are dispersed, random, and unsystematic; thus, the analytical function is limited, inflexible, and difficult to maintain.
However, since there are varieties of data sources and processing of ETL model and OLAP model involves a great mount of data, error likely occurs during the BI data processing process.
However, since the data amount in the report is too high, a comprehensible comparison is usually impossible.
Besides, even if it is found that the data in the report is inconsistent with the original data in the application system, it is impossible to determine the cause of the problem.
The workload for comprehensively checking data in the models and data warehouse is overwhelmingly large, which always needs considerable time to determine the cause of the problem.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for validating data
  • Method and system for validating data
  • Method and system for validating data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032]Hereinafter, the present invention will be described with reference to the methods and system according to the embodiments of the present invention, wherein each block in the flow charts and / or block diagrams and combination of each block in the flow charts and / or block diagrams of the present invention may be implemented with computer program instructions. These computer program instructions may be provided to a processor of a computer or other programmable data processing apparatus such that these instructions executed through the computer or other programmable data processing apparatus implement functions / operations specified in the blocks of the flow charts and / or block diagrams presented herein.

[0033]These computer program instructions may also be stored in a computer-readable hardware storage medium capable of instructing the computer or other programmable data processing apparatus to work in a particular manner, such that the instructions stored in the computer-readable...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method and system for validating data. Warehouse data is generated by transforming source data via an ETL transformation model. A data cube is generated by transforming the warehouse data via an OLAP transformation model. A report dataset (MDS1) is generated from the data cube. A reference dataset (S) is generated from the source data. Whether MDS1 matches S is determined. If MDS1 doesn't match S, then an OLAP inverse transformation is performed on MDS1 to generate an OLAP dataset (MDS2) and whether MDS2 matches S is determined. If MDS1 doesn't match S and MDS2 does not match S, then an ETF inverse transformation is performed on MDS2 to generate an ETL dataset (MDS3) and whether MDS2 matches MDS1 and whether MDS3 matches S is determined. If MDS1 doesn't match S and MDS2 does not match S and MDS3 does not match S, then whether MDS3 matches MDS2 is determined.

Description

FIELD OF THE INVENTION[0001]The present invention relates to data processing technology, and in particular, to a method and a system for validating data.BACKGROUND OF THE INVENTION[0002]With the development of information technology, more and more people begin to use relevant technology on business intelligence to analyze and process business data to provide powerful support for decision-makers. Also, with the development and application of database technology, the data amount stored in a database rocketed high from mega (M) bytes and gigabytes (G) in the 1980s to current trillion (T) bytes and peta (P) bytes. Meanwhile, query requirements from users also become increasingly complex, which involves not only querying or manipulating one or more pieces of records in a relational table but also performing data analysis and information syntheses on tens of millions of pieces of recorded data in a plurality of tables. However, a transaction processing type relational database system cann...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(United States)
IPC IPC(8): G06F7/00G06F17/00
CPCG06F17/30592G06F17/30563G06F16/254G06F16/283G06F7/026
Inventor LI, XUE C.FU, XIAO J.GAO, XUE F.XIN, XIN
Owner KYNDRYL INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products