Star and snowflake schemas in extract, transform, load processes

a technology of star and snowflake, applied in the field of information warehouse systems, can solve the problems of time-consuming and inability to automate the process, and achieve the effect of avoiding the need for manual processing and establishing the structure for performing the etl process

Inactive Publication Date: 2013-05-09
IBM CORP
View PDF5 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0005]Accordingly, embodiments of the present invention include a computer-implemented method, computer program product and a system for supporting star and snowflake data schemas for use with an Extract, Transform, Load (ETL) process, comprising selecting a data source comprising dimensional data, where the dimensional data comprises at least one source table comprising at least one source column, importing a data model for the dimensional data into a data integration system, analyzing the imported data model to select a target data schema for the dimensional data, where the target data schema is a star data schema or a snowflake data schema, and where the target data schema comprises target dimensions and target facts, generating a meta-model representation by, for each target fact, mapping at least one source table or source column to the target fact, and for each target dimension, mapping at least one source table or source column to the target dimension, automatically converting the meta-model representation into one or more ETL jobs, and executing the one or more ETL jobs to extract the dimensional data from the data source and loading the dimensional data into the selected target data schema in a target data system.

Problems solved by technology

Establishing a structure for performing an ETL process is time-consuming, and complex, and there is no automated way to identify and handle loading of data into star and snowflake schemas while building ETL jobs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Star and snowflake schemas in extract, transform, load processes
  • Star and snowflake schemas in extract, transform, load processes
  • Star and snowflake schemas in extract, transform, load processes

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016]Referring now to the Figures, an exemplary computing platform or system 100 according to an embodiment of the present invention is illustrated in FIG. 1. The exemplary computing platform 100 comprises one or more data sources 105, a data integration system 110, one or more target systems 115, one or more end-user systems 120, and a database system 125. The platform or system 100 facilitates integration of data from various data sources 105 in different formats into the target systems 115.

[0017]The data sources 105 may include a wide variety of databases or storage structures residing at the same or different locations on one or more networks or systems. The target systems 115 may be in the form of computer systems, and may include databases (e.g., a data warehouse) or processing platforms used to further manipulate the data from the data integration system 110. The data sources 105 and target systems 115 may be implemented by any quantity of any type of conventional or other d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A computer-implemented method, computer program product and a system for supporting star and snowflake data schemas for use with an Extract, Transform, Load (ETL) process, comprising selecting a data source comprising dimensional data, where the dimensional data comprises at least one source table comprising at least one source column, importing a data model for the dimensional data into a data integration system, analyzing the imported data model to select a star or snowflake target data schema comprising target dimensions and target facts, generating a meta-model representation by mapping at least one source table or source column to each target fact and target dimension, automatically converting the meta-model representation into one or more ETL jobs, and executing the ETL jobs to extract the dimensional data from the data source and loading the dimensional data into the selected target data schema in a target data system.

Description

BACKGROUND[0001]1. Technical Field[0002]The present invention relates generally to information warehouse systems, and more particularly to supporting star and snowflake data schemas in order to improve Extract, Transform, Load processing.[0003]2. Discussion of Related Art[0004]Enterprises are building increasingly large information warehouses to enable advanced information analytics and to improve the business value of information. The data in the warehouses are loaded via Extract, Transform, Load (ETL) processes, which extract data from a source, transform the data into a suitable form according to particular business needs, and then load the data into the warehouse(s). Establishing a structure for performing an ETL process is time-consuming, and complex, and there is no automated way to identify and handle loading of data into star and snowflake schemas while building ETL jobs. Conventional ETL systems require a user to manually write several dozen jobs for loading data into a typ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F17/30
CPCG06F17/30563G06F17/30589G06F17/30592G06F16/254G06F16/282G06F16/283
Inventor BHIDE, MANISH A.MITTAPALLI, SRINIVAS KIRANPADMANABHAN, SRIRAM
Owner IBM CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products