ETL (Extract Transform Load)-based data optimization method and equipment

A data optimization and equipment technology, applied in the field of data processing, can solve the problems of memory bandwidth resource occupation, maintainability, poor usability, large resource occupation, etc., and achieve branch parallel optimization, parallel optimization between records, and data simplification optimization Effect

Active Publication Date: 2012-12-12
BEIJING JOIN CHEER SOFTWARE
View PDF3 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] These implementations usually have specific data processing logic for specific external data and loading targets, and these logics are solidified in an ETL program; so such implementations can only be used in specific ETL scenarios, and in other scenarios , it is impossible to use the previous results all the way to the new scene, or to reuse them, and can only complete a new realization for the new specific scene;
[0012] 2. Poor maintainability and usability
Maintaining such an ETL process involves the management of a large number of "scripts" or "codes", which is very confusing, and this puts forward quite high requirements for the technical level of the implementers, otherwise it will be difficult to achieve
[0014] 3. No metadata management
[00

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • ETL (Extract Transform Load)-based data optimization method and equipment
  • ETL (Extract Transform Load)-based data optimization method and equipment
  • ETL (Extract Transform Load)-based data optimization method and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0039] In the process of enterprise information construction, especially the business intelligence business process oriented to analysis and mining, it often involves the processing of a large number of, scattered, and heterogeneous data. ETL is an indispensable part of this process. The following first introduces abbreviations and key terms related to the present invention.

[0040] ETL: Abbreviation for Extract-Transform-Load, that is, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides an ETL (Extract Transform Load)-based data optimization method and ETL-based data optimization equipment. The method comprises the following steps of: previously arranging a plurality of data processing units according to a data extract, transform and load process ETL; previously setting a communication mechanism for the data processing units; acquiring instruction information including source data input by a user; constructing a data processing flow corresponding to the instruction information according to the source data; and optimizing the data processing flow according to the data processing units and the preset communication mechanism. By previously setting the data processing units and the communication mechanism, simplified optimization, branch parallel optimization and parallel optimization between records of data are realized, the processing efficiency of data optimization is increased, and hardware resources are saved.

Description

Technical field [0001] The present invention relates to data processing technology, in particular to processing technology in the process of data migration and conversion, and specifically is an ETL-based data optimization method and equipment. Background technique [0002] In the process of enterprise informatization construction, it often involves processing a large number of, scattered, and heterogeneous data. The process of data extraction, transformation, and loading (Extract-Transform-Load, ETL) is an essential part of this process. In the prior art, there are mainly the following ways to implement the ETL process: [0003] 1. Hard coding [0004] Hard coding is a stand-alone program compiled through high-level languages ​​(such as C, C++) or scripts or a dynamic link library embedded in an ETL framework. The advantage of this method is flexibility, as long as the data type and processing logic supported by the programming language used can be realized in the ETL process. Co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 李纪洲周徐波王星宇
Owner BEIJING JOIN CHEER SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products