ETL (Extract Transform Load)-based data optimization method and equipment

A data optimization technology and equipment, applied in the field of data processing. It addresses problems such as heavy occupation of memory and bandwidth resources, poor maintainability and usability, and difficulty of operation, and it achieves branch-parallel optimization, record-parallel optimization, and improved processing efficiency.

Active Publication Date: 2015-02-04
BEIJING JOIN CHEER SOFTWARE

AI Technical Summary

Problems solved by technology

[0011] These implementations usually have specific data processing logic for specific external data and loading targets, and this logic is hard-wired into an ETL program. Such implementations can therefore only be used in specific ETL scenarios; earlier work cannot be carried over to or reused in a new scenario, and a new implementation has to be written for each new scenario.
[0012] 2. Poor maintainability and usability
Maintaining such an ETL process involves managing a large number of "scripts" or "code", which is very confusing and places quite high demands on the technical level of the implementers; otherwise the process is difficult to maintain.
[0014] 3. No metadata management
[0015] Some technical solutions, such as "hard coding" and "stored procedures", have no process for storing and managing metadata, which makes it very difficult to run, track, and analyze the ETL process, and to maintain and adjust it later.
[0016] 4. Inefficiency
As industry systems develop, the amount of data they hold keeps growing. The ETL process usually has to handle massive data under increasingly strict real-time requirements, so ever higher demands are placed on its efficiency, and traditional processing methods can no longer meet them.
[0018] 5. It takes up a lot of resources
This leads to heavy occupation of memory, CPU, and bandwidth resources.


Embodiment Construction

[0038] The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0039] Enterprise informatization, especially business intelligence processes for analysis and mining, often involves processing a large amount of scattered and heterogeneous data, and ETL is an essential part of this process. The abbreviations and key terms related to the present invention are introduced first below.

[0040] ETL: Abbreviation of Extract-Transform-Load, which is the process of data extraction, tr...

Abstract

The embodiment of the invention provides an ETL (Extract Transform Load)-based data optimization method and equipment. The method comprises the following steps: arranging a plurality of data processing units in advance according to the extract, transform and load (ETL) process; setting a communication mechanism for the data processing units in advance; acquiring instruction information input by a user, the instruction information including source data; constructing a data processing flow corresponding to the instruction information according to the source data; and optimizing the data processing flow according to the data processing units and the preset communication mechanism. Because the data processing units and the communication mechanism are set in advance, the method realizes simplified optimization, branch-parallel optimization, and parallel optimization between records, which increases the processing efficiency of data optimization and saves hardware resources.
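The idea of pre-arranged processing units joined by a preset communication mechanism can be illustrated with a small Python sketch. This is only an illustration under stated assumptions, not the patented implementation: the names `run_unit`, `run_branch`, and `run_flow` are hypothetical, the communication mechanism is assumed to be an in-memory FIFO queue, and each processing unit is assumed to be a plain function applied to one record at a time. Independent branches run concurrently, and within a branch the units are pipelined, so different records can be in different units at the same time.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor

_DONE = object()  # sentinel marking the end of a record stream


def run_unit(func, inbox, outbox):
    """Run one processing unit: read records from inbox, apply func, write to outbox."""
    while True:
        record = inbox.get()
        if record is _DONE:
            outbox.put(_DONE)  # pass the end-of-stream marker downstream
            return
        outbox.put(func(record))


def run_branch(source, steps):
    """Chain units with queues (the assumed communication mechanism) and collect the output."""
    queues = [queue.Queue() for _ in range(len(steps) + 1)]
    threads = [
        threading.Thread(target=run_unit, args=(step, queues[i], queues[i + 1]))
        for i, step in enumerate(steps)
    ]
    for t in threads:
        t.start()
    for record in source:  # "extract": feed source records into the first queue
        queues[0].put(record)
    queues[0].put(_DONE)
    results = []
    while True:  # "load": drain the last queue into a list
        item = queues[-1].get()
        if item is _DONE:
            break
        results.append(item)
    for t in threads:
        t.join()
    return results


def run_flow(branches):
    """Run independent branches concurrently (branch parallelism)."""
    with ThreadPoolExecutor(max_workers=len(branches)) as pool:
        futures = {
            name: pool.submit(run_branch, source, steps)
            for name, (source, steps) in branches.items()
        }
        return {name: future.result() for name, future in futures.items()}


# Hypothetical flow with two independent branches.
flow = {
    "names": (["ann ", " bob"], [str.strip, str.upper]),
    "amounts": (["1", "2", "3"], [int, lambda n: n * 10]),
}
result = run_flow(flow)
# result == {"names": ["ANN", "BOB"], "amounts": [10, 20, 30]}
```

Each unit runs in a single thread and the queues are FIFO, so record order within a branch is preserved. A real ETL engine would also need error handling, back-pressure, and the flow-simplification step, none of which this sketch attempts.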

Description

Technical field

[0001] The present invention relates to data processing technology, in particular to the processing technology in the process of data migration and conversion, specifically an ETL-based data optimization method and equipment.

Background technique

[0002] In the process of enterprise information construction, it often involves processing a large amount of scattered and heterogeneous data, and the process of data extraction, transformation, and loading (Extract-Transform-Load, ETL) is an essential part of the process. In the prior art, there are mainly the following ways to realize the ETL process:

[0003] 1. Hardcoded

[0004] Hard coding is an independently running program compiled by high-level languages (such as C, C++) or scripts, or a dynamic link library embedded in the ETL framework. The advantage of this method is flexibility, as long as the data type and processing logic supported by the programming language used can be realized in the ETL proce...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 李纪洲, 周徐波, 王星宇
Owner: BEIJING JOIN CHEER SOFTWARE