Unlock instant, AI-driven research and patent intelligence for your innovation.

Method for realizing ETL conversion processing by utilizing programmable function expression based on XML description in big data scene

A conversion processing and big data technology, which is applied in special data processing applications, database management systems, structured data retrieval, etc., can solve problems such as the inability to reuse special architectures, inability to process logic, and insufficient SQL expressiveness, etc., to reduce the number of programs Coding work, improved data processing performance, effects with excellent native performance

Pending Publication Date: 2020-04-07
PRIMETON INFORMATION TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0014] 1. The process of data import and SQLETL extraction consumes a lot of IO performance and computing resources;
[0015] 2. Unable to adapt to big data scenarios. Most databases are stored in separate databases and tables. When faced with the need to store billions or more data, they are often incapable;
[0016] 3. The expressive power of SQL is insufficient, and it cannot handle some complex logic;
[0017] 4. From the basic data warehouse to the theme database, it often needs to go through multiple associations and multiple cleaning services, and the intermediate tables take up a lot of storage space; the special architecture implemented by high-level programming language cannot be reused, and can only solve the problem in specific scenarios. ETL requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for realizing ETL conversion processing by utilizing programmable function expression based on XML description in big data scene
  • Method for realizing ETL conversion processing by utilizing programmable function expression based on XML description in big data scene
  • Method for realizing ETL conversion processing by utilizing programmable function expression based on XML description in big data scene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to describe the technical content of the present invention more clearly, further description will be given below in conjunction with specific embodiments.

[0052] Under this big data scene of the present invention, based on XML description, realize the method that utilizes programmable function formula to carry out ETL conversion processing, which comprises the following steps:

[0053] (1) Load the control file and perform data file type segmentation;

[0054] (2) Load data files according to the preset batch size, traverse the data rows, and process them one by one;

[0055] (3) Perform data analysis;

[0056] (3.1) Perform basic type analysis;

[0057] carry out the assignment operation;

[0058] (3.2) Perform expression analysis;

[0059] Perform operations between two basic type variables or operations between a basic type variable and a basic type constant;

[0060] (3.3) Perform user-defined function type analysis;

[0061] (3.3.1) Determine wheth...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for realizing ETL (Extensible Transform and Load) conversion processing by utilizing a programmable function expression based on XML (Extensive Makeup Language) description in a big data scene, which comprises the following steps of: loading a control file, and segmenting the type of a data file; loading the data file according to a preset batch amount, traversingdata lines, and processing the data lines one by one; performing data analysis; carrying out field type verification; and selecting a control file, operating the data and outputting data, and storingthe data output data in various storage systems. By the adoption of the method for achieving ETL conversion processing through the programmable function expression based on XML description in the bigdata scene, data analysis work is submitted to XML analysis control, different configuration files are loaded according to different logistics enterprises, and different logistics data can be cleanedinto unified standard data. According to the technical scheme, a large amount of program encoding work is reduced, different data can be controlled and maintained, the performance is close to the excellent native performance, and the data processing performance is improved.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to the field of big data processing, and specifically refers to a method for implementing ETL conversion processing by using programmable functions based on XML description in a big data scene. Background technique [0002] ETL is the abbreviation of Extract-Transform-Load in English, which is used to describe the process of extracting, transforming, and loading data from the source to the destination. ETL is an important part of building a data warehouse. Users extract the required data from the data source, after data cleaning and processing, and finally load the data into the target data warehouse according to the pre-defined model to do various business intelligence. analysis or for master data management systems. [0003] Specifically: [0004] Data extraction: read data from various original business systems or unstructured documents; [0005] Data conversion: con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/25G06F16/81
CPCG06F16/254G06F16/81
Inventor 赵平西顾伟王葱权
Owner PRIMETON INFORMATION TECH