ETL realization method for incremental data of Excel file

An implementation method and incremental data technology, applied in the direction of program control devices, etc., can solve the problems of lack of flexibility and applicability, and achieve the effect of wide application range, simple operation and strong flexibility

Active Publication Date: 2010-04-21
山东中创软件商用中间件股份有限公司
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because Excel is widely used and its formats are diversified, most of the methods that support Excel data ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • ETL realization method for incremental data of Excel file
  • ETL realization method for incremental data of Excel file

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be further explained and illustrated in non-limiting embodiments below.

[0036] An ETL implementation method for incremental data of Excel files, such as figure 1 As shown, the method starts at step 101, parses the Xml file, and obtains source configuration information.

[0037] Then enter step 102 to determine whether the extraction mode is real-time or timed or triggered by parsing the Xml file to obtain the content of the runMode item in the configuration information.

[0038] In step 103, according to the configuration information of the remoteFile item in Xml, it is determined whether to extract data from a remote or local Excel file. If it is a remote file, go to step 1041 to create a FileObject object in the base directory using a remote method; if it is a local file, Then go to step 1042, use the local method to create the FileObject object of the base directory.

[0039] Then go to step 105, extract the information related to the extraction...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an ETL realization method for incremental data of an Excel file, which comprises the following steps of: based on InfoSib-ExcelSource and InfoSib-ExcelSink components on an open source JavaApi, extracting file increment and whole volume of file content; and supporting the extraction of the file content of a specified part; supporting the extraction of a common file and a file in a special format separately; supporting file filtering, Sheet form filtering and data column filtering; supporting an Excel to an Excel, the Excel to database, and the database to the Excel; supporting long-distance Excel data extraction; supporting a repeated operations of one-time configuration; and supporting three extraction patterns, namely a real-time extraction pattern, a timely extraction pattern and a triggering extraction pattern.

Description

Technical field [0001] The invention relates to an ETL realization method for the incremental data of an Excel file. Background technique [0002] At present, the most convenient, fast and easy-to-use software in the process of data processing should be Microsoft Excel. It is widely used in the world today for data storage, simple calculation of data, and display of data. Therefore, in the ETL process of data, support for Excel should and must be included. However, it is precisely because of the wide application of Excel and the diversification of formats that most of the methods that support Excel data ETL are simply supporting Excel document data ETL in a fixed format. These methods lack effective flexibility and applicability. Summary of the invention [0003] The purpose of the present invention is to solve the above-mentioned shortcomings and provide a kind of InfoSib-ExcelSource and InfoSib-ExcelSink components based on open source JavaApi, which adopts the extraction of fi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/44
Inventor 扶文海舒琦陈俊
Owner 山东中创软件商用中间件股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products