A distributed ETL data acquisition method and device
A data collection and distributed technology, applied in structured data retrieval, database management system, database distribution/replication, etc., can solve problems such as time-consuming, collection efficiency and progress cannot be guaranteed, and achieve the effect of improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0089] In the above, the step S16), if yes, go to the concurrent processing sub-flow;
[0090] See figure 2 , The concurrent processing sub-process includes steps,
[0091] S161) The ETL data server fetches a task from the task list;
[0092] S162) The ETL data server calls the row lock mechanism of the ORACLE database, filters the task request instruction of one of the ETL execution servers to match the task, and locks the task record, then changes the task status to executing, and finally performs the task record Unlock
[0093] S163) The ETL data server returns pairing success information to the successfully matched ETL execution server, and pairing failure information to the remaining ETL execution servers.
[0094] The steps after S21 of the collection task execution flow include:
[0095] S22) The ETL execution server feeds back the completion information of the collection task to the ETL data server;
[0096] S23) The task execution status is updated to the execution completed.
...
Embodiment 2
[0099] In the above, in step S12, if yes, the ETL data server judges the execution status of the historical collection task corresponding to the current collection task in the task list. If the execution is completed, go to step S13, if it is in execution, the current task will not be added The task list returns to step S11, if it is to be executed, the current task is not added to the task list and returns to step S11.
[0100] In this embodiment, it is determined whether the task that is the same as the current collection task in the task list already exists. Because data collection usually repeats detection collection (incremental collection) according to a certain execution cycle, only one collection task needs to be set, and it can be repeatedly activated according to the set cycle. However, in order to avoid repeated collection in a short period of time, or overlap of the same collection task before and after, the comparison process between the current collection task and t...
Embodiment 3
[0103] In the above, in the step S13, the ETL data server initializes the date replacement flag table.
[0104] In this embodiment, the ETL data server initializes the date replacement flag table for the purpose of replacing the specific value of the condition when splicing input sentences. For example, replacing the macro string in the data source sql, thereby replacing the string represented by the original macro with data with real meaning.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


