ETL process scheduling method and device

Process and scheduling model technology, applied in the field of data processing, addresses problems such as scheduling time interval errors, insufficient real-time scheduling, and the inability to query process scheduling model objects, thereby ensuring timeliness and avoiding scheduling errors and degraded real-time performance.

Active Publication Date: 2020-12-29
WUHAN DAMENG DATABASE

AI Technical Summary

Problems solved by technology

[0005] In view of the above defects or improvement needs of the prior art, the present invention solves the scheduling time interval errors and insufficient real-time performance that Timer-based scheduling may cause.
[0006] Further, it also solves three related problems: the valid process scheduling model objects must change in step with dynamic changes to the process model objects and scheduling model objects in ETL scheduling; process scheduling model objects cannot be queried; and exceptions cannot be handled.

Examples

Embodiment 1

[0029] When an ETL tool schedules processes periodically, it must keep time for each scheduling task and, when the preset scheduling time point is reached, schedule and execute the process model object and scheduling model object corresponding to that task. However, the existing Timer-based scheme is based on absolute time, does not catch exceptions, and schedules on a single thread, so a process model object may fail to start executing at the correct time. This embodiment therefore provides a new scheduling method that avoids the incorrect scheduling that can occur with a Timer.
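The exception problem described above can be reproduced with a short sketch. It assumes the Timer in question behaves like Java's java.util.Timer (the source does not name the language); the class name TimerPitfallDemo and the simulated failure are hypothetical. One task throws an unchecked exception, the single timer thread dies, and every later attempt to schedule work is rejected.

```java
import java.util.Timer;
import java.util.TimerTask;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

class TimerPitfallDemo {

    /**
     * Returns true if the Timer rejects new tasks after one task threw an
     * unchecked exception, i.e. the single timer thread has terminated.
     */
    static boolean timerDiesAfterUncaughtException() {
        Timer timer = new Timer("etl-timer", true); // daemon thread
        CountDownLatch started = new CountDownLatch(1);

        timer.schedule(new TimerTask() {
            @Override
            public void run() {
                started.countDown();
                // Hypothetical failure inside a scheduled ETL process.
                // The JVM prints its stack trace to stderr.
                throw new IllegalStateException("simulated process failure");
            }
        }, 0L);

        try {
            if (!started.await(5, TimeUnit.SECONDS)) {
                return false;
            }
            // The timer thread needs a moment to unwind; retry for up to ~5 s.
            for (int i = 0; i < 100; i++) {
                try {
                    timer.schedule(new TimerTask() {
                        @Override
                        public void run() { }
                    }, 60_000L);
                } catch (IllegalStateException rejected) {
                    return true; // the Timer no longer accepts tasks
                }
                Thread.sleep(50);
            }
            return false;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            timer.cancel();
        }
    }

    public static void main(String[] args) {
        System.out.println("timer rejects new tasks: " + timerDiesAfterUncaughtException());
    }
}
```

Because all tasks share that one thread, a long-running task delays every task queued behind it in the same way that a failing task stops them.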

[0030] As shown in Figure 1, the specific steps of the ETL process scheduling method provided by this embodiment of the present invention are as follows:

[0031] Step 101: Create a delay queue.

[0032] In the current process scheduling method, the basic processing model of Timer is the task queue model of singl...
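As an illustration of Step 101, the following minimal sketch creates a delay queue using Java's java.util.concurrent.DelayQueue, which is an assumption since the source does not name a library. The Element class is a hypothetical, stripped-down scheduling element. The sketch measures delay against System.nanoTime() so that wall-clock adjustments do not shift due times; the source does not state whether the patented method does the same.

```java
import java.util.concurrent.DelayQueue;
import java.util.concurrent.Delayed;
import java.util.concurrent.TimeUnit;

class DelayQueueSketch {

    /** Hypothetical scheduling element: only the fields needed to show ordering. */
    static final class Element implements Delayed {
        final String name;
        final long dueNanos; // measured on the monotonic System.nanoTime() clock

        Element(String name, long delayMillis) {
            this.name = name;
            this.dueNanos = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(delayMillis);
        }

        @Override
        public long getDelay(TimeUnit unit) {
            // Remaining delay; zero or negative means the element is due.
            return unit.convert(dueNanos - System.nanoTime(), TimeUnit.NANOSECONDS);
        }

        @Override
        public int compareTo(Delayed other) {
            return Long.compare(getDelay(TimeUnit.NANOSECONDS),
                                other.getDelay(TimeUnit.NANOSECONDS));
        }
    }

    /** Inserts two elements out of order and reports the order in which they come out. */
    static String demo() {
        DelayQueue<Element> queue = new DelayQueue<>();
        queue.put(new Element("late", 400));
        queue.put(new Element("early", 200));

        StringBuilder events = new StringBuilder();
        // Nothing is due yet, so the non-blocking poll() returns null.
        events.append(queue.poll() == null ? "not-due-yet" : "unexpected");
        try {
            // take() blocks until the head element's delay has expired.
            events.append(',').append(queue.take().name);
            events.append(',').append(queue.take().name);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return events.toString();
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

The queue hands out elements strictly by due time, regardless of insertion order, which is the property the later steps rely on.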

Embodiment 2

[0064] Based on the ETL process scheduling method provided in Embodiment 1, in different specific application scenarios, it can also be supplemented and adjusted according to different usage requirements or actual scenarios.

[0065] In ETL process scheduling, a process model object can be configured with one or more scheduling tasks, so that the same process model object is executed at different scheduling times, for example executing a file Load once and also executing the file Load at a fixed interval. A scheduling task can likewise be referenced by one or more process model objects, so that multiple process model objects are executed sequentially or concurrently at the same scheduling time, for example executing preset ETL processes at a fixed time point. Multiple scheduling tasks can also be configured for multiple process model objects, for example executing a preset ETL process once a day at a valid date and time. In step 102, when encapsulating the scheduling task into a scheduling element, it is necessary to ...

Embodiment 3

[0080] On the basis of the ETL process scheduling method provided in Embodiment 1 and Embodiment 2, the present invention also provides a device for ETL process scheduling that can be used to implement the above method. Figure 4 is a schematic diagram of the device architecture of this embodiment. The device for ETL process scheduling in this embodiment includes one or more processors 21 and a memory 22. Figure 4 takes one processor 21 as an example.

[0081] The processor 21 and the memory 22 can be connected by a bus or by other means; Figure 4 takes connection via a bus as an example.

[0082] The memory 22, as a non-volatile computer-readable storage medium for the ETL process scheduling method, can be used to store non-volatile software programs, non-volatile computer-executable programs and modules, such as the ETL process scheduling method of Embodiment 1 and Embodiment 2. The processor 21 runs the non-volat...

Abstract

The invention relates to the field of data processing, in particular to an ETL process scheduling method and device. The method mainly comprises the following steps: creating a delay queue; encapsulating scheduling tasks that need to be executed periodically into scheduling elements and putting the scheduling elements into the delay queue, each scheduling element comprising a scheduling time, a process model object and a scheduling model object; when the scheduling time of a scheduling element arrives, taking the scheduling element out of the delay queue and executing its process model object; and, according to the type of the scheduling model object of the scheduling element, calculating the next scheduling time of the scheduling element at a preset time node, and if the next scheduling time is a valid time, changing the scheduling time of the scheduling element to the next scheduling time and putting the scheduling element back into the delay queue for the next round of scheduling. Through the delay queue and the process scheduling model object list, the periodic scheduling of processes can be managed conveniently and effectively, and the timeliness and accuracy of process scheduling operations are ensured.
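The cycle summarized in the abstract (take out, execute, compute the next time, put back if valid) might be sketched as follows. This is not the patented implementation: it assumes Java's DelayQueue and a small worker pool, and the names EtlDelayScheduler, ScheduleModel and fixedInterval are hypothetical. The process model object is reduced to a Runnable, and the scheduling model object to a function that yields the next run time or nothing once the valid period has ended.

```java
import java.util.OptionalLong;
import java.util.concurrent.DelayQueue;
import java.util.concurrent.Delayed;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

class EtlDelayScheduler {

    /** Hypothetical scheduling model object: next run time, or empty when none is valid. */
    interface ScheduleModel {
        OptionalLong nextTime(long previousMillis);
    }

    /** Fixed-interval schedule that stays valid up to endMillis (inclusive). */
    static ScheduleModel fixedInterval(long intervalMillis, long endMillis) {
        return prev -> {
            long next = prev + intervalMillis;
            return next <= endMillis ? OptionalLong.of(next) : OptionalLong.empty();
        };
    }

    /** Scheduling element: scheduling time, process model object, scheduling model object. */
    static final class Element implements Delayed {
        long timeMillis; // only modified while the element is outside the queue
        final Runnable process;
        final ScheduleModel schedule;

        Element(long timeMillis, Runnable process, ScheduleModel schedule) {
            this.timeMillis = timeMillis;
            this.process = process;
            this.schedule = schedule;
        }

        @Override
        public long getDelay(TimeUnit unit) {
            return unit.convert(timeMillis - System.currentTimeMillis(), TimeUnit.MILLISECONDS);
        }

        @Override
        public int compareTo(Delayed other) {
            return Long.compare(getDelay(TimeUnit.MILLISECONDS),
                                other.getDelay(TimeUnit.MILLISECONDS));
        }
    }

    /** Schedules one process at start, start+30 ms and start+60 ms; returns the run count. */
    static int runDemo() {
        DelayQueue<Element> queue = new DelayQueue<>();
        ExecutorService workers = Executors.newFixedThreadPool(2);
        AtomicInteger runs = new AtomicInteger();

        long start = System.currentTimeMillis() + 20;
        queue.put(new Element(start, runs::incrementAndGet, fixedInterval(30, start + 60)));

        try {
            Element due;
            // The timeout only lets the demo end once nothing is re-queued.
            while ((due = queue.poll(500, TimeUnit.MILLISECONDS)) != null) {
                Runnable process = due.process;
                workers.submit(() -> {
                    try {
                        process.run(); // execute the process model object
                    } catch (RuntimeException ex) {
                        // Caught here so a failing process cannot stop the scheduler.
                        System.err.println("process failed: " + ex);
                    }
                });
                OptionalLong next = due.schedule.nextTime(due.timeMillis);
                if (next.isPresent()) {
                    due.timeMillis = next.getAsLong(); // update the scheduling time
                    queue.put(due);                    // put the element back
                }
            }
            workers.shutdown();
            workers.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        } finally {
            workers.shutdownNow();
        }
        return runs.get();
    }

    public static void main(String[] args) {
        System.out.println("runs: " + runDemo());
    }
}
```

In this sketch the next time is derived from the previous scheduled time rather than from the moment the process finished, so a slow run does not push later runs back, and runDemo() returns 3. Handing execution to a worker pool is a design choice of the sketch; it keeps one long-running process from delaying the others.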

Description

Technical field

[0001] The invention relates to the field of data processing, in particular to a method and device for ETL process scheduling.

Background

[0002] When ETL is used for data processing and for building a data warehouse, the extraction, cleaning, transformation, and loading of heterogeneous data sources are generally configured through a process designer and are periodically scheduled and executed in the background on the server.

[0003] Existing ETL tools generally use the Timer mechanism that comes with the programming language to manage the periodic scheduling of processes. The Timer times the interval for each process model object that needs periodic scheduling and, when the predetermined interval is reached, notifies the system to start executing the process model object. However, the basic processing model of Timer is a task queue model for single-thread scheduling. When a task takes a long time to execute, it will affect the rea...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/48, G06F16/25
CPC: G06F16/254, G06F9/4806, G06F9/4843
Inventor: 付铨, 梅纲, 高东升
Owner: WUHAN DAMENG DATABASE