Unlock instant, AI-driven research and patent intelligence for your innovation.

A data acquisition method, device and system for a data warehouse

A data warehouse and data acquisition technology, applied in the field of data processing, can solve the problems of server performance deterioration and time lag in the data warehouse, and achieve the effect of improving efficiency and avoiding server performance deterioration

Active Publication Date: 2019-03-26
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing data collection methods for data warehouses have a time lag, and in order to ensure timeliness, the same tables in each database server are generally extracted concurrently, so as to ensure that the same tables are extracted to the data warehouse at similar time points , and then shorten the waiting time for the subsequent merging of the same table, but the extraction tasks of the data warehouse are relatively concentrated, resulting in relatively concentrated use of server resources in the data warehouse, and the server performance of the data warehouse will deteriorate.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data acquisition method, device and system for a data warehouse
  • A data acquisition method, device and system for a data warehouse
  • A data acquisition method, device and system for a data warehouse

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0024] figure 1 It is an implementation flowchart of a data collection method for a data warehouse provided in the first embodiment of the present invention. The method can be executed by a data collection device for a data warehouse, where the device can be implemented by software and / or hardware, and can be used as a data Part of the warehouse is built inside the data warehouse. Such as figure 1 As shown, the implementation process includes:

[0025] Step 11, periodically detecting whether a preset data extraction event in at least one database server connected to the data warehouse is triggered.

[0026] Among them, the data warehouse, as a factory for data collection, data processing, and data output, supports various data requirements such as data analysis, reporting, and mining, and plays the role of data-driven value. The warehouses used to store commodities are distributed all over the country, and each warehouse independently deploys the same WMS (Warehouse Managem...

no. 2 example

[0040] On the basis of the foregoing embodiments, this embodiment provides a data collection method for a data warehouse.

[0041] figure 2 and image 3 All are flowcharts of realizing a data collection method of a data warehouse provided in the second embodiment of the present invention. combine figure 2 and image 3 , the data acquisition method of the data warehouse includes:

[0042] Step 21, the task scheduling unit in the data warehouse creates a data extraction task.

[0043] Still taking the connection between the data warehouse and the database servers corresponding to M warehouses as an example, the task scheduling unit creates M tasks, and each task is used to extract data from N tables in a database server.

[0044] Step 22, setting a time threshold.

[0045] For the task created by the task scheduling unit, set the time threshold (ie the latest extraction time), and the database server that has not been extracted by the time threshold will start to execute...

no. 3 example

[0057] Figure 4 It is a schematic structural diagram of a data collection device for a data warehouse provided in the third embodiment of the present invention, and the device can be built in the data warehouse. Such as Figure 4 As shown, the data acquisition device of the data warehouse includes a data extraction detection unit 31 and a data extraction storage unit 32 .

[0058] Wherein, the data extraction detection unit 31 is used to regularly detect whether the preset data extraction event in at least one database server connected to the data warehouse is triggered;

[0059] The data extraction storage unit 32 is configured to extract and store data in any one of the database servers when it is detected that a data extraction event in the database server is triggered.

[0060] Optionally, the data extraction detection unit 31 includes:

[0061] The check-in detection subunit is used to regularly detect whether the check-in event in each of the database servers is trig...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention, belonging to the technical filed of data processing, relates to a method, apparatus and system for collecting data from a data warehouse. The method comprises: periodically detecting whether a data extraction event preset in at least one database server that is connected to the data warehouse is triggered; and extracting and storing data in the database server when it is detected that a data extraction event preset in any of the at least one database server is triggered. According to the method, trigger events for extracting data from database servers are respectively determined, thereby dispersing the time for performing data extraction tasks, preventing degradation of server performance of the data warehouse caused by concurrent data extraction tasks, and improving data collection efficiency.

Description

technical field [0001] The invention belongs to the technical field of data processing, and relates to a data collection method, device and system of a data warehouse. Background technique [0002] As a factory for data collection, data processing, and data output, the data warehouse supports various data requirements such as data analysis, reporting, and mining, and plays the role of data-driven value. Among them, the data warehouse collects data from the distributed database server is a key step for data to enter the data warehouse. The quality and timeliness of the access data directly affect the processing and output quality of data in the data warehouse. [0003] At present, the data collection of the data warehouse has a time lag. In order to collect all the data of the previous day, a time threshold is generally set after 24:00 of the day, and a data extraction task is bound through the task scheduler. Start the data extraction task to extract data from each database...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/25
CPCG06F16/254
Inventor 尹翔
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD