Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A data extraction method and system supporting resumed data transfer

A technology of data extraction and breakpoint resuming, applied in database management systems, transmission systems, structured data retrieval, etc., can solve the problem of inability to achieve data extraction in batches and pages, set the starting value of batch import data, and extract efficiency. Low problems, to avoid excessive memory consumption, ensure stability, and ensure accuracy

Active Publication Date: 2022-03-04
LINEWELL SOFTWARE
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] After the server fails, the data needs to be re-extracted, which makes the extraction efficiency low and wastes time
[0007] In the existing technology, information such as the initial value, interval value, maximum number of extracted records for each scheduling, and number of query database pages are not set according to the analysis data, resulting in the inability to realize data extraction and extraction in batches and pages low efficiency;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data extraction method and system supporting resumed data transfer
  • A data extraction method and system supporting resumed data transfer

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0042]The present invention limits the range of data extraction in batches, and the maximum number of records to be extracted for each schedule. If the number of records in the range is greater than the set maximum value, the interval time needs to be equally divided to obtain the former, and so on to know the range data that satisfies the extraction. The number is less than or equal to the maximum number of records set.

[0043] The status information of each paging extraction is recorded, that is, the extraction time cut-off point, the number of completed pages, and the total number of pages. When unforeseen problems such...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of Internet information processing, and discloses a data extraction method and system supporting breakpoint resume transmission. The data extraction system includes a configuration database module, a setting module, an extraction data module, and a recording and storage module; the data extraction method is first based on The extracted data source information configures the extraction database, selects the incremental field; then sets the start time, interval time, maximum number of records extracted each time, number of records extracted by each scheduling page, and extracts data; then records and saves the number of process scheduling and the total number of processes. The number of records extracted, the number of records extracted last time in the process, the number of records extracted by the current schedule, the number of completed pages, and the total number of pages. After the server fails, the extraction will start from the next page that has been completed before. The data extraction method of the present invention can avoid excessive load on the server, ensure the stability of the server, prevent repeated extraction of completed data, and improve the efficiency of data extraction.

Description

technical field [0001] The invention belongs to the technical field of Internet information processing, and relates to a data extraction method and system supporting breakpoint resume transmission. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: [0003] In recent years, with the continuous development of the Internet and the rapid changes in human production and lifestyle, a large amount of business data has been accumulated. It is necessary to analyze, mine and extract a large amount of data into useful knowledge to form reports and charts, thus BI was born. ETL (Extract-Transform-Load) is the most important link in the formation of a data warehouse for BI projects. [0004] Commonly used ETL tools such as Datastage, Powercenter, Automatic, Kettle, NIFI, etc. have their own tools or processors for data extraction, transformation and loading. In the process of data extraction, it is necessary to ensure the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/25H04L67/101H04L67/1074
CPCH04L67/101H04L67/1078
Inventor 陈晓伟刘春忆
Owner LINEWELL SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products