Data extraction method and apparatus, computer storage medium and computer device

A data extraction and data storage technology, applied in the field of data processing, that addresses slow-running data extraction tasks, failed data extraction tasks, and insufficient memory. It reduces the volume of data extracted at a time and the frequency of garbage collection, which improves the user experience.

Active Publication Date: 2019-02-19
360 TECH GRP CO LTD
Cites: 4 · Cited by: 4
AI Technical Summary

Problems solved by technology

Because the amount of data to be extracted is large, many objects are created and garbage collection (GC) is triggered frequently, which slows the data extraction task. Most of the running time is spent waiting for GC, and the task may fail. In this situation, the main cause of the failure is memory overflow (commonly understood as insufficient memory).

Embodiment Construction

[0024] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0025] Figure 1 shows a schematic flowchart of a data extraction method according to an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:

[0026] Step S100: upon receiving a full-table data query request, determine whether to switch the query mode according to the total number of rows and the average row length of the data table; if so, execute step S101.
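
The decision in step S100 can be illustrated with a minimal sketch. The source states only that the decision depends on the total number of rows and the average row length; the specific rule used here (estimated size = rows × average length, compared against a memory budget) is an assumption, and the function name `should_switch_to_paging` and the default budget of 256 MiB are hypothetical.

```python
def should_switch_to_paging(total_rows: int,
                            avg_row_length: int,
                            max_bytes: int = 256 * 1024 * 1024) -> bool:
    """Decide whether a full-table query should become a paging query.

    Assumption: the table size is estimated as rows * average row length
    (in bytes) and compared with a hypothetical memory budget. The patent
    text available here does not give the actual rule.
    """
    estimated_bytes = total_rows * avg_row_length
    return estimated_bytes > max_bytes


# A large table exceeds the budget; a small one does not.
large_table = should_switch_to_paging(total_rows=10_000_000, avg_row_length=200)
small_table = should_switch_to_paging(total_rows=1_000, avg_row_length=100)
```

In practice the budget would presumably be tuned to the memory available on the process that runs the extraction.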

[0027...

Abstract

The invention discloses a data extraction method and device, a computing device, and a computer storage medium. The method comprises the following steps: upon receiving a full-table data query request, determining whether to switch the query mode according to the total number of rows and the average row length of the data table; if so, switching the query mode from full-table query to paging query; and extracting the corresponding data from the data source by paging query and storing it in the distributed buffer. Because the paging query reduces the amount of data extracted each time, it lowers the frequency of GC so that GC returns to a normal level, prevents the memory overflow error from recurring, improves stability, and gives the user a good experience.
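
The paging extraction described in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: it uses Python's built-in `sqlite3` in place of the real data source, a plain dictionary as a stand-in for the distributed buffer, and `LIMIT`/`OFFSET` as one possible way to page. The table `events`, its `id` column, and the function `extract_in_pages` are hypothetical.

```python
import sqlite3
from typing import Dict, Iterator, List, Tuple


def extract_in_pages(conn: sqlite3.Connection,
                     table: str,
                     page_size: int) -> Iterator[List[Tuple]]:
    """Yield the rows of `table` one page at a time.

    Only one page is held in memory at once, so far fewer objects are
    alive than with a single full-table query.
    Assumption: `table` is a trusted name and has an `id` column to order by.
    """
    offset = 0
    while True:
        rows = conn.execute(
            f"SELECT * FROM {table} ORDER BY id LIMIT ? OFFSET ?",
            (page_size, offset),
        ).fetchall()
        if not rows:
            break
        yield rows
        offset += len(rows)


# Build a small in-memory table to stand in for the data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany(
    "INSERT INTO events (id, payload) VALUES (?, ?)",
    [(i, f"row-{i}") for i in range(1, 11)],
)

# A dict keyed by page number stands in for the distributed buffer.
cache: Dict[int, List[Tuple]] = {}
for page_no, page in enumerate(extract_in_pages(conn, "events", page_size=4)):
    cache[page_no] = page
```

With ten rows and a page size of four, the loop stores three pages. Offset-based paging rescans skipped rows on each query, so for very large tables a key-range condition on the ordering column may be a better fit.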

Description

Technical field

[0001] The present invention relates to the technical field of data processing, in particular to a data extraction method, device, computing equipment and computer storage medium.

Background technique

[0002] In distributed computing tasks, MySQL data extraction is limited by cluster machine authorization and needs to be run in Yarn-client mode. When the amount of data extracted from MySQL is large, the pressure on the Driver side is high, and frequent garbage collection (GC) occurs.

[0003] Here, GC refers to the following: after a piece of data is extracted, it is loaded into memory and an object is created, and that object has a life cycle. Because memory is limited, when the program no longer needs an object, the object must be destroyed and the memory it occupies released so that the space can be reused; this is when GC occurs. Due to the large amount of data to be extracted, more objects will be created, and frequent G...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F12/0806, G06F12/0813, G06F12/02
CPC: G06F12/0253, G06F12/0806, G06F12/0813, Y02D10/00
Inventor: 徐皓, 朱海龙, 杜文玉, 沈迪, 王素梅, 李铮
Owner: 360 TECH GRP CO LTD