A Loading Client Implementation Method Oriented to Distributed Data Warehouse

A loading client implementation method for distributed data warehouses. It addresses the slow loading of existing distributed data warehouses, improving data loading efficiency and reducing loading time.

Active Publication Date: 2019-11-01
BEIJING SCISTOR TECH +1

AI Technical Summary

Problems solved by technology

[0005] To address the slow loading of existing distributed data warehouses, the present invention provides a loading client implementation method for distributed data warehouses. The method supports simultaneous loading of multiple data tables and load balancing across multiple data warehouse nodes, and it achieves the highest loading rate with a small amount of memory, which effectively reduces loading time and supports the continued development of big data systems.




Embodiment Construction

[0048] The present invention will be further described in detail below in conjunction with the accompanying drawings.

[0049] The present invention relates to a distributed fast loading method that supports multiple tasks, multiple data tables, and multiple data warehouse nodes, and that achieves the highest loading rate while occupying the minimum memory. Figure 1 is a schematic diagram of the data structures used in the distributed data warehouse loading process. As the figure shows, the data handled during loading is organized into a field structure and a data packet structure. The field structure stores the attributes of the field content, the total data length, and a memory description; the memory it uses varies with the field type, which reduces memory usage and improves copy efficiency. An organizational structure that supports multi-threaded, multi-table loading is also given. After the data pa...
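The field structure and data packet structure described above can be illustrated with a minimal Python sketch. It is not the patented layout: the type names in `FIXED_FORMATS`, the length-prefixed encoding, and the helpers `encode_field` and `build_packet` are all assumptions made for illustration. The point it shows is that each field occupies only as many bytes as its type requires, and that the packet records the total data length.

```python
import struct
from dataclasses import dataclass

# Hypothetical type table; the source text does not list concrete field types.
FIXED_FORMATS = {"int32": "<i", "int64": "<q", "float64": "<d"}


@dataclass
class Field:
    name: str
    type_name: str  # attribute of the field content
    payload: bytes  # encoded value; its length depends on the field type


def encode_field(name, type_name, value):
    """Encode one value using only as many bytes as its type needs."""
    if type_name in FIXED_FORMATS:
        payload = struct.pack(FIXED_FORMATS[type_name], value)
    elif type_name == "string":
        payload = value.encode("utf-8")
    else:
        raise ValueError(f"unsupported field type: {type_name}")
    return Field(name, type_name, payload)


def build_packet(table, fields):
    """Build one packet: table name, total body length, length-prefixed fields."""
    body = b"".join(
        struct.pack("<I", len(f.payload)) + f.payload for f in fields
    )
    header = table.encode("utf-8")
    return (
        struct.pack("<H", len(header)) + header
        + struct.pack("<I", len(body)) + body
    )


# Example row for a hypothetical table named "orders".
row = [encode_field("id", "int32", 7), encode_field("name", "string", "abc")]
packet = build_packet("orders", row)
```

Here the integer field takes 4 bytes and the string field takes 3, so no fixed-width slot is wasted on short values. A real implementation would also need to carry the field types or rely on a shared table schema.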



Abstract

The invention discloses a method for implementing an efficient loading client for distributed data warehouses and belongs to the field of information processing. The steps are as follows. First, the system startup parameters are initialized. The loader management module allocates one loader to each thread. Each thread builds a data parsing module, which parses the client data and passes it to the loader. Each loader calls the data checking module to validate the data. The validated field data is then stored in and managed by the data buffer module, which passes it to the data transmission module. The distributed node monitoring module obtains the monitoring status of each distributed data warehouse node and passes it to the data transmission module. Finally, the data transmission module sends the buffered data it has received to healthy distributed data warehouse nodes. The method improves the overall usage efficiency and data loading efficiency of distributed data warehouses, meets current application requirements, and has broad application prospects.
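The flow in the abstract can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the patented implementation: the class `NodeMonitor`, the functions `loader` and `sender`, the comma-separated input format, the round-robin choice among healthy nodes, and the in-memory `sent` dictionary standing in for real network transmission are all hypothetical.

```python
import itertools
import queue
import threading


class NodeMonitor:
    """Tracks which warehouse nodes are healthy (hypothetical stand-in)."""

    def __init__(self, nodes):
        self._status = {node: True for node in nodes}
        self._lock = threading.Lock()

    def set_status(self, node, healthy):
        with self._lock:
            self._status[node] = healthy

    def healthy_nodes(self):
        with self._lock:
            return [n for n, ok in self._status.items() if ok]


def loader(lines, n_fields, buffer):
    """One loader per thread: parse each line, check it, then buffer it."""
    for line in lines:
        record = line.split(",")       # data parsing (assumed CSV input)
        if len(record) == n_fields:    # data checking (assumed field count)
            buffer.put(record)


def sender(buffer, monitor, sent):
    """Send buffered records to healthy nodes only, in round-robin order."""
    counter = itertools.count()
    while True:
        record = buffer.get()
        if record is None:             # sentinel: no more data
            break
        nodes = monitor.healthy_nodes()
        if not nodes:
            raise RuntimeError("no healthy warehouse node available")
        node = nodes[next(counter) % len(nodes)]
        sent.setdefault(node, []).append(record)  # stands in for a network send


monitor = NodeMonitor(["node-a", "node-b", "node-c"])
monitor.set_status("node-b", False)    # simulate an unhealthy node
buffer = queue.Queue(maxsize=1024)     # bounded buffer caps memory use
sent = {}
inputs = [["1,x", "2,y", "bad"], ["3,z", "4,w"]]

tx = threading.Thread(target=sender, args=(buffer, monitor, sent))
workers = [
    threading.Thread(target=loader, args=(lines, 2, buffer)) for lines in inputs
]
tx.start()
for w in workers:
    w.start()
for w in workers:
    w.join()
buffer.put(None)
tx.join()
```

The bounded queue is one way to keep memory use small while the loaders and the sender run concurrently. Round-robin is only one possible balancing policy; the source does not state which policy the method uses.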

Description

Technical field

[0001] The invention belongs to the field of information processing, and in particular relates to a method for implementing a loading client for a distributed data warehouse.

Background technique

[0002] With the continuous development of computer technology and the steady advance of informatization, distributed storage is used more and more widely. Unlike the common centralized storage technology, distributed storage does not keep data on one or a few designated nodes. Instead, it uses the disk space of every machine in the enterprise over the network, combines these scattered storage resources into a single virtual storage device, and stores the data spread across the enterprise.

[0003] The efficiency of data loading in a distributed environment greatly affects the efficiency of the entire cluster. To improve data loading efficiency, optimize the use of the entire cluster, and reduce costs, a better and faster loading method is urgently needed. ...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/25
CPC: G06F16/25
Inventor: 王宇徐晓燕周渊刘利宏刘庆良郑彩娟黄成王振宇李斌斌周游
Owner: BEIJING SCISTOR TECH