Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cluster database data loading tool and method capable of supporting various data sources

A data loading and database technology, which is applied in the field of cluster databases, can solve problems such as failure to read data, insufficient permissions, and loading failure, and achieve the effects of enhanced authentication functions, reduced coupling probability, and reduced work costs

Active Publication Date: 2017-02-15
TIANJIN NANKAI UNIV GENERAL DATA TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] When using the loading tool to read data from a data source that requires security authentication, you must know the corresponding authentication method and configure the loading tool correctly. Otherwise, you will lack sufficient permissions when reading data, resulting in loading failure.
On the other hand, in the same data source, there may be multiple authentication methods coexisting and functioning at the same time. If the loading tool cannot handle this combination of authentication methods, it will also fail to read data due to insufficient permissions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cluster database data loading tool and method capable of supporting various data sources
  • Cluster database data loading tool and method capable of supporting various data sources
  • Cluster database data loading tool and method capable of supporting various data sources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0037] Embodiment: An embodiment of the present invention is a data loading process including three loading tasks, including different data sources and authentication methods: task one, HDFS file+simple authentication; task two, local file+no authentication required; task 3. HDFS file + simple authentication + HTTP proxy authentication. The loading and running process is as follows:

[0038] 1. Start the loading tool.

[0039] 2. Process input parameters.

[0040] 3. According to the parameters, the loading tool knows that there are 3 tasks, namely "Task 1", "Task 2" and "Task 3".

[0041] 4. Start to execute "Task 1".

[0042] 5. According to the parameters, determine to use the HDFS simple authentication method, create an authentication module, and create an authentication module list with a length of 1 (including the HDFS simple authentication module).

[0043] 6. Use the authentication module list and HDFS data source access module to read and load data.

[0044] 7. C...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a cluster database data loading tool and method capable of supporting various data sources, and aims to provide the cluster database data loading tool and method with the advantages of high speed in authentication and various authentication ways. The cluster database data loading tool comprises a loading tool use interface, a loading task management module, a loading task execution module, a data reading module, a data source access module, a data writing module and a database access module, wherein the loading tool use interface is used for starting loading tasks and transferring the parameter of each task to the loading task management module; the loading task management module is used for storing, maintaining and scheduling all received loading tasks; the loading task execution module is in charge of executing the specific loading task; the data reading module is used for providing a uniform data reading interface; the data source access module is used for reading data from a specific data source; the data writing module is used for providing a uniform data writing interface; and the database access module is used for loading the received data to a cluster database.

Description

technical field [0001] The invention relates to the field of cluster databases, in particular to a cluster database data loading tool and method supporting multiple data sources. Background technique [0002] The cluster database uses multiple ordinary servers to provide high-performance database services, and is suitable for the storage, query and analysis of massive data. Because it is suitable for big data occasions, it needs to cooperate with high-performance data loading tools to read data from various data sources and load them into the cluster database. In practical applications, due to information security considerations, data sources often have a security authentication mechanism, and there are multiple security mechanisms to choose from, and the loading tool must support the corresponding authentication mechanism in order to successfully read from the data source. data file. For example, HDFS data source supports three different authentication mechanisms: simple ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F21/45
CPCG06F16/254G06F21/45
Inventor 赵伟武新杨丽莹
Owner TIANJIN NANKAI UNIV GENERAL DATA TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products