A method and system for fast data loading of Hadoop

A data loading technology, applied in the computer field, that addresses the problems that data cannot be loaded to other nodes in parallel and that data loading efficiency is low, thereby improving loading efficiency.

Inactive Publication Date: 2018-12-25
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
3 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

In existing Hadoop systems, data can only be loaded to one Datanode first and cannot be unconditionally loaded to other nodes in parallel, which results in low data loading efficiency. The prior art offers no effective solution to the low efficiency of loading external data into the HDFS file system.

Examples

Embodiment Construction

[0032] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention are described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings. It should be noted that all expressions using "first" and "second" in the embodiments of the present invention serve to distinguish two entities that share a name but are not the same, or two parameters that are not the same. "First" and "second" are used only for convenience of expression and should not be construed as limiting the embodiments of the present invention; this will not be repeated in the subsequent embodiments.

[0033] Based on the above purpose, the first aspect of the embodiments of the present invention proposes an embodiment of a method capable of quickly loading different data to be loaded or different types of data to be loaded into HDFS. figure 2 What is sho...

Abstract

The invention discloses a fast data loading method and system for Hadoop. The method comprises the following steps: using DLS to collect metadata information from the metadata node in real time and to obtain information on the currently available data nodes from that metadata; dividing the data to be loaded on the data node into a plurality of data segments according to the data node information, and loading the data segments onto the currently available data nodes simultaneously; and ending the data load once load completion information has been received from all currently available data nodes. The invention can quickly load different data, or different types of data, into HDFS, and improves loading efficiency by loading in parallel.
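
The steps in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented implementation: the metadata schema, the function names (`available_nodes`, `split_into_segments`, `load_segment`, `parallel_load`), the in-memory `store` standing in for HDFS, and the choice of one segment per available node are all assumptions made for the example. No real Hadoop API is called.

```python
from concurrent.futures import ThreadPoolExecutor


def available_nodes(metadata):
    """Return the data nodes the metadata marks as alive (hypothetical schema)."""
    return [name for name, info in metadata.items() if info["alive"]]


def split_into_segments(data, n):
    """Split data into n contiguous segments whose sizes differ by at most one."""
    base, extra = divmod(len(data), n)
    segments, start = [], 0
    for i in range(n):
        end = start + base + (1 if i < extra else 0)
        segments.append(data[start:end])
        start = end
    return segments


def load_segment(node, segment, store):
    """Stand-in for writing one segment to one data node (no real HDFS call)."""
    store[node] = segment
    return node, len(segment)  # plays the role of "load completion information"


def parallel_load(data, metadata):
    """Load data onto all available nodes at once and wait for every completion."""
    nodes = available_nodes(metadata)
    if not nodes:
        raise RuntimeError("no available data nodes")
    # Assumption: one segment per currently available node.
    segments = split_into_segments(data, len(nodes))
    store = {}
    with ThreadPoolExecutor(max_workers=len(nodes)) as pool:
        futures = [
            pool.submit(load_segment, node, segment, store)
            for node, segment in zip(nodes, segments)
        ]
        # The load ends only after every node has reported completion.
        completions = [future.result() for future in futures]
    return store, completions


if __name__ == "__main__":
    metadata = {
        "dn1": {"alive": True},
        "dn2": {"alive": False},
        "dn3": {"alive": True},
    }
    store, completions = parallel_load(b"0123456789", metadata)
    print(completions)
```

In this sketch a node that the metadata reports as unavailable (`dn2`) simply receives no segment, and the thread pool stands in for whatever transport the system would actually use to write to each node.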

Description

Technical field

[0001] The present invention relates to the computer field, and more specifically to a fast data loading method and system for Hadoop.

Background technique

[0002] In a prior-art Hadoop distributed big data system, external data is usually stored in the ordinary file system of a particular Datanode. When this external data is loaded or imported into Hadoop, it is first stored in the HDFS file system of that same Datanode, and is stored in the HDFS file systems of other Datanodes only once the local HDFS file system is full. Therefore, data can only be loaded to one Datanode first and cannot be unconditionally loaded to other nodes in parallel, which results in low data loading efficiency.

[0003] There is currently no effective solution to the prior-art problem of low efficiency when loading external data into the HDFS file system.

Contents of the invention

[0004] In view of this, the purpose of the embodiment...

Application Information

IPC(8): G06F17/30
Inventor 魏本帅, 杜彦魁
Owner ZHENGZHOU YUNHAI INFORMATION TECH CO LTD