Copy access method and device for Hadoop distributed file system and Hadoop distributed file system

A copy access technique for distributed file systems, applied in the field of distributed architecture. It addresses uneven system load, placement and read strategies that do not consider load balancing, and the resulting degradation of write/read performance, with the effect of balancing system load and improving performance.

Status: Inactive
Publication Date: 2014-11-19
SHENZHEN INSTITUTE OF INFORMATION TECHNOLOGY
Cites: 2; Cited by: 14

AI Technical Summary

Problems solved by technology

[0005] However, the HDFS copy placement and reading strategies provided by the above-mentioned prior art consider only the network overhead of write/read operations. They do not balance the other loads placed on the system, such as CPU and disk load, which may lead to seriously uneven system load and thus degrade the performance of write/read operations.

Embodiment Construction

[0029] The invention provides a copy access method for the Hadoop distributed file system. The method comprises: obtaining node load information for the data nodes in the Hadoop distributed file system; when storing copies of the same data block, placing them, according to the node load information, on the data node where the client is located or on the data node with the lowest load; and, when reading a copy, reading from the data node with the lowest load and/or the data node at the smallest distance from the client. The invention also provides a corresponding copy access device for the Hadoop distributed file system, and the Hadoop distributed file system itself. Each is described in detail below.
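The placement step in [0029] can be illustrated with a minimal Python sketch. It is not the patented implementation and does not use any Hadoop API: the `DataNode` class, the single numeric `load` score, the `choose_write_targets` function, and the default of three copies are all assumptions made for illustration. The sketch places the first copy on the client's own data node when the client runs on one, then fills the remaining copies from the least-loaded nodes.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DataNode:
    name: str
    # Hypothetical combined load score (for example CPU and disk); lower is better.
    load: float


def choose_write_targets(
    nodes: List[DataNode],
    client_node: Optional[str],
    replicas: int = 3,
) -> List[str]:
    """Pick data nodes for the copies of one block (illustrative only)."""
    targets: List[str] = []
    known = {n.name for n in nodes}

    # Assumption: prefer the client's own node when the client is a data node.
    if client_node in known:
        targets.append(client_node)

    # Fill the remaining copies from the least-loaded nodes, ties broken by name.
    candidates = sorted(
        (n for n in nodes if n.name not in targets),
        key=lambda n: (n.load, n.name),
    )
    for node in candidates:
        if len(targets) >= replicas:
            break
        targets.append(node.name)
    return targets


cluster = [
    DataNode("dn1", 0.9),
    DataNode("dn2", 0.2),
    DataNode("dn3", 0.5),
    DataNode("dn4", 0.1),
]
print(choose_write_targets(cluster, client_node="dn1"))
print(choose_write_targets(cluster, client_node=None))
```

With the client on `dn1`, the first copy stays local even though `dn1` is the busiest node; with a client outside the cluster, all copies go to the least-loaded nodes. How the load score is computed from CPU and disk metrics is left open here.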

[0030] The copy access method of the Hadoop distributed file system in the embodiment of the present invention can be applied to the name node of the Hadoop distributed file system. The basic process of the copy access method of the Hadoop distributed file system provided by the embodiment of the present inven...

Abstract

The invention discloses a copy access method and device for a Hadoop distributed file system, and the Hadoop distributed file system itself, so as to balance system load and thereby improve read/write performance. The method comprises the following steps: node load information is acquired for the data nodes in the Hadoop distributed file system; when copies of the same data block are stored, they are placed, according to the node load information, on the data node where the client is located or on the data node with the minimum load; when a copy is read, it is read from the data node with the minimum load and/or from the data node at the minimum distance from the client. Because the method takes the node load information of each data node into account, the system load can be balanced and read/write performance is improved.
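The read step can be sketched in the same spirit. The abstract allows reading by minimum load "and/or" minimum distance without saying how the two are combined, so the sketch below assumes one possible reading: rank by one criterion and break ties with the other. `ReplicaHolder`, `choose_read_source`, the `prefer` parameter, and the sample load and distance values are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReplicaHolder:
    name: str
    load: float    # hypothetical load score; lower is better
    distance: int  # hypothetical network distance from the client; lower is closer


def choose_read_source(holders: List[ReplicaHolder], prefer: str = "load") -> str:
    """Pick the data node to read a copy from (illustrative only)."""
    if not holders:
        raise ValueError("no data node holds a copy of this block")
    if prefer == "load":
        # Lowest load first; distance breaks ties.
        key = lambda h: (h.load, h.distance, h.name)
    elif prefer == "distance":
        # Closest node first; load breaks ties.
        key = lambda h: (h.distance, h.load, h.name)
    else:
        raise ValueError("prefer must be 'load' or 'distance'")
    return min(holders, key=key).name


holders = [
    ReplicaHolder("dn1", load=0.7, distance=0),
    ReplicaHolder("dn2", load=0.2, distance=4),
    ReplicaHolder("dn3", load=0.2, distance=2),
]
print(choose_read_source(holders, prefer="load"))
print(choose_read_source(holders, prefer="distance"))
```

Preferring load selects `dn3` (tied with `dn2` on load, but closer), while preferring distance selects the local but busy `dn1`. A weighted score over both criteria would be another way to interpret "and/or".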

Description

Technical field

[0001] The invention relates to the field of distributed architecture, in particular to a copy access method and device for a Hadoop distributed file system, and to the Hadoop distributed file system.

Background technique

[0002] The Hadoop Distributed File System (HDFS) has a master-slave structure: an HDFS cluster is composed of a name node and at least one data node. The name node is a master server that manages the file namespace and regulates client access to files. It mainly performs file and directory operations in the file namespace, such as opening, closing, and renaming, and it also determines the mapping between blocks and data nodes. The data nodes store the actual data files, serve read and write requests from file system clients, and execute block creation, deletion, and block copy instructions from the name node.

[0003...

Application Information

IPC(8): G06F17/30, G06F9/50, H04L29/08
CPC: G06F9/5083, G06F16/182, H04L67/1001
Inventor: 袁芳, 李靖, 张宗平, 李鸣亮, 叶剑锋
Owner SHENZHEN INSTITUTE OF INFORMATION TECHNOLOGY