Hadoop heterogeneity method and system based on storage and acceleration optimization

A heterogeneous system technology, applied in the field of Hadoop heterogeneous systems, that addresses problems such as low resource utilization and wasted ReduceFPGA resources during acceleration, and achieves the effects of reducing difficulty, improving application execution performance, and improving data read and write performance.

Active Publication Date: 2017-08-29
HUAZHONG UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0004] At present, many companies, including Microsoft and Intel, are integrating FPGA accelerators into large-scale data centers, but large-scale deployment of accelerators brings several adverse effects. First, the cost of FPGA reconfiguration, programming, and optimization is magnified at scale: customizing FPGA functions requires manual description in a hardware description language followed by compilation onto the board, which places a heavy extra burden on developers. Second, because FPGAs themselves are expensive, the acceleration of big data analysis applications must be weighed against the cost of the FPGA cluster, so that as few accelerators as possible are used while application computing performance is improved as much as possible. Third, in existing designs for the MapReduce programming model, the resource utilization of the ReduceFPGA (reduction accelerator) is extremely low, so large-scale FPGA deployment may waste a great deal of ReduceFPGA resources.



Examples

Embodiment Construction

[0039] To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and embodiments. The specific embodiments described here serve only to explain the present invention, not to limit it. In addition, the technical features of the various embodiments described below can be combined with one another as long as they do not conflict.

[0040] First, a specific application scenario of the system of the present invention is introduced. In practice, the system may be a cluster of dozens or even thousands of computer nodes, and the computing nodes may belong to the same rack or to different racks, or may even be located in different data centers. After deploying and configuring according to Hadoop 2.0, install high-speed ser...


Abstract

The invention discloses a Hadoop heterogeneity method and system based on storage and acceleration optimization, belonging to the field of distributed computing. In the technical scheme, storage media are divided according to data processing requirements into three types (solid-state storage media, common storage media, and high-density storage media), so that the most appropriate storage mode is found for each type of data. Meanwhile, an application whose computing performance needs to be improved is dispatched to an FPGA accelerator or a GPU accelerator carrying the specific algorithm, which completes the calculation and thereby improves the application's processing performance; the algorithm functions and layouts of the FPGA and GPU accelerators can be statically switched. The invention further discloses the Hadoop heterogeneity system based on storage and acceleration optimization. The method and system improve the read and write performance of the whole cluster, the execution performance of application tasks, and the resource utilization of the acceleration devices.
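
The two decisions described in the abstract (picking a storage medium for a dataset, and routing a task to an accelerator that carries its algorithm) can be sketched roughly as follows. This is only an illustrative sketch, not the patented method: the access-frequency metric, the numeric thresholds, the `LOADED_ALGORITHMS` table, and all class and function names are hypothetical, since the abstract does not state the actual classification criteria or scheduling policy.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class StorageTier(Enum):
    SOLID_STATE = "solid-state"    # fastest medium
    COMMON = "common"              # ordinary disks
    HIGH_DENSITY = "high-density"  # capacity-oriented medium


class Accelerator(Enum):
    FPGA = "fpga"
    GPU = "gpu"


@dataclass
class Dataset:
    name: str
    reads_per_day: int  # hypothetical access-frequency metric


@dataclass
class Task:
    name: str
    needs_acceleration: bool
    algorithm: Optional[str] = None  # illustrative algorithm label


# Hypothetical thresholds; the source gives no numeric criteria.
HOT_READS_PER_DAY = 1000
COLD_READS_PER_DAY = 10

# Hypothetical table of which accelerator currently carries which algorithm.
LOADED_ALGORITHMS = {
    "kmeans": Accelerator.FPGA,
    "matrix_multiply": Accelerator.GPU,
}


def choose_tier(ds: Dataset) -> StorageTier:
    """Map a dataset to one of the three storage media by access frequency."""
    if ds.reads_per_day >= HOT_READS_PER_DAY:
        return StorageTier.SOLID_STATE
    if ds.reads_per_day <= COLD_READS_PER_DAY:
        return StorageTier.HIGH_DENSITY
    return StorageTier.COMMON


def choose_device(task: Task) -> Optional[Accelerator]:
    """Return the accelerator carrying the task's algorithm, or None for CPU."""
    if not task.needs_acceleration:
        return None
    # Falls back to None (CPU) when no accelerator carries the algorithm.
    return LOADED_ALGORITHMS.get(task.algorithm)


if __name__ == "__main__":
    print(choose_tier(Dataset("clickstream", 5000)))
    print(choose_device(Task("cluster-users", True, "kmeans")))
```

Falling back to the CPU when no accelerator carries the requested algorithm is a design choice made for this sketch; a real scheduler might instead queue the task or reconfigure a device.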

Description

Technical field

[0001] The invention belongs to the field of distributed computing, and more specifically relates to a Hadoop heterogeneous system based on storage and acceleration optimization.

Background

[0002] Data mining and machine learning are attracting more and more attention in industry, and the MapReduce (a distributed programming model) framework for big data processing applications is a programming model that is extremely easy to parallelize, owing to the characteristics of its Map (mapping) and Reduce (reduction) calculation stages. Because the MapReduce framework presents developers with simplified Map and Reduce interfaces, many problems such as parallelism, scalability, and portability can be solved. Its open source implementation is Hadoop (a distributed system infrastructure), which addressed the single point of failure problem and decoupled and upgraded its functions into YARN (Yet Another Resource Negotiator).

[0003] With the limitation of the CPU process ...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F3/06
Inventor: 李瑞轩, 黄逸伟, 辜希武, 李玉华, 吴文哲, 薛正元, 杨琪, 王号召
Owner: HUAZHONG UNIV OF SCI & TECH