Parallel accelerating method and system in heterogeneous computation

A heterogeneous and computing result technology, applied in computing, general-purpose stored program computer, program control design, etc., can solve problems affecting GPU acceleration efficiency, etc.

Active Publication Date: 2017-05-31
安徽智慧皆成数字技术有限公司
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The higher the bandwidth of the master node, the more slave nodes communicate with the master node at the same time, but in practical applications, due to hardware limitations, the bandwidth of the master node can only allow communication with a small number of slave nodes at the same time; if there are many slave nodes, then The longer the time for each GPU to complete a data update; if the bandwidth of the master node remains unchanged, as the size of the slave node increases, the time for each GPU data update will increase linearly, which seriously affects the acceleration efficiency of the GPU.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel accelerating method and system in heterogeneous computation
  • Parallel accelerating method and system in heterogeneous computation
  • Parallel accelerating method and system in heterogeneous computation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0050] Such as figure 2 As shown, it is a flowchart of a parallel acceleration method in heterogeneous computing in an embodiment of the present invention, including the following steps:

[0051] Step 201, predetermine the topology structure of data transmission between GPUs according to the number of GPUs.

[0052] The GPU can be stored in one server or in multiple servers. The number of GPUs is a power of 2, and the specific number can be predetermined according to actual application requirements, which is not limited in this embodiment of the present invention.

[0053] In the embodiment of the present invention, the data transmission topology between GPUs is mainly aimed at the case where the total number ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a parallel accelerating method and system in heterogeneous computation. The method comprises the following steps: determining a topological structure of data transmission between GPUs (Graphics Processing Unit) in advance according to the quantity of the GPUs; acquiring a current task by each GPU in the topological structure and computing data in the current task, so as to obtain a calculating result corresponding to the current task; sharing the calculating result which is obtained by each GPU and corresponds to the current task to the other GPUs in the topological structure; after obtaining the calculating results of the other GPUs in the topological structure by each GPU, starting to execute the next task. By adopting the parallel accelerating method and system in the heterogeneous computation, the parallel computation capability of the GPUs can be improved and the bandwidth requirements on nodes of each GPU are reduced.

Description

technical field [0001] The invention relates to the field of parallel computing, in particular to a parallel acceleration method and system in heterogeneous computing. Background technique [0002] GPU (Graphic Processing Unit, graphics processor), which is mainly used for various parallel calculations, such as the calculations required for drawing computer graphics, including vertex settings, light and shadow, pixel operations, etc. In the GPU, most of the transistors are used to implement calculations, and only a few transistors are used for logic control, cache, etc., so that the GPU has strong computing capabilities and is very suitable for processing simple logic but huge calculations. Tasks, such as deep learning tasks, commonly use GPU devices to calculate the parameters of each layer of neural network models. In order to cope with increasingly large training data or increasingly complex computing tasks, more and more GPUs need to work together to complete computing ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50G06F15/80
CPCG06F9/5066G06F15/8038
Inventor 吴婷张致江王智国于振华胡国平胡郁
Owner 安徽智慧皆成数字技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products