Parallel model processing method and device based on multiple graphics processing units

一种处理器、多图形的技术,应用在计算领域,能够解决存在等待、性能不足、没有充分利用总线带宽等问题,达到提升效率的效果

Active Publication Date: 2014-09-10
SHENZHEN TENCENT COMP SYST CO LTD
View PDF4 Cites 76 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The point-to-point data exchange model has serious shortcomings in performance: when more than 2 parallel units are required, more data exchange cycles are required, and there is waiting in each data exchange cycle, and the idle bus bandwidth is not fully utilized.
The existing technology uses a fixed learning rate to update parameters, and the training experiment process involves a lot of manual adjustment of the learning rate and judging the convergence work, which is complicated and cumbersome and low in efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel model processing method and device based on multiple graphics processing units
  • Parallel model processing method and device based on multiple graphics processing units
  • Parallel model processing method and device based on multiple graphics processing units

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to further explain the technical means and effects of the present invention to achieve the intended purpose of the invention, the specific implementation, structure, features and effects of the present invention will be described in detail below in conjunction with the accompanying drawings and preferred embodiments.

[0065] figure 2 A structural block diagram of a computer or server is shown. Such as figure 2 As shown, server 100 includes memory 102 , one or more processors 104 , storage controller 106 , peripherals interface 108 , and one or more GPUs 110 . understandable, figure 2 The shown structure is only for illustration, and does not limit the structure of the server 100 . For example, server 100 may also include figure 2 more or fewer components than shown in, or with figure 2 Different configurations are shown.

[0066] The memory 102 can be used to store software programs and modules, such as program instructions / modules corresponding to...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a parallel model processing method based on multiple graphics processing units (GPUs). The method includes the steps: creating multiple Workers used for respectively controlling multiple Worker Groups in a central processing unit (CPU), wherein each Worker Group comprises the GPUs; binding each Worker with one corresponding GPU; loading one Batch of training data from a nonvolatile memory into a GPU video memory corresponding to one Worker Group; transmitting data, needed by the GPUs for data processing, among the GPUs corresponding to one Worker Group in a Peer to Peer manner; controlling the GPUs to perform data processing in parallel through the Workers. By the method, efficiency of parallel data processing of the GPUs can be improved. Besides, the invention further provides a parallel data processing device.

Description

technical field [0001] The invention relates to the technical field of computing, in particular to a data parallel processing method and device based on multiple graphics processing units (GPUs). Background technique [0002] Deep neural network technology is currently a relatively popular technology in the field of machine learning. There have been successful cases in academia and industry, including speech recognition, image recognition, natural language processing, and advertising recommendation. The article "Large-scale Deep Unsupervised Learning using Graphics Processors" published by Rajat Raina, Anand Madhavan, Andrew Y.Ng et al. at the 26th International Machine Learning Conference (Montreal, Canada, 2009) introduces the use of a single graphics processor ( GPU) training method and system implementation of deep neural network; Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Image Classification (ImageNet Classification with Deep Convolutional Neural Networks)" introduc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06T1/40
CPCG06F9/522G06T1/20G06T2200/28
Inventor 邹永强李毅金涬肖斌郭志懋薛伟陈波李勇肖磊
Owner SHENZHEN TENCENT COMP SYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products