Unlock instant, AI-driven research and patent intelligence for your innovation.

Parallel processing device and method for parallel multivalued reduction

A technology of parallel processing and equipment, applied in the direction of electrical digital data processing, processor architecture/configuration, multi-programming device, etc., which can solve problems such as computational difficulties, high throughput, and processing bottlenecks

Active Publication Date: 2021-10-15
GOOGLE LLC
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] However, performing multi-valued reductions usually requires serialization of multiple single-valued reductions
In many practical applications with extremely high throughput requirements, this limit is the processing bottleneck
For example, audio generative neural networks that model raw audio waveforms present significant computational challenges due to the fundamentally high-throughput nature of raw audio generation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel processing device and method for parallel multivalued reduction
  • Parallel processing device and method for parallel multivalued reduction
  • Parallel processing device and method for parallel multivalued reduction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0066] Embodiment 1 is a method for performing parallel M value reduction by a parallel processing unit of a parallel processing device, the method comprising:

[0067] Performing multiple initial reductions in series, each initial reduction operates on data in different corresponding register spaces of at least M register spaces;

[0068] moving data from the M register spaces such that all results from the plurality of initial reductions are in the same first register space; and

[0069] One or more subsequent reductions are performed in parallel to compute the M final values, each subsequent reduction operating only on data in said first register space.

Embodiment 2

[0070] Embodiment 2 is the method of embodiment 1, wherein the parallel processing device is a graphics processing unit.

Embodiment 3

[0071] Embodiment 3 is the method of any one of embodiments 1-2, further comprising: moving each result of the one or more subsequent reductions into a corresponding register of the same parallel processing unit.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Methods, systems and apparatus, including computer programs encoded on computer storage media, for computing multi-valued reductions using parallel processing devices. One of the methods includes performing a parallel M-value reduction by a parallel processing unit of a parallel processing device. A plurality of initial reductions are performed serially, and each initial reduction operates on data in different corresponding register spaces of the at least M register spaces. Data is moved from the M register spaces such that all results from multiple initial reductions are in the same first register space. One or more subsequent reductions are performed in parallel to compute the M final values, each subsequent reduction operating only on data in said first register space.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to U.S. Patent Application No. 15 / 839,637, filed December 12, 2017, the entire contents of which are hereby incorporated by reference. Background technique [0003] This specification relates to techniques for efficiently computing multivalued reductions in parallel using parallel processing hardware. [0004] A reduction is an operation that combines multiple values ​​into a single value. For example, reduction of 8 values ​​can be done by computing a single sum of 8 values. A reduction operation is typically performed by a parallel processing device, such as a graphics processing unit (GPU), to combine data computed by multiple threads executed by multiple independent processing units of the parallel processing device. [0005] The examples described in this specification will generally involve the independent processing unit being a streaming multiprocessor (SM) with multiple process...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/30
CPCG06F9/5066G06F9/3009G06T1/20G06F9/3885G06F9/3001G06F9/30098G06F9/46
Inventor 埃里希·康拉德·埃尔森桑德·艾蒂安·利亚·迪勒曼
Owner GOOGLE LLC