Supercharge Your Innovation With Domain-Expert AI Agents!

Transposition convolution and convolution accelerator chip design method based on systolic array

A pulsating array and accelerator technology, which is applied in the calculation using the number system, using non-contact manufacturing equipment for calculation, instruments, etc., to accelerate the training process, improve the acceleration performance, and improve the image processing performance.

Pending Publication Date: 2021-11-26
PEKING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to overcome the problem of "overlap-accumulation" in the transposed convolution accelerator chip and the deficiencies of existing methods, the present invention proposes a method for designing a transposed convolution and convolution accelerator chip based on a systolic array, and develops an artificial intelligence accelerator chip. The purpose is to use the appropriate data flow strategy to directly perform chip-accelerated calculations on the transposed convolution without performing "0-insertion" operations or split operations on the transposed convolution in the chip.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Transposition convolution and convolution accelerator chip design method based on systolic array
  • Transposition convolution and convolution accelerator chip design method based on systolic array
  • Transposition convolution and convolution accelerator chip design method based on systolic array

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Below in conjunction with the accompanying drawings, the present invention is further described through examples, but the scope of the present invention is not limited in any way.

[0030] The present invention provides a systolic array-based chip accelerator architecture that can be used to accelerate transposed convolution / convolution, and a corresponding data flow scheme for mapping transposed convolution / convolution onto the accelerator array. The accelerator array is a two-dimensional array that includes multiple processing units, interconnection networks, storage structures, and input / output modules.

[0031] with (K tc ,S tc ,I tc H ,I t W ,I tc C ,O tc C ) represents a transposed convolutional layer with parameter K tc ,S tc ,I tc H ,I t W ,I tc C ,O tcC Represents the weight kernel size value, step value, input image height, input image width, input channel value and output channel value of the transposed convolution layer, respectively. with...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a transpose convolution and convolution accelerator chip design method based on a systolic array, which comprises the following steps of: designing a chip accelerator array corresponding to different calculation modes and memory access modes of a transpose convolution network model and a convolution network model; and establishing a corresponding data flow strategy to map the transposed convolution model and the convolutional network model into the chip accelerator array, so that the chip acceleration performance is improved, the training process of the generative adversarial network and the convolutional neural network can be accelerated, and the image processing performance is effectively improved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence accelerator chip design, and in particular relates to a systolic array-based transposed convolution and convolution accelerator chip design method, including processing unit design in the chip, interconnection network design, on-chip storage design and convolution / Transpose convolutional dataflow design. Background technique [0002] Currently, transposed convolution and convolution layer model methods have been widely used in various application scenarios of deep learning. Generally, convolutional layers can be used to extract useful information in the image, such as color, contour, texture and other features, and downsample the image. Convolutional layers are widely used in image recognition, machine translation and other scenarios. The transposed convolutional layer can insert valid information into the input image, that is, upsample the input image. Transposed convolutional...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F15/78G06F7/523G06F7/50G06F17/15G06N3/04G06N3/063
CPCG06F15/7807G06N3/063G06F7/50G06F7/523G06F17/153G06N3/045Y02D10/00
Inventor 罗国杰马征征
Owner PEKING UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More