Convolution hardware acceleration method and convolution hardware acceleration circuit

A hardware acceleration and convolution technology, applied in the field of convolutional neural network and chip design, to achieve the effect of reducing repeated reading, improving utilization, and avoiding repeated reading

Active Publication Date: 2022-03-11
저장진셍일렉트로닉스테크놀러지컴퍼니리미티드
View PDF9 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the design of a dedicated chip for CNN convolution calculations will face two problems. One is how to increase the parallelism of operations to improve computing efficiency, and the other is how to reduce the read and write bandwidth of the memory to save computing power consumption.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolution hardware acceleration method and convolution hardware acceleration circuit
  • Convolution hardware acceleration method and convolution hardware acceleration circuit
  • Convolution hardware acceleration method and convolution hardware acceleration circuit

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] figure 1 A schematic diagram of the convolution operation. The color coding methods of digital images include RGB, YUV, YCbCr and so on. Taking RGB color coding as an example, each pixel in a digital image can be composed of red sub-pixels, green sub-pixels and blue sub-pixels; that is, if the resolution or resolution of a digital image is W*H square pixels, then the digital image can be represented by C two-dimensional matrices. For example, a digital image can be composed of multiple feature maps IN1, IN2, and IN3, and the total data volume of the digital image is W*H*C, where W is the width of the feature map, and H is the height of the feature map , C is the dimension (or number of channels).

[0033] Assuming that a digital image needs to be convolved with a set of convolution kernels (kernel), similarly, each of a set of convolution kernels can be represented by C two-dimensional matrices, then a set of convolution kernels The total amount of data is w*h*C*k, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a convolution hardware acceleration method and a convolution hardware acceleration circuit. The convolution hardware acceleration method comprises the following steps: segmenting a feature map into a plurality of map blocks; during segmentation, the column width of the segmented image blocks is matched with the number of the MAC operation units, and the row height of the segmented image blocks is the maximum row number which can be accommodated by the internal storage module; reading data required by a block and a corresponding convolution kernel for convolution operation in sequence; extracting the extraction data of the image block and the corresponding convolution kernel in sequence; and performing convolution operation and outputting an operation result to an external storage module. By adopting the convolution hardware acceleration method provided by the invention, a smaller internal cache area can be used for supporting feature maps with any resolution to carry out convolution operation; by adopting the block segmentation mode, the utilization efficiency of the MAC operation unit is relatively high, the reading of boundary data in the vertical direction can be reduced as much as possible, and the convolution operation speed is further improved.

Description

technical field [0001] The invention relates to the technical field of convolutional neural network and chip design, in particular to a convolutional hardware acceleration method and a convolutional hardware acceleration circuit. Background technique [0002] Convolution neural network (CNN) is widely used in target recognition, target detection and other application fields. Currently, as the depth of the neural network and the amount of input feature map data increase, the amount of convolution operations continues to increase. The convolution operation of CNN can be performed through a dedicated chip to speed up the calculation speed. However, the design of a dedicated chip for convolution calculation of CNN will face two problems, one is how to increase the parallelism of the operation to improve the calculation efficiency, and the other is how to reduce the read and write bandwidth of the memory to save calculation power consumption. [0003] Therefore, how to provide ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06N3/04
CPCG06N3/063G06N3/045Y02D10/00
Inventor 沈强陆金刚方伟
Owner 저장진셍일렉트로닉스테크놀러지컴퍼니리미티드
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products