Unlock instant, AI-driven research and patent intelligence for your innovation.

Winograd convolution operation acceleration method and acceleration module

A convolution operation and acceleration module technology, applied in the field of convolutional neural network computing, can solve problems such as incompatibility with convolution operations, and achieve the effect of reducing computational complexity

Pending Publication Date: 2021-08-20
XI AN JIAOTONG UNIV
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Aiming at the unbalanced design status of FPGA on-chip resource utilization, the inventor's research group once proposed a multiplication-free convolution acceleration scheme (MF-Conv), which can eliminate the multiplication operation of the convolution operation in the filter_loop. However, the hardware structure of this scheme It is determined by the size of the convolution kernel. For example, the 3×3 MF-Conv hardware acceleration unit cannot be compatible with the 5×5 convolution operation.
As can be seen from Table 2, the current design of CNN presents a trend of diversification in the size of the convolution kernel, so MF-Conv has certain limitations.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Winograd convolution operation acceleration method and acceleration module
  • Winograd convolution operation acceleration method and acceleration module
  • Winograd convolution operation acceleration method and acceleration module

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The present invention will be further described in detail below in conjunction with specific embodiments, which are explanations of the present invention rather than limitations.

[0057] The Winograd F (2 × 2, 3 × 3) convolution acceleration method based on bit precision weight splitting proposed by the present invention is introduced as follows:

[0058] Express the convolution process of two-dimensional Winograd as a matrix form:

[0059] Y=A T [(GgG T )⊙(B T dB)]A (1)

[0060] In the formula, g represents the convolution kernel matrix, and d represents the input matrix.

[0061] Through the stride-based convolution kernel splitting method (SCDM), all convolution windows are decomposed or filled into a 3×3 format. For convolution operations with non-3×3 shapes, use the step-based convolution kernel splitting method to split or fill the input matrix into a 4×4 input matrix, and split or fill the convolution kernel matrix into a 3×3 The convolution kernel matrix;...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a Winograd convolution operation acceleration method and an acceleration module, and the method comprises the steps: splitting or filling an input matrix into a 4 * 4 input matrix and splitting or filling a convolution kernel matrix into a 3 * 3 convolution kernel matrix for the convolution operation of a non-3 * 3 shape through employing a convolution kernel splitting method based on a step length; respectively carrying out Winograd transformation on the 3 * 3 convolution kernel matrix and the 4 * 4 input matrix by using the convolution kernel transformation matrix G and the input matrix transformation matrix BT to obtain a transformed convolution kernel matrix U and a transformed input matrix V; performing weight splitting on elements u[xi]v in the transformed convolution kernel matrix U according to bit-level precision, and performing accumulation operation and shift operation to obtain a matrix Z; and performing Winograd transformation on the matrix Z to obtain an output matrix of convolution operation. According to the method, on-chip resources can be reasonably utilized, the calculation complexity is reduced, and convolution operations of most sizes can be compatible.

Description

technical field [0001] The invention relates to convolutional neural network calculations, in particular to a Winograd convolutional operation acceleration method and an acceleration module. Background technique [0002] Convolutional neural network (CNN) has been widely used in image classification and speech recognition. As the scale of applied data increases, the computational complexity of its network model is also increasing. In recent years, including automatic driving Many applications including CNN put forward higher real-time requirements. According to statistics, convolution calculation accounts for 99% of the total calculation volume of CNN. Therefore, accelerating the operation of convolution layer is the key to improving the calculation speed of convolutional neural network. [0003] High flexibility, low cost, and short design cycle make FPGA-based CNN accelerator designs more suitable for deployment in mobile devices. Depending on the type of convolution alg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08G06F15/78
CPCG06N3/08G06F15/7807G06N3/045
Inventor 杨晨吕娴娴范世全耿莉
Owner XI AN JIAOTONG UNIV