3D CNN acceleration method and system based on Winograd algorithm

An algorithm and graph technology, applied in the field of 3DCNN acceleration method and system based on Winograd algorithm, can solve problems such as 3DCNN acceleration research

Active Publication Date: 2018-05-04
NAT UNIV OF DEFENSE TECH
View PDF5 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] According to this embodiment, the current FPGA-based CNN accelerators are all 2

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • 3D CNN acceleration method and system based on Winograd algorithm
  • 3D CNN acceleration method and system based on Winograd algorithm
  • 3D CNN acceleration method and system based on Winograd algorithm

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0056] Example one:

[0057] Such as figure 1 As shown, the implementation steps of the 3D CNN acceleration method based on Winograd algorithm in this embodiment include:

[0058] 1) Read the feature map sub-block Bin to be transformed from the input feature map in, and read the convolution kernel sub-block Bw from the weight buffer w;

[0059] 2) Perform the 3D Winograd algorithm on the feature map sub-block Bin and the convolution kernel sub-block Bw to output the result Tp 1 ;

[0060] 3) Tp for the output result of the 3D Winograd algorithm 1 Carry out accumulation and output the accumulation result Sum;

[0061] 4) Judge whether all the input feature maps in the input feature map in have been read, if it has not been read, skip to step 1); otherwise, skip to step 5);

[0062] 5) Write the accumulation result Sum back to the output feature map buffer Out.

[0063] In this embodiment, when the feature map sub-block Bin to be transformed is read from the input feature map in in step 1)...

Example Embodiment

[0161] Embodiment two:

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a 3D CNN acceleration method and system based on a Winograd algorithm. The method includes the steps of reading a feature map subblock to be transformed from an input feature map, and reading a convolution kernel subblock from a weight cache, performing the 3D Winograd algorithm on a feature map subblock Bin and the convolution kernel subblock to output results and accumulating output accumulation results, and determining whether all the input feature maps are read; if all the input feature maps are read, writing the accumulation results back to an output feature map cache Out. By extending the Winograd algorithm and using the Winograd algorithm for the 3D CNN calculation, a 2D algorithm is applied to perform CNN acceleration and achieve a good effect, the computational complexity of the CNN algorithm can be effectively reduced, and the computing performance and energy efficiency ratio of a 3D CNN accelerator based on an FPGA are improved.

Description

technical field [0001] The invention relates to 3D CNN (three-dimensional convolutional neural network) acceleration technology, in particular to a Winograd algorithm-based 3D CNN acceleration method and system for an embedded platform. Background technique [0002] With the development of the field of artificial intelligence, three-dimensional convolutional neural network (Three-dimensional Convolutional Neural Network, 3D CNN) has been widely used in many complex computer vision applications, such as video classification, human motion detection and medical image analysis. Different from the traditional two-dimensional convolutional neural network (Two-dimensional Convolutional Neural Network, 2D CNN), 3D CNN can retain the time information in the 3D image during the processing process, so it can achieve comparative results in the field of 3D image recognition and classification. 2D CNN works better. [0003] With the improvement of CNN recognition accuracy, the CNN networ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06T1/20G06N3/04
CPCG06T1/20G06N3/045
Inventor 沈俊忠黄友王泽龙乔寓然陈照云曹壮文梅张春元
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products