Winograd convolution splitting method for convolutional neural network accelerator

A convolutional neural network and convolutional neural network technology, applied in the field of Winograd convolution splitting, can solve the problems of reduced utilization of accelerator computing units, increased accelerator resource consumption and power consumption, and reduced accelerator performance. And the effect of flexibility improvement, reduction of introduction, good flexibility

Active Publication Date: 2019-12-03
XI AN JIAOTONG UNIV
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In order to expand the flexibility of the accelerator, Winograd computing units with various parameters need to be designed, which increases the resource consumption and power consumption of the accelerator
Secondly, the accelerator obtains data streams of different shapes from the Winograd operation unit with different parameters, which reduces the utilization rate of the accelerator operation unit and seriously reduces the performance of the accelerator.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Winograd convolution splitting method for convolutional neural network accelerator
  • Winograd convolution splitting method for convolutional neural network accelerator
  • Winograd convolution splitting method for convolutional neural network accelerator

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0098] The present invention can be implemented in a convolutional neural network accelerator PE array.

[0099] For most current convolutional neural network accelerators using the Winograd algorithm, if the present invention needs to be used, two parts of optimization need to be performed. The first is the optimization of the conversion module. Traditional accelerators will design a variety of Winograd conversion modules with different parameters to support multiple convolution shapes. But adopting the name of the present invention only needs a conversion module to support any kind of convolution shape. Such as Figure 5 and 6 As shown, the conversion module supports the conversion of W=4, R=2 and W=4, R=3 through resource multiplexing. On this basis, the number of multiplications introduced by the convolution splitting algorithm is compared with the traditional convolution ( Step size S=1) as shown in Table 3. The second is the optimization of the PE array. In this exam...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Winograd convolution splitting method for a convolutional neural network accelerator. The method comprises the following steps: 1) reading an input and a convolution kernel of any size from a cache of the convolutional neural network accelerator; 2) judging whether convolution splitting is carried out or not according to the convolution kernel size and the input size, andif convolution splitting needs to be carried out, carrying out the next step; 3) splitting the convolution kernel according to the size and the step length of the convolution kernel, and splitting the input according to the size and the step length of the input; 4) combining and zero-filling the split elements according to the size of the convolution kernel, and combining and zero-filling the split elements according to the input size; 5) performing Winograd convolution on each pair of split input and convolution kernels; 6) accumulating the Winograd convolution results of each input and convolution kernel, and 7) storing the accumulation results in a cache of the convolutional neural network accelerator, so that the convolutional neural network accelerator can support convolution of various different shapes by adopting one Winograd acceleration unit.

Description

technical field [0001] The invention belongs to the field of convolutional neural network algorithms, in particular to a Winograd convolution splitting method for convolutional neural network accelerators. Background technique [0002] Convolutional neural network (CNN) is being widely used in computer vision tasks such as object detection and image classification. However, with the continuous development of network models and the continuous improvement of recognition accuracy, it brings a huge amount of computation and data. Therefore, hardware devices with high performance and low power consumption are required, and at the same time, the flexibility of hardware devices must be guaranteed to meet various network models. [0003] Convolutional neural network accelerators are widely used to accelerate convolutional neural network algorithms on mobile and server. In order to improve its performance, the Winograd algorithm is used to reduce the hardware multiplier introduced b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04
CPCG06N3/045Y02D10/00
Inventor 杨晨王逸洲王小力耿莉
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products