Convolutional neural network accelerator based on feature map sparsity

A convolutional neural network and feature map technology, applied in the field of convolutional neural network accelerators based on the sparsity of feature maps, can solve problems such as hindering the application deployment of CNN algorithms, not using 0 elements, and consuming large lookup table resources.

Active Publication Date: 2021-07-06
SOUTH CHINA UNIV OF TECH
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, a large amount of data movement and computational complexity in the algorithm pose a huge challenge to the power consumption and performance of the terminal device, which hinders the application deployment of the CNN algorithm in the fields of smart phones, smart cars, and smart homes.
[0003] At present, there have been many methods for hardware acceleration of CNN algorithms, and the design is very good in terms of flexibility and multiplier utilization efficiency, but these designs cannot break through the computational power requirements of the convolutional layer, or guide the model in Sparse the weights during training, then use the sparsity of the weights for calculations, and then sparsely guide the model requires additional training time, which is not conducive to the direct deployment of the model
[0004] In the paper "An Efficient Hardware Accelerator for Structured SparseConvolutional Neural Networks on FPGAs", the use of the 0 element of the weight is realized through a large-scale lookup table, but the method used in the paper needs to consume a lot of lookup table resources on the one hand, and on the other hand requires In the training phase of the neural network model, additionally guide the generation of 0 elements of the model weight, which is not conducive to the direct deployment of the model
At the same time, in the current mainstream neural network models, a large number of Relu activation functions (f(x)=MAX(0,x)) are used, which makes a large number of 0 elements appear in the feature map. The current method does not use these 0 elements.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network accelerator based on feature map sparsity
  • Convolutional neural network accelerator based on feature map sparsity
  • Convolutional neural network accelerator based on feature map sparsity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0045] A convolutional neural network accelerator based on feature map sparsity, such as figure 1 As shown, including input feature map encoding module, weight encoding module, data flow control module, sparse matrix calculation module and bus;

[0046] The bus is respectively connected to the data flow control module, the input feature map encoding module, the weight encoding module and the sparse matrix calculation module; the input feature map encoding module encodes the feature map according to the 0 elements in the feature map that do not contribute to the calculation; the weight encoding module according to the input The encoding information of the feature map encoding module provides the corresponding weight data for the sparse matrix calculation module; the data flow control module controls the working mode of other modules according to the register information; the sparse matrix calculation module uses the data provided by the input feature map encoding module and the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a convolutional neural network accelerator based on feature map sparsity. The convolutional neural network accelerator comprises an input feature map coding module, a weight coding module, a data flow control module, a sparse matrix calculation module and a bus. The bus is respectively connected with the data flow control module, the input feature map coding module, the weight coding module and the sparse matrix calculation module; the input feature map encoding module encodes the feature map according to the 0 element in the feature map; the weight coding module provides corresponding weight data for the sparse matrix calculation module according to coding information input into the feature map coding module; the data flow control module controls the working modes of other modules according to the register information; and the sparse matrix calculation module performs convolution calculation by using data provided by the input feature map coding module and the weight coding module. According to the method, the accelerator can be switched to use the sparsity in the weight, the method can be flexibly applied, and the sparse weight is supported.

Description

technical field [0001] The invention relates to the field of convolutional neural network hardware accelerators, and belongs to the technical field of integrated circuit hardware acceleration, in particular to a convolutional neural network accelerator based on feature map sparsity. Background technique [0002] In recent years, artificial intelligence technology has developed rapidly, and deep neural networks have made major breakthroughs in natural language processing and computer vision. With the popularization of mobile devices and the emergence of the Internet of Things, the demand for deploying neural networks on mobile devices or Internet of Things devices is increasing day by day. However, the large amount of data movement and computational complexity in the algorithm pose a huge challenge to the power consumption and performance of terminal equipment, hindering the deployment of CNN algorithms in the fields of smart phones, smart cars, and smart homes. [0003] At ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/063
CPCG06N3/063G06N3/045Y02D10/00
Inventor 秦华标李嘉鑫
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products