Unlock instant, AI-driven research and patent intelligence for your innovation.

Sparse convolutional neural network accelerator and calculation method

A convolutional neural network and accelerator technology, applied in the field of sparse convolutional neural network accelerators and computing, can solve the problems of different activation input sparsity, reduced parallel efficiency, increased resource overhead, etc., to ensure parallel computing efficiency, performance and Resource Efficiency, Improved Adaptability and Utilization Effects

Active Publication Date: 2020-09-04
INST OF SEMICONDUCTORS - CHINESE ACAD OF SCI +1
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the use of activation sparsity requires additional logic to judge or process activation inputs, which will increase additional resource overhead; in addition, different activation input sparsities are different, and queue lengths are different. In parallel computing, waiting for the longest queue will cause Reduce parallel efficiency; and, there is an upper limit to the way to share activation input and expand output feature maps using tiling. Excessive expansion and output feature map directions will lead to reduced parallel efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sparse convolutional neural network accelerator and calculation method
  • Sparse convolutional neural network accelerator and calculation method
  • Sparse convolutional neural network accelerator and calculation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to make the purpose, technical solutions and advantages of the present disclosure clearer, the present disclosure will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0028] The first embodiment of the present disclosure shows the structure of a sparse convolutional neural network accelerator. figure 1 A structural block diagram of a sparse convolutional neural network accelerator provided by an embodiment of the present disclosure is schematically shown. figure 2 A structural block diagram of a convolution calculation module provided by an embodiment of the present disclosure is schematically shown. image 3 A structural block diagram of a computing unit provided by an embodiment of the present disclosure is schematically shown. to combine figure 2 and image 3 ,right figure 1 The structure shown is described in detail.

[0029] Such as figure 1 As shown, the sparse convol...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a sparse convolutional neural network accelerator and a calculation method. The accelerator comprises an accelerator body, the convolution calculation module is used for carrying out multiply-add processing on the input feature map and generating an intermediate result, the convolution calculation module is composed of nine calculation units, each calculation unit is composed of one or more multiply-add devices, the multiply-add devices in the same calculation unit have the same activation input, and the nine calculation units are provided with an additional activationinput; a non-linear and pooling module which is used for carrying out non-linear calculation and pooling calculation on the intermediate result to generate an output feature map; and a full connectionlayer calculation module which is used for carrying out full connection calculation on the output feature map to generate a final result. Different working modes are obtained through combination fora plurality of convolution calculation modules, so that the activation sparsity can be effectively utilized to accelerate the convolution neural network calculation, and meanwhile, relatively low additional resource overhead and relatively low load unbalance are generated.

Description

technical field [0001] The present disclosure relates to the field of deep learning, in particular, to a sparse convolutional neural network accelerator and a calculation method. Background technique [0002] In recent years, due to the acquisition of massive data in the era of big data and the significant improvement of computer performance, deep learning algorithms represented by convolutional neural networks have shown great advantages in many fields. However, like the typical classification network VGG-16, it requires 15.5G multiplication and addition operations and 138M parameters. The huge amount of calculations and parameters make the practical application of convolutional neural networks difficult. Experiments show that convolutional neural networks have inherent sparsity, and effective use of sparsity can greatly improve computing performance by reducing the amount of calculations. However, the existing design of convolutional neural network accelerators ignores the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/06
CPCG06N3/06Y02D10/00
Inventor 余成宇李志远毛文宇鲁华祥边昳
Owner INST OF SEMICONDUCTORS - CHINESE ACAD OF SCI
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More