A deep separable convolutional neural network acceleration method and accelerator

A convolutional neural network and deep convolution technology, applied in the field of depth-separable convolutional neural network acceleration methods and accelerators, to achieve efficient support, reduce access, and reduce power consumption

Active Publication Date: 2021-10-22
PEKING UNIV SHENZHEN GRADUATE SCHOOL
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to provide a depth separable convolutional neural network acceleration method and accelerator, so as to solve the technical problem that the depth separable convolution is not optimized in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A deep separable convolutional neural network acceleration method and accelerator
  • A deep separable convolutional neural network acceleration method and accelerator
  • A deep separable convolutional neural network acceleration method and accelerator

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0040] see figure 1, which shows a depth-separable convolutional neural network acceleration method according to an embodiment of the present invention, including:

[0041] S101. Perform depth convolution on the input neurons. When performing the depth convolution calculation, the same M rows of the C input channel are independently and parallelly calculated in the three-dimensional processing unit PE array, and the same N rows of output neurons of the C channel are obtained. N

[0042] S102. Perform point convolution on the output neurons obtained by the depth convolution. When performing the point convolution calculation, each row of data of the C channel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a deep separable convolutional neural network acceleration method, including: performing deep convolution on the input neuron, and when performing the deep convolution calculation, the same M lines of the C input channel are in the three-dimensional processing unit PE array Independent and parallel computing, to get the output neurons of the same N rows of the C channel, N

Description

technical field [0001] The present invention relates to the technical field of depth separable convolutional neural network, in particular, to an acceleration method and accelerator for depth separable convolutional neural network. Background technique [0002] Convolutional Neural Networks (CNNs) have seen great performance in areas of computer vision such as image classification and object recognition. CNNs are widely used in autonomous vehicles, IoT devices, and robot vision due to their high accuracy. These applications usually require CNNs to work in an environment with constrained hardware resources and low power consumption. This poses a huge challenge because CNN models usually require millions of parameters and computations, therefore, it is important to design lightweight neural networks. In recent years, there has also been increasing interest in developing small and compact CNN models, which will further help reduce computational requirements. Recent CNN model...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04G06N3/063G06F7/50G06F7/523
CPCG06N3/063G06F7/50G06F7/523G06N3/045
Inventor 李肖飞雍珊珊张兴王新安李秋平刘焕双郭朋非高金潇
Owner PEKING UNIV SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products