Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Reconfigurable CNN High Concurrency Convolution Accelerator

An accelerator and convolution technology, applied in neural architecture, biological neural network models, etc., can solve problems such as complex configuration, low efficiency, and high power consumption, and achieve the effects of reducing occupied resources, simplifying control parts, and improving parallelism

Active Publication Date: 2021-10-26
NANJING UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, the acceleration of neural networks is highly dependent on Nvidia's GPU accelerator card, and its shortcomings of high power consumption and low efficiency limit its application scenarios
For some dedicated neural network accelerators, the utilization rate of convolution computing resources is not high, and the configuration is complicated

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Reconfigurable CNN High Concurrency Convolution Accelerator
  • A Reconfigurable CNN High Concurrency Convolution Accelerator
  • A Reconfigurable CNN High Concurrency Convolution Accelerator

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be described in detail below in conjunction with the accompanying drawings and specific implementation cases.

[0032] Such as figure 1 , the reconfigurable convolution accelerator is mainly composed of a main controller, a feature map address generation unit, a weight address generation unit, a reconfigurable computing unit, a result address generation unit and a storage exchange unit. The main controller is responsible for receiving operation configuration information, including feature map size, number of feature map channels, convolution kernel size, convolution kernel channel number, output result size, output result channel number, convolution stride and convolution mode, receiving start signal to start each sub-module, internally calculate the index value of each convolution operation cycle according to the configuration information, according to the index value and convolution mode, the feature map address generation unit and the weigh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a reconfigurable CNN high-concurrency convolution accelerator, including: a weight address generation unit, which generates the address of the convolution kernel data in the cache; a result address generation unit, which generates the address of the result data in the cache; The computing unit can reconstruct the computing array into two kinds of multiply-accumulate tree circuits with different granularities; the feature map address generation unit can generate the address of the feature map data in the cache; the main controller can generate and reset the accumulator synchronously with the address Signal, gates the corresponding circuit in the reconfigurable computing unit, and generates an interrupt signal for the end of the entire operation; the storage exchange unit converts the effective feature map read address and weight read address into a read operation for the storage unit, and converts the effective result Writing addresses and data translates to write operations to memory cells. Beneficial effects: the control part is simplified, the parallelism of multi-channel two-dimensional convolution operation and the efficiency of storage access are greatly improved, and the occupied resources are reduced.

Description

technical field [0001] The invention relates to a hardware architecture for accelerating convolution operations, in particular to a reconfigurable CNN high-concurrency convolution accelerator. Background technique [0002] On the one hand, with the continuous improvement of semiconductor process technology, the computing performance of the processor has been further improved, on the other hand, the explosive development of the mobile Internet, the massive data generated can be easily obtained. Under this background, the neural network has obtained a new development, especially in the fields of image recognition, speech recognition and other key breakthroughs. The convolutional layer in the convolutional neural network belongs to the multi-channel two-dimensional convolution operation, and the size of the input feature map is set to S f ×S f ×C f , the convolution kernel size is S k ×S k ×C f ×C k , the output size is S o ×S o ×C o , the convolution stride is S. th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04
CPCG06N3/045
Inventor 李丽鲍贤亮李宏炜丰帆李伟
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products