A dynamically reconfigurable convolution neural network accelerator architecture oriented to the field of the Internet of things
A convolutional neural network and Internet of Things technology, applied in the field of dynamic reconfigurable convolutional neural network accelerator architecture, can solve the problems of inability to apply intelligent mobile terminals, energy efficiency (low performance/power consumption, high power consumption, etc.), and achieve network Simple structure, reduced external memory access, and low power consumption
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0090] For the speed index, the superiority of the present invention comes from the design of the processing unit array and the cache architecture. First, the processing unit adopts the Winograd convolution acceleration algorithm. For example, for 5*5 input data, 3*3 convolution kernel size, and convolution operation with stride 1, traditional convolution requires 81 multiplication operations. It is published that each processing unit only needs to introduce 25 multiplications. In addition, the processing unit array in the convolutional network, the input channel and the output channel are processed with a certain degree of parallelism, which makes the convolution operation faster. On the other hand, the cache architecture has two working modes. In the on-chip working mode, the data generated by the middle layer of the convolutional neural network does not need to be stored off-chip and can be directly sent to the next layer of the network. For lightweight convolutional neural...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


