Adaptive quantization method adaptive to neural network accelerator running on FPGA
An adaptive quantization and neural network technology, applied in the field of adaptive quantization and neural network accelerators, can solve problems such as the inability to ensure the correctness of calculation results and the overflow problem, and achieve easy deployment and implementation, save storage space and computing resources, The effect of ensuring correctness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0030] Taking a neural network model with only one Convolution layer as an example, the present invention will be further described in detail.
[0031] (1) If the invention is not adopted, that is, the overflow problem that may occur during the integer data operation process is not considered when deployed on the FPGA, then the calculation process and results are deduced as follows:
[0032] The parameters of the Convolution layer in this neural network model are pad=0, stride=1, the input size is 1×3×3, and its value is The filter size is 2×2 and its value is The bias is 1.213093. The process of deploying the neural network model on the neural network accelerator is as follows:
[0033] (a) The quantization coefficient is pre-calculated by the quantization software
[0034] Take the quantization bit width as 8 as an example. The quantization parameter th is obtained by the quantization software in = 11.32375,th w =8.134685,th iout = 153.37865. The quantization coeff...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


