Neural network quantification method
A technology of neural network and quantitative method, applied in the direction of neural learning method, biological neural network model, etc., can solve the problems of full-precision network error, full-precision weight value error, neural network precision loss, etc., to improve accuracy and expression Ability, the effect of speeding up training
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Examples
Embodiment 1
[0027] A neural network quantification method specifically comprises the following steps:
[0028] S1, the forward propagation stage of neural network training;
[0029] S2, the backpropagation stage of neural network training;
[0030] S3. S1 and S2 are repeated until the neural network converges, and the quantization of the deep neural network is completed.
[0031] In S1, before neural network training, an array of quantized weight values with low bits as indexes is initialized as a full-precision quantized value storage model.
[0032] In S1, the specific process is: first quantize the weight value of the current layer network with a quantization function, then calculate the output of the current layer network, store the weight value matrix with a low-bit index value to refer to the corresponding full-precision quantization value, and calculate the first layer of the neural network Loop through to the last layer.
[0033] In S2, the following steps are specifically in...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More