Neural network quantization compression method and system
A neural network, quantization compression technology, applied in the field of neural network computing, can solve the problems of increasing the average code length of Huffman coding, and achieve the effect of increasing input randomness, reducing complexity, and avoiding deadlock
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0067] In order to make the above-mentioned features and effects of the present invention more clearly and comprehensible, the following specific embodiments are given and described in detail in conjunction with the accompanying drawings in the description as follows.
[0068] In order to optimize the neural network compression method, this paper analyzes the distribution characteristics of the data in the neural network model after pruning and quantization, and proposes a lossless compression algorithm that combines entropy coding, run-length coding and all-zero coding. The hardware deployment form has been fully explored, and finally the NNcodec neural network encoding and decoding simulator is designed and implemented. The optimization effect of the hybrid coding on the neural network compression method is proved, and an easy-to-implement hardware design scheme is also given.
[0069] In order to make the above-mentioned features and effects of the present invention more cl...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


