An Adaptive Asymmetric Quantized Compression Method for Deep Neural Network Models
Patent Information
- Authority / Receiving Office: CN · China
- Patent Type: Patents (China)
- Current Assignee / Owner: BEIJING UNIV OF TECH
- Publication Date: 2020-11-24
Abstract
Description
Technical field
[0001] The invention relates to the technical field of deep neural network model compression, and in particular to an adaptive asymmetric quantization method for compressing deep neural network models.
Background art
[0002] In recent years, deep learning has gradually replaced traditional machine learning in everyday applications. Deep neural networks have achieved good results in a range of machine learning tasks such as speech recognition, image classification, and machine translation. However, because of their deep layered structure, classic deep neural network models require computation over millions of floating-point parameters, which makes it difficult to deploy most networks on mobile and embedded devices while maintaining good processing performance. How to maximize the compression of neural network parameters while preserving the recognition performance of the original network is gradually becoming an important research directi...