Method for quantizing weight by channels
A technology of quantizing weights and splitting channels, which is applied in the field of neural network acceleration, can solve the problems of reducing model accuracy and insufficient utilization of low-bit data, and achieve the effects of improving utilization, increasing convergence speed, and fully utilizing
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0032] In order to understand the technical content and advantages of the present invention more clearly, the present invention will be further described in detail in conjunction with the accompanying drawings.
[0033] Such as figure 1 As shown, a method for sub-channel quantization weights of the present invention specifically includes the following steps:
[0034] S1, convolutional neural network training: train the model with a full-precision algorithm. The full-precision algorithm is an image classification algorithm based on Resnet-50 as a neural network structure to obtain a network for target classification, that is, to obtain relevant parameters in the model reasoning process. , the relevant parameters include the weight of the convolution, the bias of the BiasAdd operator, the gamma, beta, mean and variance of the BatchNormal operator;
[0035] S2, fine-tuning the quantized model:
[0036] S2.1, for the model obtained from S1, quantify the weight according to the r...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com