Dynamic mixing precision model construction method and system
A construction method and dynamic mixing technology, applied in the field of deep neural network, can solve problems such as limited convertible states, application of difficult bit scenarios, restrictions on the promotion and use of adaptive quantization models, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0042] Such as Figure 1-4 As shown, the dynamic mixed precision model construction method described in the present invention is specifically as follows:
[0043] After preprocessing the original data such as zero padding, random cropping, and random flipping, 8-bit quantization is performed to obtain the input data.
[0044] Use cross entropy as loss function and SGD as optimizer.
[0045] To train a full-precision model, first set the parameter matrix w i Zoom to [-1,+1] interval, the formula is Then perform forward propagation, and update the parameter w during backpropagation i . In this embodiment, weights are used as the parameters, and the parameter matrix is the weight matrix.
[0046]Select part of the training data, and calculate the approximate value of the trace of the Hessian matrix of each block parameter of the full-precision model, specifically, including the following sub-steps:
[0047] (1) Randomly sample 2000 pieces of training data, and input th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com