Method and system for compressing sparse weight matrix of convolutional neural network full connection layer
A convolutional neural network and weight matrix technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problem that the compression rate of sparse weight matrix is not very ideal, and achieve the effect of high compression rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0053] see figure 1 , figure 1 Shown is a schematic diagram of the entire fully connected layer sparse weight matrix compression optimization principle. A method for compressing and optimizing a sparse weight matrix of a fully connected layer according to an embodiment of the present invention includes two steps: lossless compression and lossy compression.
[0054] (1) The lossless compression steps are as follows: After unstructured pruning and retraining, the weight matrix of the fully connected layer is a large sparse matrix A. Decompose a large sparse matrix A into a position matrix B and a non-zero value array C; where the size of the position matrix B is the same as the sparse weight matrix A; numerically, the corresponding position of the non-zero value in matrix B in matrix A The value is 1, and the value of other positions is 0.
[0055] In this embodiment, the convolutional neural network model selected for the specific experiment is CaffeNet, and the experimental...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com