Model compression method and processing equipment based on pruning, weight sharing and coding
A compression method and weight technology, applied in the field of model compression, can solve the problems of poor calculation ability of artificial intelligence models, insufficient model compression, no weight sharing, etc., to reduce the amount of calculation and storage overhead, improve capabilities, and reduce energy consumption. consumption effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0027] Such as figure 1 As shown, the present invention provides a model compression method based on pruning, weight sharing and encoding, which includes the following steps. S100: Through normal training, a model composed of original network weights and neurons is obtained, and the network training method adopts the technology in the existing neural network to obtain the initial model before compression. S200: According to the threshold setting (the specific threshold can be set according to the specific distribution of the original network weights of the model, or the computing power of the mobile terminal, such as the original network weight 0.05 as the threshold), the connection network between neurons in the model Perform pruning processing, that is, remove unnecessary original network weights and corresponding neurons, so that only important weight parameters for the network are retained. Specifically, the original network weights and corresponding neurons that are lower...
Embodiment 2
[0038] A processing device comprising: one or more processors; a memory for storing one or more computer programs, one or more of the processors for executing the one or more computer programs stored in the memory, so that One or more of said processors execute the model compression method based on pruning, weight sharing and encoding of the present invention. The processing device reduces the computational load and storage overhead of the artificial intelligence model through pruning processing, weight sharing, weight smoothing and encoding operations, has strong applicability, and also reduces energy consumption.
[0039] Those skilled in the art can understand that all or part of the features / steps of the above-mentioned method embodiments can be realized by methods, data processing systems or computer programs, and these features can be implemented without hardware, all by software, or by hardware combined with software. The foregoing computer program may be stored in one...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


