Deep neural network model compression method based on pruning threshold automatic search

A deep neural network and network model technology, which is applied in the field of deep neural network model compression based on automatic search of pruning thresholds, can solve the problems of difficult recovery of model accuracy and difficulty in achieving a good balance between model accuracy and model compression rate. , to achieve the effect of good model compression effect, good adaptability, and the process of reducing the pruning rate

Pending Publication Date: 2020-01-10
UNIV OF ELECTRONIC SCI & TECH OF CHINA
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are two problems here. One problem is that the definition of the pruning rate is artificially specified, rather than the model is automatically found. There may be a better value for the model pruning t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network model compression method based on pruning threshold automatic search
  • Deep neural network model compression method based on pruning threshold automatic search
  • Deep neural network model compression method based on pruning threshold automatic search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the implementation methods and accompanying drawings.

[0032] see figure 1 , the deep neural network model compression method based on pruning threshold automatic search of the present invention comprises the following steps:

[0033] S1: Perform model training on the original network model to be compressed to obtain the initial model for pruning;

[0034] S2: Interval threshold search, performing adaptive grid search on model parameters to obtain the first pruning threshold;

[0035] S3: Pruning threshold search optimization, combined with the binary search method to further search the threshold interval corresponding to the first pruning threshold, find a better threshold value, and obtain the second pruning threshold;

[0036] Among them, the threshold interval corresponding to the fir...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep neural network model compression method based on pruning threshold automatic search, and belongs to the field of deep neural network model compression. The method comprises the following steps: training a model to obtain an initial model for pruning; performing adaptive grid search on the model parameters to obtain a first pruning threshold; further searching for a threshold interval corresponding to the first pruning threshold by combining a binary search method, and searching for a better threshold to obtain a second pruning threshold; performing iterative pruning processing on the original network model based on the second pruning threshold; and carrying out sparse storage on the pruned model to obtain an available compressed network model. According to the deep neural network model compression method based on pruning threshold automatic search, an existing main deep neural network model can be compressed, the technical problem that the deep neural network model cannot be deployed to embedded equipment due to the fact that the model is large is solved, and the application range of the deep neural network model is expanded.

Description

technical field [0001] The invention belongs to the field of deep neural network model compression, in particular to a deep neural network model compression method based on pruning threshold automatic search. Background technique [0002] The development of deep learning has made deep neural networks more and more applied to computer vision tasks such as image recognition, detection and tracking, and more and more network models tend to be designed in a wider and deeper direction. The success of deep learning largely depends on a large number of parameters of the model and computing equipment with powerful performance. However, due to the huge memory requirements and computational consumption of deep neural networks, it is difficult to deploy them on low-storage, low-power hardware platforms (such as mobile devices), which greatly limits their applications. Therefore, it is an important issue to study how to effectively compress the neural network model while keeping the pe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045
Inventor 刘欣刚钟鲁豪朱超王文涵吴立帅代成
Owner UNIV OF ELECTRONIC SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products