Unlock instant, AI-driven research and patent intelligence for your innovation.

Stepwise convolution neural network pruning compression method

A convolutional neural network and compression method technology, applied in the field of stepwise convolutional neural network pruning and compression, can solve the problem of limited computing resources and storage resources, unsatisfactory compression efficiency, and does not take into account the complex changes of neurons, etc. problem, to achieve the effect of improving training speed and reducing accuracy.

Inactive Publication Date: 2018-03-09
深圳互连科技有限公司
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, for embedded terminal devices, due to their limited computing resources and storage resources, most CNN networks such as VGG19 / VGG16, ResNet, GoogLeNet, etc. can only Deployed on high-end cloud platforms supported by GPU or FPGA
In 2016, ICLR's bestpaper proposed a compression pruning scheme (Deep Compression), but the algorithm's estimation of invalid neurons simply set a threshold for the absolute value of neurons, and did not take into account the training process. Complex changes in neurons, so the compression efficiency is still not satisfactory

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Stepwise convolution neural network pruning compression method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention.

[0015] The invention provides a step-by-step convolutional neural network pruning compression method, the working principle of which is to perform pruning operations on neurons that are not important at the current moment in the convolutional neural network, judge after multiple iterations, and recover The neurons that are important at the current moment, pruning the neurons that are not important at the current moment, and repeating the operation, achieve the purpose of effective compression and fast calculation of the convolutional neural network.

[0016] The present invention will be described in further detail below in conjunction with examples and specific implementation methods.

[0017] Such as figure 1 As shown, a stepwise convolutional neural network pruning compre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a stepwise convolution neural network pruning compression method, which utilizes GPU to calculate the L1 norm and variance of each layer neuron in a convolution neural networkmodel; the importance of each neuron at the moment is estimated according to the L1 norm and variance of each layer neuron; as for the unimportant neuron at the moment, it is compulsorily set to 0, and then the backpropagation error item in the same location is also set to 0; the convolution neural network is recalculated and evaluated after being trained by m iterations training; all the active neurons after being evaluated enter into the next round of the m iterations training; the above process is repeated n times, so that the learning rate will be attenuated step by step. The unimportant neurons in the convolution neural network are pruned at the moment, and then are judged after iterating again and again. The important neurons in the current moment are restored, and the unimportant neurons at present are pruned. The operation is repeated to achieve the purpose of effective compression and fast calculation of the convolution neural network.

Description

technical field [0001] The invention relates to the technical field of convolutional neural network pruning compression, in particular to a step-by-step convolutional neural network pruning compression method. Background technique [0002] Convolutional Neural Networks (CNN) is an efficient image recognition and classification method deep learning architecture that has been developed rapidly since 2012 and has attracted widespread attention. This architecture effectively improves the ability of image classification and object recognition. Compared with the traditional method of manually extracting features, CNN can directly input the original image, avoiding complicated preprocessing, and has higher recognition accuracy; compared with the traditional Back Propagation (BP) neural network, CNN Due to the use of the sharing strategy, the number of neurons is greatly reduced, thus avoiding the phenomenon of over-fitting to a certain extent. [0003] However, for embedded termin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045
Inventor 牟星
Owner 深圳互连科技有限公司