Supercharge Your Innovation With Domain-Expert AI Agents!

Deep neural network model pruning method, system and device and medium

A deep neural network and network model technology, applied in the field of artificial intelligence, can solve problems such as pruning methods for deep network models, reduce the amount of calculation and parameters, and ensure the effect of network accuracy

Pending Publication Date: 2022-07-12
XIAN MICROELECTRONICS TECH INST
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the technical problems existing in the prior art, the present invention provides a deep neural network model pruning method, system, equipment and medium to solve the problem that most of the existing model pruning methods are aimed at the traditional convolution, and there is no depth-capable Technical issues with pruning methods for deep network models that separate convolutions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network model pruning method, system and device and medium
  • Deep neural network model pruning method, system and device and medium
  • Deep neural network model pruning method, system and device and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0091] In this embodiment, the pruning method of the deep neural network model is described in detail by taking the pruning process of MobileNetv2 as an example.

[0092] as attached image 3 As shown, this embodiment provides a deep neural network model pruning method, including the following steps:

[0093] Step 1. Sparse training of the model MobileNetv2

[0094] According to the training framework encapsulated by the PyTorch deep neural network library, the target detection model MobileNetv2 is sparsely trained on an Nvidia RTX 2070 GPU with 8G memory in an end-to-end manner.

[0095] The optimal penalty factor λ of the regularization loss function for sparse training is 1e-5; and the stochastic gradient method (SGD) is used as the optimizer in the back-propagation process, and its weight decay is set to 5e-4 and the momentum is 0.9.

[0096] At the beginning of training, random weights are used to initialize the weights of the baseline model; the input images are unifor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deep neural network model pruning method, system and device and a medium, and the method comprises the steps: carrying out the sparse training of a to-be-pruned model, and obtaining a sparse model; wherein the to-be-pruned model is a deep neural network model of a depth separable convolution; the depth separable convolution comprises a depth-wise convolution and a point-wise convolution, and the depth separable convolution comprises a depth-wise convolution and a point-wise convolution; on the basis of an importance evaluation result of a weight absolute value of each channel in point-wise convolution, pruning a convolutional layer channel of the sparse model to obtain a pruned network model; and performing fine tuning training on the weight of the pruned network model, and outputting the fine-tuned network model, thereby obtaining the pruned deep neural network model. According to the method, the sparsification of the point-wise convolution weight is realized, the network precision is ensured, and meanwhile, the calculation amount and the parameter amount of the model can be effectively reduced.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and particularly relates to a deep neural network model pruning method, system, equipment and medium. Background technique [0002] At present, artificial intelligence technology with convolutional neural network as the core technology has made a series of breakthroughs, and it is gradually applied to weapon equipment and various types of spacecraft to achieve accurate detection of satellite-based on-orbit targets and intelligent target recognition of missiles. Strike, autonomous obstacle avoidance, mission planning and other applications; but the deep neural network models that the above applications rely on usually have many model parameters and a large amount of calculation; in most practical application scenarios, the computing unit of the neural network model in AI application embedded devices is in The volume and power consumption are often limited, resulting in too many mod...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04G06V10/82
CPCG06N3/082G06N3/045
Inventor 马钟樊一哲毛远宏
Owner XIAN MICROELECTRONICS TECH INST
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More