Unlock instant, AI-driven research and patent intelligence for your innovation.

Neural network lightweight deployment method based on three-objective joint optimization

A joint optimization and neural network technology, applied in the field of neural network compression, can solve the problems of adjusting the optimal value of FLOPs and poor generalization, so as to improve the accuracy of the model and solve the problem of not being able to balance and taking into account the accuracy of the model.

Active Publication Date: 2022-06-03
江苏省现代企业信息化应用支撑软件工程技术研发中心
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, most of the current pruning algorithms are structured pruning of network models, but structured pruning needs to set hyperparameters for each layer, and when pruning, the network needs to iterate multiple times to converge
Non-Patent Document 1 (Shangqian Gao, Feihu Huang, Jian Pei, and Heng Huang. Discrete modelcompression with resource constraint for deep neural networks. In Proceedings of the IEEE / CVF Conference on Computer Vision and Pattern Recognition , pages1899–1908, 2020.) Using discrete particle swarm optimization and differentiable discrete gates to search for the best precision subnetwork under a given FLOPs budget, its disadvantage is that it searches for the solution with the highest precision under a given FLOPs budget, which is a For single-objective optimization, since FLOPs are fixed, the optimal value of FLOPs cannot be adjusted during optimization, which leads to poor generalization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network lightweight deployment method based on three-objective joint optimization
  • Neural network lightweight deployment method based on three-objective joint optimization
  • Neural network lightweight deployment method based on three-objective joint optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020]The present invention will be further described below with reference to the accompanying drawings and specific embodiments, so that those skilled in the art can better understand the present invention and implement it, but the embodiments are not intended to limit the present invention.

[0021] Problem description: The input objective function is as follows:

in, m is the number of objective functions, n is the number of model channels.

[0022] In the present invention, m =3, three objective functions They are the model floating point number FLOPs, the model parameter quantity and the model precision. The model precision is determined according to the monitoring task requirements, and the model floating point number FLOPs and model parameter quantity are based on the computing power of the terminal device (terminal device) and the power consumption per unit time that can be tolerated. to make sure. Therefore, the above formula (1) can be written as:

in, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a neural network lightweight deployment method based on three-objective joint optimization, and the method comprises the steps: mounting a terminal device on an edge side device which is provided with a calculation unit and a connection contact capable of achieving data transmission, inputting an initial network model M and an objective function into the calculation unit, pruning the initial network model M to obtain an optimized optimal network model M '; transmitting the optimal network model M'to a terminal device through a connection contact; acquiring an image sequence of the to-be-recognized target in real time by using a camera device on the terminal equipment; and inputting the image sequence into the optimal network model M ', and outputting an identification result. According to the method, the pruned and optimized lightweight network model is deployed on the terminal equipment in a low-calculation-power or low-energy-consumption-required scene, so that lightweight operation of the terminal equipment is realized, and the limitation that a large-scale neural network model cannot be deployed on the terminal equipment with limited performance and electric quantity is overcome.

Description

technical field [0001] The invention relates to the technical field of neural network compression in edge intelligence scenarios, in particular to a lightweight deployment method of neural networks based on three-objective joint optimization. Background technique [0002] At present, Convolutional Neural Networks (CNN) have achieved great success in tasks such as computer vision and natural language processing, and have been applied in many practical applications. For example, UAV equipment, etc., UAV equipment can be used to perform monitoring tasks such as fire rescue, cyanobacteria outbreak, and watershed pollution. When the UAV equipment performs tasks, it needs to capture the target image through the camera, and input the target image into the convolutional neural network for target recognition. In order to improve the accuracy of target recognition, the layers of this type of convolutional neural network are deeper and the structure is more and more complex. Such a m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06V20/10
CPCG06N3/082
Inventor 张量方立刚吴尘鲜学丰周亚峰董虎胜
Owner 江苏省现代企业信息化应用支撑软件工程技术研发中心