Model compression method and device, electronic equipment and computer storage medium

A model and network model technology, applied in the field of model compression, can solve the problems of teacher network detection performance gap, single-stage application has not been explored, student network performance is not ideal, etc., to achieve the effect of target detection performance improvement

Inactive Publication Date: 2019-05-03
BEIJING SENSETIME TECH DEV CO LTD
View PDF3 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the performance of the distilled student network is not ideal, and there is still a certain gap between the performance of the teacher network and the detection performance of the teacher network.
Moreover, the current distillation learning is based on a two-stage (Two-stage) target detection network, and the application of single-stage (One-stage) target detection has not yet been explored.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model compression method and device, electronic equipment and computer storage medium
  • Model compression method and device, electronic equipment and computer storage medium
  • Model compression method and device, electronic equipment and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present application will be described in further detail below through specific embodiments in conjunction with the accompanying drawings. In the following implementation manners, many details are described for better understanding of the present application. However, those skilled in the art can readily recognize that some of the features can be omitted or replaced by other methods in different situations. In some cases, some operations related to the present application are not shown or described in the specification, in order to avoid the core part of the present application from being overwhelmed by too many descriptions. For those skilled in the art, it is not necessary to describe these related operations in detail, and they can fully understand the related operations based on the description in the specification and general technical knowledge in the field.

[0047] It should be understood that when the terms are used in this specification and the appended cl...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a model compression method and device, electronic equipment and a computer storage medium. The method comprises the steps that training sample data are acquired, and the training sample data comprise label sample data; Respectively training the teacher network model and the student network model by using the training sample data to obtain an adaptive distillation loss function and a focus loss function; and performing back propagation on the student network model according to the adaptive distillation loss function and the focus loss function to obtain a trained studentnetwork.

Description

technical field [0001] The present application relates to the field of artificial intelligence, in particular to a model compression method, device, electronic equipment and computer storage medium. Background technique [0002] In recent years, deep learning networks have achieved great success in object detection applications in the field of computer vision. However, since the deep learning network model often contains a large number of model parameters, the amount of calculation is large, the processing speed is slow, and its application is mostly in the cloud, so it still faces huge challenges in the implementation of the terminal. [0003] In order to reduce the redundancy of the network model, researchers at home and abroad have proposed a distillation learning algorithm. In distillation learning, the training of the student network model is guided by refining or distilling the knowledge of the teacher network with a complex structure to the student network model with ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08
Inventor 唐诗涛冯俐铜旷章辉张伟陈益民
Owner BEIJING SENSETIME TECH DEV CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products