Unlock instant, AI-driven research and patent intelligence for your innovation.

Model distillation method and device, electronic equipment and storage medium

A distillation method and distillation device technology, applied in the fields of artificial intelligence and deep learning, can solve problems such as poor overall effect, low distillation efficiency, and waste of parameters, and achieve the effect of no parameter redundancy and improved distillation efficiency and effect

Pending Publication Date: 2020-10-27
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the above first method, since distillation is only performed on the last layer of the neural network, the efficiency of distillation is low, and the overall effect is poor; in the above second method, an additional fully connected layer is used for distillation, and some parameters are wasted , the distillation effect is not ideal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model distillation method and device, electronic equipment and storage medium
  • Model distillation method and device, electronic equipment and storage medium
  • Model distillation method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0023] The following describes the distillation method, device, electronic equipment and storage medium of the model of the embodiment of the present application with reference to the accompanying drawings.

[0024] figure 1 is a schematic diagram according to the first embodiment of the present application. Wherein, it should be noted that the execution subject of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a model distillation method and device, electronic equipment and a storage medium, and relates to the technical field of deep learning. According to the specific implementationscheme, the method comprises steps: firstly, obtaining a teacher model and a student model; then, according to the first data processing amount of the first middle full-connection layer of the teacher model and the second data processing amount of the second middle full-connection layer of the student model, converting the second middle full-connection layer into an amplification full-connectionlayer and a reduction full-connection layer, and replacing the second middle full-connection layer with the amplification full-connection layer and the reduction full-connection layer to generate a training student model; and then carrying out distillation training on the trained student model according to the teacher model. According to the method, the second middle full-connection layer is replaced with the enlargement full-connection layer and the reduction full-connection layer, and distillation training is carried out on the training student model according to the teacher model, so thatdistillation is carried out on the middle layer of the training student model, an extra full-connection layer does not need to be introduced, parameter redundancy is avoided, and the distillation efficiency and effect are greatly improved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, specifically to the technical field of deep learning, and in particular to a model distillation method, device, electronic equipment, and computer-readable storage medium. Background technique [0002] Currently, deep neural network models are widely used in the field of artificial intelligence. Among them, most effective models require complex calculations, and it is difficult to achieve real-time calculations in Internet scenarios. [0003] In related technologies, there are two main ways to solve the above problems by obtaining a small model with a small amount of calculation through distillation calculation of a complex large model. The first way is to distill the large model in the last layer of the neural network. The predicted results of the model are used as soft labels to assist the training of the small model; the second way is to distill in the middle layer o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06N3/082G06N3/04G06N5/022
Inventor 苏炜跃冯仕堃朱志凡李伟彬何径舟黄世维
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD