Unlock instant, AI-driven research and patent intelligence for your innovation.

Model distillation method and device based on network depth compression and medium

A network depth and distillation method technology, which is applied in the field of model distillation based on network depth compression, can solve problems such as difficulty in running mobile terminals or embedded devices, high computational cost of deep model time, and does not consider the improvement of model accuracy, etc., to achieve the improvement effect , The realization method is simple and the effect is improved

Inactive Publication Date: 2022-03-08
ZHEJIANG LAB
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problems existing in the prior art that the deep model has a large time calculation cost, high network complexity, and is difficult to run on mobile terminals or embedded devices, and most of the commonly used knowledge distillation methods do not consider the improvement of model accuracy by network depth, the main Knowledge distillation is carried out on the features or logits of the similar depth of the teacher model and the student model. For this, the present invention proposes a model distillation method, device and medium based on network depth compression. The specific technical scheme is as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model distillation method and device based on network depth compression and medium
  • Model distillation method and device based on network depth compression and medium
  • Model distillation method and device based on network depth compression and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] In order to make the object, technical solution and technical effect of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0033] A model distillation method based on deep compression of the present invention uses cosine distance to calculate the characteristic relationship of different data after passing through the model, constructs a loss function based on the Spearman correlation formula, and combines the characteristic relationship of different layers of the simple model with the last layer of the complex model Match the feature relationship, and build a loss function based on this, guide the data feature relationship of different layers of the simple model to the deepest data feature relationship of the complex model, so that the shallow layer of the simple model can learn deeper feature information, thereby realizing network depth. compression.

[0034] Sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of model compression, in particular to a model distillation method and device based on network depth compression and a medium. According to the method, a cosine distance is used for calculating the characteristic relation of different data passing through a model, and a loss function is constructed based on a Spearman correlation formula; matching the feature relationship of different layers of the simple model with the feature relationship of the last layer of the complex model, constructing a loss function, and guiding the data feature relationship of different layers of the simple model to be close to the data feature relationship of the deepest layer of the complex model, so that the shallow layer of the simple model learns the feature information of the deeper layer; therefore, deep compression of the network is realized. Compared with most existing knowledge distillation methods which are mainly applied to network layers with similar depths of a teacher network and a student network, the method considers the influence of the depth on network improvement, directly carries out distillation learning from a shallow layer to a deep layer, is simple and convenient to implement, is remarkable in effect improvement, and is suitable for popularization and application. And the method can be used together with the existing distillation method to improve the effect.

Description

technical field [0001] The present invention relates to the field of model compression, in particular to a model distillation method, device and medium based on deep network compression. Background technique [0002] So far, deep learning has been widely used in various industries as a mainstream branch of machine learning. However, the high complexity and high computational cost of most deep models make it difficult not only to run on mobile or embedded devices, but also difficult to apply to real-time tasks. How to effectively compress the model to reduce the time and calculation cost while minimizing the loss of model accuracy has received extensive attention in recent years. [0003] As one of the important techniques of model compression, knowledge distillation has been extensively researched and applied in industry since it was first proposed by Hinton in 2015. The main idea of ​​knowledge distillation is to train a complex network model first, and use the complex ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
CPCG06N20/00
Inventor 苏慧程乐超杨非鲍虎军
Owner ZHEJIANG LAB