Unlock instant, AI-driven research and patent intelligence for your innovation.

Output regularization method based on teacher model classification layer weight

A model and teacher technology, applied in the field of computing, can solve the problems of long training time and large resources, and achieve the effect of fast training speed, high classification accuracy and less training resources

Pending Publication Date: 2022-07-22
ZHEJIANG UNIV OF TECH
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention solves the problems existing in the prior art, provides an optimized output regularization method based on the weight of the classification layer of the teacher model, and overcomes the excessive resources required for training the student model by using the teacher model in the prior art , the problem of too long training time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Output regularization method based on teacher model classification layer weight
  • Output regularization method based on teacher model classification layer weight
  • Output regularization method based on teacher model classification layer weight

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention will be further described in detail below with reference to the embodiments, but the protection scope of the present invention is not limited thereto.

[0047] The present invention relates to an output regularization method based on the weight of the classification layer of a teacher model. The method converts the weight of the classification layer of the teacher model after supervised training into a correlation matrix between categories, with each row in the matrix As the soft label of the corresponding category, it provides additional information for the student model and participates in the training of the student model; the student model with the highest accuracy rate is selected as the final target model.

[0048] The classification is image classification.

[0049] In the present invention, the output regularization method based on the weight of the classification layer in the teacher model consists of six parts: data set preparation, data ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a teacher model classification layer weight-based output regularization method, which comprises the following steps of: converting the weight of a classification layer of a teacher model subjected to supervised training into a correlation matrix among categories, and taking each row in the matrix as a soft label of a corresponding category to provide additional information for a student model and participate in training of the student model; and selecting the student model with the highest accuracy as a final target model. According to the method, the information provided by the teacher model is fully utilized, the problems that the teacher model occupies too large training resources and the overall training time is too long in the training process are reduced, even if some neural network models can only provide the weight of a teacher model classifier layer, the student model can be trained through the method, and the training efficiency is improved. Compared with the prior art, the method has higher classification accuracy and wider model applicability, has higher training speed, only needs fewer training resources, and can further regularize the network model under the condition of fewer resources.

Description

technical field [0001] The present invention relates to the technical field of calculation; calculation or counting, in particular to an output regularization method based on the weight of the classification layer of the teacher model in the field of image classification in deep learning. Background technique [0002] In deep neural networks, neural network models with a large number of parameters have achieved excellent performance in supervised learning tasks for image classification. However, such neural network models often overfit labeled training samples, resulting in poor generalization. ability. This overfitting phenomenon is the main and common problem after training modern deep neural network models with millions of parameters with labeled datasets. In order to solve this overfitting problem, researchers at home and abroad have proposed different Solutions, including regularization methods. [0003] The regularization method includes regularization at the input a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06V10/764G06V10/82G06K9/62G06N3/04G06N3/08
CPCG06N3/04G06N3/08G06F18/241
Inventor 梅建萍仇文豪
Owner ZHEJIANG UNIV OF TECH