Knowledge distillation method and device based on self-attention and computer device
A distillation method and attention technology, applied in the field of artificial intelligence, can solve problems that cannot meet the requirements of different task types, model knowledge distillation training, etc.
Image
Examples
Embodiment Construction
[0047] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.
[0048] refer to figure 1 , a knowledge distillation method based on self-attention in an embodiment of the present application, including:
[0049] S1: Input the input data into the first model to obtain the first feature matrix output by the intermediate layer of the first model, input the input data into the second model to obtain the second feature matrix output by the intermediate layer of the second model, Wherein, the first model is a trained teacher model, the second model is a student model to be trained, and the first feature matrix and the second feature matrix ...
PUM
Login to View More Abstract
Description
Claims
Application Information
- IPC
- G06Q50/20; G06Q10/06; G06N3/04; G06N20/00; G06F17/16
- CPC
- G06Q50/205; G06Q10/067; G06N20/00; G06F17/16; G06N3/045
- Inventors
- 徐泓洋; 王广新



