Unlock instant, AI-driven research and patent intelligence for your innovation.

Model training method and device, computer equipment and storage medium

A model training and model technology, applied in neural learning methods, biological neural network models, semantic analysis, etc., can solve the problem of low recognition accuracy, achieve high accuracy, save training time, and fast convergence.

Pending Publication Date: 2022-08-05
SHENZHEN ZHUIYI TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Complex models can be compressed to meet operational requirements through knowledge distillation technology, but the accuracy of text recognition by student models obtained through knowledge distillation is often low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device, computer equipment and storage medium
  • Model training method and device, computer equipment and storage medium
  • Model training method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0119] In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.

[0120] The model training method provided in the embodiment of the present application can be applied to figure 1in the application environment shown. The terminal 102 communicates with the server 104 through the network. The data storage system may store data that the server 104 needs to process. The data storage system can be integrated on the server 104, or it can be placed on the cloud or other network server. The server 104 obtains the text sample data sent by the terminal 102, inputs the text sample data into the first model, obtains the first sample feature d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a model training method and device, computer equipment, a storage medium and a computer program product. The method comprises the steps of inputting text sample data into a first model, and determining first loss according to obtained first sample feature data; inputting the text sample data into a second model, and determining a second loss according to the obtained second sample feature data and the first sample feature data; inputting the text sample data into a third model to obtain third sample feature data, and determining the similarity between the first sample feature data and the third sample feature data based on a preset condition to obtain third loss, or obtaining a third loss according to the similarity between the third sample feature data and the similarity between the first sample feature data and the third sample feature data; and determining a loss function according to the first loss, the second loss and the third loss, wherein the loss function is used for training the first model. According to the scheme, the convergence speed of the first model can be higher, and the text recognition accuracy is higher.

Description

technical field [0001] The present application relates to the technical field of natural language processing, and in particular, to a model training method, apparatus, computer equipment, storage medium and computer program product. Background technique [0002] With the development of deep learning, the use of deep neural networks in natural language processing is increasing. In order to improve the performance of the model, most models are relatively complex, with a large number of parameters and large memory consumption, which are difficult to directly apply to the problem. On devices with limited application resources such as GPU (graphics processing unit, graphics processor) and smart phones. [0003] Knowledge distillation belongs to a transfer learning method, which is to transfer the performance of one model to another model. For teacher-student network, the teacher network is often a more complex network with better performance and generalization ability. The teach...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/30G06N3/08
CPCG06F40/30G06F18/214
Inventor 张旭文博刘云峰
Owner SHENZHEN ZHUIYI TECH CO LTD