Model training method and device, computer equipment and storage medium

A model training and translation model technology, applied in the Internet field, can solve problems such as limited translation ability, low model translation accuracy, and single model structure

Pending Publication Date: 2021-01-22
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the prior art, the autoregressive translation model (or non-autoregressive translation model) includes an encoder and a decoder, and when training the regression translation model (or non-autoregressive translation model), it is based on an encoder and a decoder The model structure is single, and the translation ability learned by the model is limited, so the translation accuracy of the trained model is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model training method and device, computer equipment and storage medium
  • Model training method and device, computer equipment and storage medium
  • Model training method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030]The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.

[0031]It should be noted that the “first”, “second” and other descriptions involved in the embodiments of this application are only for descriptive purposes, and cannot be understood as indicating or implying their relative importance or implicitly specifying the indicated technology The number of features. Therefore, the technical features defined with "first" and "second" may explicitly or implicitly include at least one such feature.

[0032]In order to ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a model training method and device, computer equipment and a storage medium. The method comprises the steps of obtaining a sample text for model training; calling a sample multitask translation model, wherein the sample multitask translation model comprises a sample encoder, a first sample decoder and a second sample decoder; encoding the sample text based on a sample encoder to obtain text features of the sample text; decoding the text features based on a first sample decoder to obtain a first prediction text of the sample text, and decoding the text features based on asecond sample decoder to obtain a second prediction text of the sample text; and training a sample multi-task translation model according to the sample text, the first prediction text and the secondprediction text to obtain a multi-task translation model. By training the multi-task translation model, the accuracy of model translation can be improved.

Description

Technical field[0001]This application relates to the field of Internet technology, in particular to a model training method, device, computer equipment and storage medium.Background technique[0002]With the continuous development and evolution of deep learning, neural network models have been widely used in fields such as natural language processing, speech recognition, and even computer vision, such as neural network machine translation, natural language understanding, automatic speech recognition, target detection and other practical applications It is widely used.[0003]Neural network models are used in neural network machine translation, and mainly include autoregressive translation models and non-autoregressive translation models. The autoregressive translation models may specifically include Transformer models, and the non-autoregressive translation models may specifically include Mask-Predict models. In the autoregressive translation model, the translation is generated word by ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58
CPCG06F40/58
Inventor 王星郝永昌焦文祥涂兆鹏
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products