Training method and device of translation model, text processing method and device and storage medium

A translation model training technique, applied in the fields of text processing methods, translation model training methods, devices, and storage media. It addresses problems such as noise interference, which degrades the training accuracy and training speed of translation models and hinders their wide use, and it achieves improved training accuracy, improved training speed, and stronger generalization ability.

Pending Publication Date: 2019-12-20
TENCENT TECH (SHENZHEN) CO LTD
Cites: 0; Cited by: 13

AI Technical Summary

Problems solved by technology

Because the decoder's task is complex, it requires high-precision training samples that have been denoised. For less widely spoken languages or languages that lack training samples, the...




Example Embodiment

[0064] To make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings. The described embodiments should not be regarded as limiting the present invention. All other embodiments obtained by those of ordinary skill in the art without creative effort fall within the protection scope of the present invention.

[0065] The following description refers to "some embodiments", which describe a subset of all possible embodiments. It is understood that "some embodiments" may be the same subset or different subsets of all possible embodiments, and that they can be combined with each other where no conflict arises.

[0066] Before the embodiments of the present invention are described in further detail, the nouns and terms involved in the embodiments of the present invention are explained. The nouns and terms involved in the embodi...



Abstract

The invention provides a training method for a translation model. The training method comprises: obtaining a first training sample set; denoising the first training sample set to form a corresponding second training sample set; processing the first training sample set through the translation model to determine initial parameters of the translation model; in response to the initial parameters of the translation model, processing the second training sample set through the translation model to determine update parameters of the translation model; and, according to the update parameters of the translation model, iteratively updating the encoder parameters and decoder parameters of the translation model through the first training sample set and the second training sample set. The invention further provides a text processing method, a device, and a storage medium. The method strengthens the generalization ability of the translation model and improves its training accuracy and training speed, while making full and effective use of the gain that existing noisy sentences contribute to model training, so that the translation model can adapt to different usage scenarios.
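The order of the steps in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the patent's implementation: the "translation model" is replaced by a two-parameter numeric model `y_hat = dec * enc * x`, the denoising rule (a median-ratio filter) is hypothetical, and the names `denoise`, `train_epoch`, `train`, and `FIRST_SET` are invented here. The sketch only shows the flow: noisy set for initial parameters, denoised set for update parameters, then iteration over both sets.

```python
import statistics

# Toy "first training sample set": six clean pairs (y = 2x) plus two noisy pairs.
# This data is made up for illustration only.
CLEAN = [(x, 2 * x) for x in (1.0, 1.2, 1.4, 1.6, 1.8, 2.0)]
FIRST_SET = CLEAN + [(1.0, 5.0), (1.5, 0.5)]


def denoise(samples, tolerance=0.5):
    """Hypothetical denoising rule: keep pairs whose y/x ratio is close
    to the median ratio of the whole set."""
    median_ratio = statistics.median(y / x for x, y in samples)
    return [(x, y) for x, y in samples if abs(y / x - median_ratio) <= tolerance]


def train_epoch(params, samples, lr=0.01):
    """One pass of per-sample gradient descent on squared error for the
    toy model y_hat = dec * enc * x. Returns updated (enc, dec)."""
    enc, dec = params
    for x, y in samples:
        err = dec * enc * x - y
        grad_enc = err * dec * x  # d(0.5 * err^2) / d(enc)
        grad_dec = err * enc * x  # d(0.5 * err^2) / d(dec)
        enc -= lr * grad_enc
        dec -= lr * grad_dec
    return enc, dec


def train(first_set, rounds=50, warmup_epochs=5):
    """Follow the order of steps in the abstract on the toy model."""
    # Step 2: denoise the first set to form the second set.
    second_set = denoise(first_set)
    # Step 3: process the first (noisy) set to determine initial parameters.
    params = (1.0, 1.0)
    for _ in range(warmup_epochs):
        params = train_epoch(params, first_set)
    # Steps 4-5: starting from the initial parameters, iterate over both
    # sets, updating the "encoder" and "decoder" parameters together.
    for _ in range(rounds):
        params = train_epoch(params, first_set)
        params = train_epoch(params, second_set)
    return params


if __name__ == "__main__":
    enc, dec = train(FIRST_SET)
    print(f"enc={enc:.3f} dec={dec:.3f} product={enc * dec:.3f}")
```

In this sketch each iteration ends with a pass over the denoised set, so the final parameters are pulled toward the clean data while the noisy pairs still contribute gradient signal. A real neural machine translation system would use an actual encoder-decoder network and a corpus-level noise filter in place of these stand-ins.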

Description

Technical field

[0001] The present invention relates to machine translation (MT, Machine Translation) technology, and in particular to a translation model training method, a text processing method, a device, and a storage medium.

Background

[0002] With the development of machine translation, Neural Machine Translation (NMT, Neural Machine Translation) has been widely adopted as a new generation of translation technology. A neural machine translation system is built on the encoder-decoder framework. During translation, however, the decoder has multiple tasks, such as recording the content translated so far and the content still to be translated, and recording information related to the fluency of the translation. Because the decoder's task is complex, it requires high-precision training samples that have been denoised. For less widely spoken languages or languages that lack training samples, the noise interferen...


Application Information

IPC(8): G06F17/28, G06N3/08
CPC: G06N3/08
Inventor: 伍海江, 袁松岭, 王晓利
Owner: TENCENT TECH (SHENZHEN) CO LTD