Translation model training method, statement translation method and device and storage medium

A translation model and sentence technology, applied in the computer field, can solve problems such as poor robustness, excessive sensitivity to small disturbances, and changes in translation results, and achieve the effect of improving robustness.

Active Publication Date: 2019-11-19
TENCENT TECH (SHENZHEN) CO LTD
View PDF14 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the neural machine translation model is based on a complete neural network, the global nature of its modeling causes each output of the target end to depend on each word input by the source end, making it overly sensitive to small disturbances in the input
For example, in the translation from Chinese to English, the user inputs "They are not afraid of difficulties to make Go AI", the English translation given by the machine translation model is "They are not afraid of difficulties to make Go AI", however, when the user enters a similar The sentence "They are not afraid of difficulties to make Go AI", the output of machine translation has changed drastically, and the result is "They are not afraid to make Go AI", although the user just replaced one of the words with a synonym, the translation result drastic changes have taken place
[0004] It can be seen that the stability of the current neural machine translation, that is, the robustness is relatively poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Translation model training method, statement translation method and device and storage medium
  • Translation model training method, statement translation method and device and storage medium
  • Translation model training method, statement translation method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present application are described below in conjunction with the accompanying drawings. Apparently, the described embodiments are only part of the embodiments of the present application, not all of the embodiments. Those of ordinary skill in the art know that, with the development of technology and the emergence of new scenarios, the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.

[0042] The embodiment of the present application provides a translation model training method, which can improve the robustness and translation quality of machine translation. Embodiments of the present application also provide corresponding sentence translation methods, computer equipment, terminal equipment, and computer-readable storage media. Each will be described in detail below.

[0043] With the development of artificial intelligence, the accuracy of machine translation is getting higher ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a translation model training method comprising the steps pf acquiring a training sample set wherein the training sample set comprises a plurality of training samples; determining a disturbance sample set corresponding to each training sample in the training sample set, with the disturbance sample set comprising at least one disturbance sample, the semantic similarity between the disturbance sample and the corresponding training sample being higher than a first preset value; and training the initial translation model by using the plurality of training samples and the disturbance sample set corresponding to each training sample to obtain a target translation model. According to the scheme provided by the embodiment of the invention, a disturbance sample is introducedinto model training, so that the robustness and translation quality of machine translation can be improved.

Description

technical field [0001] The present application relates to the field of computer technology, and in particular to a translation model training method, a sentence translation method, computer equipment, terminal equipment, and a computer-readable storage medium. Background technique [0002] With the development of artificial intelligence, machine translation has been widely used, such as simultaneous interpretation and chat content translation, which are based on machine translation to convert one input language into another language output. [0003] Neural machine translation is a machine translation model based entirely on neural networks. It has achieved a good level of translation in many language pairs and has been widely used in various machine translation products. However, since the neural machine translation model is based on a complete neural network, the global nature of its modeling causes each output of the target end to depend on each word input by the source en...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28G06F17/27
CPCG06F40/44G06F40/284G06F40/216G06F40/30G06F40/247G06N3/08G06N7/01G06N3/045G06F18/214G06F18/2413G06F40/40G06F9/30196
Inventor 程勇涂兆鹏孟凡东翟俊杰刘洋
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products