A Method for Avoiding Duplication of Segments in Machine Translation
A machine translation technique in the field of natural language translation and computing. It addresses two problems of existing approaches: they are highly limited, and their repetition penalty strategies are not fully effective. The method achieves the effect of avoiding repetition in the translation.
Detailed Description of the Embodiments
[0041] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.
[0042] As shown in Figure 1, this embodiment discloses a method for avoiding repeated segments in machine translation output. During greedy decoding, a repeated-segment detection mechanism is applied to the translation, and the generation probability of a repeated target word is penalized. As the length of the repeated segment increases, the penalty on the target word's generation probability escalates from a logarithmic level to a linear level and then to an exponential level, so that the machine translation system avoids generating repeated segments. The method comprises the following steps:
[0043] Step 1: Data processing. Process the bilingual parallel corpus into sentence pairs of the form (source language sentence, target language sentence), namely (s_i, t_i)...
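The escalating penalty described in paragraph [0042] can be illustrated with a minimal Python sketch. This is not the patent's implementation: the text above does not give the detection procedure, the thresholds at which the penalty changes level, or the exact penalty formulas. The function names, the thresholds `log_max` and `linear_max`, the scale `alpha`, and the choice to subtract the penalty from the log-probability are all assumptions made for illustration.

```python
import math


def repeated_suffix_length(prefix, candidate):
    """Length of the longest segment ending in `candidate` that already
    occurs earlier in the translation (0 if the candidate repeats nothing).
    Hypothetical detection rule; the source does not specify one."""
    seq = list(prefix) + [candidate]
    end = len(seq) - 1
    best = 0
    for j in range(end):  # every earlier position in the translation
        k = 0
        # Walk backwards while the earlier segment matches the current suffix.
        while j - k >= 0 and seq[j - k] == seq[end - k]:
            k += 1
        best = max(best, k)
    return best


def repetition_penalty(n, log_max=2, linear_max=4, alpha=1.0):
    """Penalty for a repeated segment of length n.
    Assumed schedule: logarithmic up to log_max, linear up to linear_max,
    exponential beyond that."""
    if n <= 0:
        return 0.0
    if n <= log_max:
        return alpha * math.log(1 + n)
    if n <= linear_max:
        return alpha * n
    return alpha * math.exp(n)


def greedy_step(prefix, log_probs):
    """Pick the token with the highest penalized log-probability."""
    def score(token):
        n = repeated_suffix_length(prefix, token)
        return log_probs[token] - repetition_penalty(n)
    return max(log_probs, key=score)


# Toy example: the model's raw preference is to repeat "the cat sat".
prefix = ["the", "cat", "sat", "the", "cat"]
log_probs = {
    "sat": math.log(0.6),
    "on": math.log(0.3),
    "</s>": math.log(0.1),
}
print(greedy_step(prefix, log_probs))  # prints: on
```

In this toy example, choosing "sat" would complete a repeated segment of length 3, which falls in the assumed linear band, so its score drops below that of "on" even though its raw probability is higher. With real model outputs the same step would be applied at every decoding position.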


