Machine translation engine training system and method and trained machine translation engine

A machine translation engine training technology, applied in the computer field, that addresses problems such as translation errors, degraded translation quality, and interference with the machine translation engine, and improves the engine's ability to handle tags.

Pending Publication Date: 2022-05-17
盐城睿行空间企业孵化器有限公司

AI Technical Summary

Problems solved by technology

[0004] However, on the one hand, subword segmentation may split tags apart, which can easily lead to translation errors when a machine translation engine translates complex sentences. On the other hand, adding tags to the original text can easily interfere with the machine translation engine (especially an engine that has not been specially trained for this): even if the tags themselves are translated correctly, the tag information may degrade the translation quality of the other content in the translation.
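The first problem in [0004] can be illustrated with a toy example. The sketch below is not the segmenter used in the patent; it is a minimal greedy longest-match segmenter with a made-up vocabulary (`TOY_VOCAB` is hypothetical) that shows how a tag such as `<sub>` falls apart when it is not in the subword vocabulary.

```python
def greedy_segment(token, vocab):
    """Greedy longest-match segmentation; unknown text falls back to single characters."""
    pieces = []
    i = 0
    while i < len(token):
        # Try the longest candidate first, shrinking until a vocab entry
        # (or a single character) is found.
        for j in range(len(token), i, -1):
            if token[i:j] in vocab or j == i + 1:
                pieces.append(token[i:j])
                i = j
                break
    return pieces


# Hypothetical vocabulary: it contains fragments of the tag but not the tag itself.
TOY_VOCAB = {"H", "O", "2", "su", "b", "<", ">", "/"}

print(greedy_segment("<sub>", TOY_VOCAB))  # the tag is split into four pieces
```

Once the tag is broken into pieces like this, the engine has to reproduce every piece in the right order for the tag to survive translation.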



Embodiment Construction

[0027] Exemplary embodiments of the present invention are described in detail below with reference to the accompanying drawings.

[0028] Figure 1 schematically shows a machine translation engine training system 1 according to an embodiment of the present invention. As shown in Figure 1, the machine translation engine training system 1 includes a preprocessing replacement module 10, a replacement information storage module 20, a subword segmentation holding module 30, a translation engine training module 40 and a replacement information restoration module 50.
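The text available here names the subword segmentation holding module 30 but does not describe how it works. One plausible reading, given the problem stated in [0004], is that it keeps each special marker as a single token while the rest of the text is segmented normally. The sketch below illustrates that reading only; the marker format `[[T1]]` / `[[/T1]]`, the function names and the toy segmenter are all assumptions, not taken from the patent.

```python
import re

# Hypothetical marker format: [[T1]] opens a pair, [[/T1]] closes it.
MARKER = re.compile(r"(\[\[/?T\d+\]\])")


def segment_holding_markers(text, segment):
    """Segment text into subwords, passing special markers through unsplit.

    `segment` is any callable that maps one word to a list of subword pieces.
    """
    pieces = []
    # The capturing group makes re.split keep the markers in the result.
    for chunk in MARKER.split(text):
        if not chunk:
            continue
        if MARKER.fullmatch(chunk):
            pieces.append(chunk)  # held as a single token
        else:
            for word in chunk.split():
                pieces.extend(segment(word))
    return pieces


def two_char_segmenter(word):
    """Toy stand-in for a real subword model: cut a word into 2-character pieces."""
    return [word[i:i + 2] for i in range(0, len(word), 2)]


print(segment_holding_markers("H[[T1]]2[[/T1]]O water", two_char_segmenter))
```

In a real system the toy segmenter would be replaced by the trained subword model, with the markers protected in the same way.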

[0029] It should be noted that, after the machine translation engine is trained, the training effect needs to be verified so that the machine translation model can be continually optimized and improved. Accordingly, according to an embodiment of the present invention, the machine translation engine training system 1 has two modes of operation: a training mode and a training effect verification mode....


Abstract

The invention discloses a machine translation engine training system and method and a trained machine translation engine. The machine translation engine training system comprises a preprocessing replacement module, a replacement information storage module, a subword segmentation holding module, a translation engine training module and a replacement information restoration module. The preprocessing replacement module replaces tag pairs in received bilingual sentence pairs, or in an original text to be translated, with special marker pairs; only tag pairs with the same tag content are replaced with the same special marker pair. The replacement information restoration module restores the special markers in a received translation, either in the training effect verification mode or when the engine performs a translation task, so that the training effect of the machine translation engine can be evaluated or the final translation can be output. The invention effectively improves the machine translation engine's ability to process formatting information, especially tags, in the text being translated, so that the engine can reliably handle superscript and subscript information in the translated file.
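The replacement and restoration steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes simple attribute-free tags such as `<sub>...</sub>`, treats the tag name as the "tag content", and uses a hypothetical marker format `[[T1]]` / `[[/T1]]`. The `mapping` dictionary stands in for the replacement information storage module.

```python
import re

# Matches a simple, attribute-free tag pair such as <sub>2</sub>.
TAG_PAIR = re.compile(r"<(\w+)>(.*?)</\1>", re.DOTALL)


def replace_tag_pairs(text, mapping=None):
    """Replace tag pairs with numbered marker pairs.

    `mapping` (tag name -> marker index) is returned so that it can be stored
    and reused; pairs with the same tag name always get the same marker.
    """
    mapping = {} if mapping is None else mapping

    def to_marker(match):
        name = match.group(1)
        idx = mapping.setdefault(name, len(mapping) + 1)
        return f"[[T{idx}]]{match.group(2)}[[/T{idx}]]"

    # Repeat until nothing changes so that nested pairs are also replaced.
    while True:
        replaced = TAG_PAIR.sub(to_marker, text)
        if replaced == text:
            return text, mapping
        text = replaced


def restore_tag_pairs(text, mapping):
    """Turn marker pairs back into the original tags using the stored mapping."""
    for name, idx in mapping.items():
        text = text.replace(f"[[T{idx}]]", f"<{name}>")
        text = text.replace(f"[[/T{idx}]]", f"</{name}>")
    return text


source = "H<sub>2</sub>O and CO<sub>2</sub>"
marked, stored = replace_tag_pairs(source)
print(marked)
print(restore_tag_pairs(marked, stored))
```

Passing the same `mapping` to `replace_tag_pairs` for both sides of a bilingual sentence pair would keep the markers aligned between source and target, which is presumably what training on such pairs requires.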

Description

Technical field

[0001] The present invention relates to the field of computer technology, and specifically to machine translation engine training systems and methods and to trained machine translation engines.

Background

[0002] Current mainstream machine translation engines are usually built on neural network models and use the Encoder-Decoder framework. The input of the encoder module is a matrix composed of the word vectors of the words (subwords) in the sentence. After a series of transformations, the encoder outputs a new matrix that represents the information of the entire sentence; each vector in the new matrix encodes one word of the original text together with its context. The decoder module receives the source-text vectors produced by the encoder module, combines them with the vectors of the translated words it has already output, computes the vector representing the next translated word, and finally determines the next word through the softmax (normalized exponential) layer.

[0003] ...
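The final step in [0002], choosing the next word through the softmax layer, can be illustrated with a small standard-library sketch. The scores and the three-word vocabulary are made up for illustration; a real decoder would produce one score per entry of a much larger subword vocabulary, and may use beam search rather than the simple greedy choice shown here.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtracting the maximum avoids overflow in exp()
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pick_next_word(scores, vocab):
    """Greedy choice: return the most probable word and its probability."""
    probs = softmax(scores)
    best = max(range(len(vocab)), key=lambda i: probs[i])
    return vocab[best], probs[best]


# Hypothetical decoder scores for a three-word vocabulary.
word, prob = pick_next_word([0.1, 2.0, -1.0], ["cat", "dog", "fish"])
print(word, round(prob, 3))
```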


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/58, G06F40/289
CPC: G06F40/58, G06F40/289
Inventor 余畅张方元杨攀杨尚为杨子辰
Owner 盐城睿行空间企业孵化器有限公司