A structured text translation method and device

A structured text translation technology in the field of machine translation. It addresses the problems that existing neural machine translation cannot be applied well to structured text, that structured text is difficult to translate, and that neural machine translation lacks alignment information.

Active Publication Date: 2020-08-28
TSINGHUA UNIV +1

AI Technical Summary

Problems solved by technology

[0003] Although neural machine translation excels at translating plain text, it still cannot be applied well to structured text, because the translation of structured text must satisfy structural constraints. For example, the translation of content that sits between a pair of HTML tags on the source side must sit between the same pair of HTML tags on the target side. Existing neural machine translation has no structured text corpus with which to train a model for structured text translation. In addition, neural machine translation lacks explicit alignment information and cannot impose structural constraints, which makes it difficult for existing neural machine translation to translate structured text.
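The structural constraint described in [0003] can be illustrated with a small check. This is a minimal sketch, not part of the patent: the function names `tags`, `is_well_nested`, and `preserves_structure` are hypothetical, the markup is assumed to consist only of paired tags (no void tags such as `<br>`), and the check is only a necessary condition. It confirms that the target carries the same tags in a well-nested order, but it does not verify that the content inside each pair is the translation of the corresponding source content, which is what requires alignment information.

```python
import re
from collections import Counter

# Matches an opening or closing tag and captures its name (assumed paired tags only).
TAG_RE = re.compile(r"</?([A-Za-z][A-Za-z0-9]*)[^>]*>")


def tags(text):
    """Return the tags in text, in order, as (name, is_closing) pairs."""
    return [(m.group(1).lower(), m.group(0).startswith("</"))
            for m in TAG_RE.finditer(text)]


def is_well_nested(text):
    """True if every closing tag matches the most recent unclosed opening tag."""
    stack = []
    for name, closing in tags(text):
        if not closing:
            stack.append(name)
        elif not stack or stack.pop() != name:
            return False
    return not stack


def preserves_structure(source, target):
    """Necessary condition only: same tags on both sides, well nested in the target."""
    return is_well_nested(target) and Counter(tags(source)) == Counter(tags(target))
```

For instance, `preserves_structure("Click <b>here</b> now", "Cliquez <b>ici</b> maintenant")` holds, while a target that drops the `<b>` pair or emits `</b>` before `<b>` fails the check.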

Examples

Embodiment Construction

[0022] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings in the embodiments. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0023] Machine translation is the process of converting one natural language into another natural language through a computer. In recent years, the rapid development of neural network machine translation technology has significantly improved the quality of machine translation. Although the existing machine translation has excellent results when tr...

Abstract

Embodiments of the present invention provide a method and device for translating structured text. The method includes: removing the structural markup from the target structured text to be translated to obtain the target text; inputting the target text into a trained text translation neural network model, which searches the translation candidates of the target text according to a phrase search space and translates them to obtain the target translated text and alignment information; and, according to the alignment information, restoring the structural markup in the target translated text to obtain the target structured translated text. In the embodiments of the present invention, the structural markup is removed from the structured text, the markup-free text is translated by a neural network model based on the phrase search space, and the structural markup is then restored in the translated text to obtain structured translated text, so that structured text is translated by a neural network model.
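The three steps of the abstract (remove markup, translate with alignment, restore markup) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: all function names are hypothetical, tokens are whitespace-separated words, markup is assumed to be well-nested paired tags, and `toy_translate` is a hand-written stand-in for the trained model. It does not perform any phrase-search-space decoding; it only returns target tokens plus word alignments so that the restoration step has something to work with.

```python
import re

# Splits structured text into tags and whitespace-delimited words.
TOKEN_RE = re.compile(r"<[^>]+>|[^<\s]+")


def strip_markup(structured):
    """Step 1: return plain tokens and tag spans (open, close, start, end).

    start/end are token indices (end exclusive) of the words inside the pair.
    Assumes well-nested paired tags.
    """
    tokens, spans, stack = [], [], []
    for piece in TOKEN_RE.findall(structured):
        if piece.startswith("</"):
            open_tag, start = stack.pop()
            spans.append((open_tag, piece, start, len(tokens)))
        elif piece.startswith("<"):
            stack.append((piece, len(tokens)))
        else:
            tokens.append(piece)
    return tokens, spans


def toy_translate(tokens):
    """Step 2 stand-in (hypothetical): fixed lexicon, moves "red" to the end.

    Returns target tokens and alignment as (source_index, target_index) pairs.
    """
    lexicon = {"the": "la", "red": "rouge", "car": "voiture"}
    order = sorted(range(len(tokens)), key=lambda i: (tokens[i] == "red", i))
    target = [lexicon.get(tokens[i], tokens[i]) for i in order]
    alignment = [(src, tgt) for tgt, src in enumerate(order)]
    return target, alignment


def restore_markup(target, spans, alignment):
    """Step 3: wrap the target tokens aligned to each source span in its tags."""
    opens, closes = {}, {}
    for open_tag, close_tag, start, end in spans:  # inner spans come first
        covered = [t for s, t in alignment if start <= s < end]
        if not covered:
            continue  # nothing aligned to this span; the sketch drops the tag
        opens.setdefault(min(covered), []).insert(0, open_tag)   # outer tag first
        closes.setdefault(max(covered), []).append(close_tag)    # inner tag first
    pieces = ["".join(opens.get(i, [])) + tok + "".join(closes.get(i, []))
              for i, tok in enumerate(target)]
    return " ".join(pieces)


def translate_structured(structured, translate=toy_translate):
    """Run the three steps end to end with a pluggable translate function."""
    tokens, spans = strip_markup(structured)
    target, alignment = translate(tokens)
    return restore_markup(target, spans, alignment)
```

With the toy translator, `translate_structured("the <b>red</b> car")` places the `<b>` pair around "rouge" even though the word moved to the end of the sentence. One design simplification to note: each tag pair is wrapped around the range from the first to the last aligned target token, so a discontiguous alignment would also enclose the tokens in between.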

Description

Technical field

[0001] The invention relates to the technical field of machine translation, in particular to a structured text translation method and device.

Background

[0002] In recent years, the rapid development of neural machine translation technology has significantly improved the quality of machine translation. This improvement in quality has in turn led to machine translation being widely used in everyday life.

[0003] Although neural machine translation excels at translating plain text, it still cannot be applied well to structured text, because the translation of structured text must satisfy structural constraints. For example, the translation of content that sits between a pair of HTML tags on the source side must sit between the same pair of HTML tags on the target side. However, in existing neural machine translation there is no structured text corpus for training the model for structured t...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/58, G06F40/289
CPC: G06F40/289, G06F40/58
Inventor: 刘洋, 张嘉成, 栾焕博, 孙茂松, 翟飞飞, 许静芳
Owner: TSINGHUA UNIV