Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Structured text translation method and device

A structured and textual technology, applied in the field of machine translation, can solve the problems that cannot be well applied, unable to add structural constraints, difficult to translate structured text, etc.

Active Publication Date: 2019-09-13
TSINGHUA UNIV +1
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although neural machine translation is excellent in translating plain text, it still cannot be applied to translating structured text well, because the translation of structured text needs to satisfy structural constraints, for example, the source end is between a pair of HTML tags The translation corresponding to the content of the target must also be included between the same pair of HTML tags, but because in the existing neural machine translation, there is no structured text corpus for training the model for structured text translation; and , neural machine translation lacks explicit alignment information and cannot add structural constraints, making it difficult for existing neural machine translation to translate structured text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Structured text translation method and device
  • Structured text translation method and device
  • Structured text translation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0023] Machine translation is the process of converting one natural language into another natural language through a computer. In recent years, the rapid development of neural network machine translation technology has significantly improved the quality of machine translation. Although the existing machine translation has excellent results when tr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a structured text translation method and device, and the method comprises the steps: removing a structured mark of a to-be-translated target structured text, and obtaining a target text; inputting the target text into a trained text translation neural network model, and performing search translation on the translation candidate words of the target text according to a phrase search space to obtain a target translation text and alignment information; and according to the alignment information, performing structured mark recovery processing on the target translated text to obtain a target structured translated text. According to the embodiment of the invention, the structured mark of the structured text is removed; therefore, the text with the structural mark removed is translated through the neural network model based on the phrase search space, the translated text is restored to the structural mark, the structured translated text is obtained, andthe translated text is translated through the neural network model.

Description

technical field [0001] The invention relates to the technical field of machine translation, in particular to a structured text translation method and device. Background technique [0002] In recent years, the rapid development of neural network machine translation technology has significantly improved the quality of machine translation. Furthermore, the improvement of the quality of machine translation has also made it widely used in real life. [0003] Although neural machine translation is excellent in translating plain text, it still cannot be well applied to translating structured text, because the translation of structured text needs to satisfy structural constraints, for example, the source end is between a pair of HTML tags The translation corresponding to the content of the target must also be included between the same pair of HTML tags, but because in the existing neural machine translation, there is no structured text corpus for training the model for structured t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28G06F17/27
CPCG06F40/289G06F40/58
Inventor 刘洋张嘉成栾焕博孙茂松翟飞飞许静芳
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products