Translation text integrity evaluation method based on bilingual text structure information

A technique concerning text structure information and translation integrity, applied in special data processing applications, instruments, electronic digital data processing, etc. It addresses problems such as damage to the integrity of semantic units, misleading readers' understanding of translations, and failure to consider semantic integrity, with the effect of improving translation quality.

Active Publication Date: 2015-09-16
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

However, in existing machine translation systems, phrase segmentation and reordering do not take semantic integrity into account, and the final translation is determined only by translation probability and language model scores. As a result, translations often break the integrity of semantic units.
This not only affects the fluency and coherence of the entire translation, but also misleads readers' understanding of the translation.


Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0042] All code of the present invention is implemented in the C++ programming language, and the development platform is Ubuntu Linux 8.04. Because the program uses no platform-specific code, the implementation can also run on the Windows operating system.

[0043] The basic idea of the present invention is that, during decoding with the hierarchical phrase-based translation model, the decoder can fully and appropriately exploit the semantic integrity information provided by source-side and target-side textual linguistic knowledge, thereby further improving the translation quality of current statistical machine translation.
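As a rough illustration of this idea, the sketch below scores translation candidates with a weighted sum of log-domain features and shows how an additional integrity feature can change which candidate wins. It is a minimal sketch and not the patent's C++ implementation: the feature names (`tm`, `lm`, `integrity`), the weights, and the probability values are all hypothetical.

```python
import math

def loglinear_score(features, weights):
    """Weighted sum of log-domain feature values: sum_i lambda_i * h_i."""
    return sum(weights[name] * value for name, value in features.items())

def best(candidates, weights):
    """Return the text of the highest-scoring candidate."""
    return max(candidates, key=lambda c: loglinear_score(c["features"], weights))["text"]

# Hypothetical candidates; every probability here is made up for illustration.
candidates = [
    {"text": "candidate A",
     "features": {"tm": math.log(0.30), "lm": math.log(0.20), "integrity": math.log(0.10)}},
    {"text": "candidate B",
     "features": {"tm": math.log(0.25), "lm": math.log(0.20), "integrity": math.log(0.90)}},
]

# Baseline: only translation model and language model scores count.
baseline_weights = {"tm": 1.0, "lm": 1.0, "integrity": 0.0}
# Extended: the integrity feature also contributes (the weight is an assumption).
extended_weights = {"tm": 1.0, "lm": 1.0, "integrity": 0.5}

print(best(candidates, baseline_weights))
print(best(candidates, extended_weights))
```

With these made-up numbers the baseline prefers candidate A, while the extended weights prefer candidate B, whose units are judged more complete. In a real system the weights would be tuned on held-out data rather than set by hand.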

[0044] Figure 1 shows the flow chart of the translation system that integrates...

Abstract

The invention discloses a method for evaluating translated-text integrity based on bilingual text structure information. The method comprises three steps: first, extracting a training corpus for a target-side text unit integrity model from target-side text structure trees; second, building the target-side text unit integrity model from the training corpus produced in the first step; third, integrating the target-side text unit integrity model into a log-linear translation model and generating translations with an adaptive decoding method. With the help of bilingual text structure information, the decoder can fully and appropriately exploit the semantic integrity information provided by bilingual textual linguistic knowledge, further improving the translation quality of current statistical machine translation.
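The first step can be pictured with a small sketch. The abstract does not say how training examples are read off the structure tree, so everything below is an assumption made for illustration: the tree is a nested Python list whose leaves are elementary text units, a span of units counts as "complete" when it is exactly covered by one tree node, and every other span counts as "incomplete".

```python
def node_spans(tree, start=0):
    """Return (end, spans): the leaf-index ranges [start, end) covered by each node."""
    if isinstance(tree, str):              # leaf: one elementary text unit
        return start + 1, {(start, start + 1)}
    spans = set()
    pos = start
    for child in tree:
        pos, child_spans = node_spans(child, pos)
        spans |= child_spans
    spans.add((start, pos))                # the span covered by this internal node
    return pos, spans

def training_examples(tree):
    """Label every contiguous span of units as complete (True) or not (False)."""
    n, complete = node_spans(tree)
    return [((i, j), (i, j) in complete)
            for i in range(n) for j in range(i + 1, n + 1)]

# Hypothetical three-unit tree: units u0 and u1 form a subtree, u2 attaches above it.
tree = [["u0", "u1"], "u2"]
for span, label in training_examples(tree):
    print(span, label)
```

In this toy tree the span covering u1 and u2 crosses a subtree boundary, so it is the only span labeled incomplete. Labeled spans of this kind are one plausible input for the model built in the second step.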

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a statistical machine translation method based on discourse analysis.

Background

[0002] Machine translation (MT) refers to the translation of one natural language (usually called the source language) into another natural language (usually called the target language) with the help of computer technology.

[0003] After more than 20 years of development, research on statistical machine translation has achieved a series of innovative results, and both translation models and practical systems are constantly being improved. From word-based translation models to phrase-based translation models, and then to syntax-based translation models, researchers have gradually integrated linguistic knowledge into statistical machine translation. Currently, machine translation can achieve good results for some ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/28, G06F17/27
Inventor: 周玉, 涂眉, 宗成庆
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI