
A Statistical Machine Translation Method Fused with Translation Memory and Phrase Translation Model

A statistical machine translation and translation memory technique, applied in the field of natural language processing. It addresses the problem that translation memory systems cannot translate fully automatically, and it improves both translation quality and work efficiency.

Active Publication Date: 2016-06-29
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

The reference translation given by translation memory software is the translation of the sentence most similar to the sentence to be translated, not a direct translation of that sentence, so it must be modified manually.
Translation memory software can therefore serve only as an auxiliary tool for professional translators and cannot be used on its own as an automatic translation system.
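
The limitation described above can be illustrated with a minimal sketch of translation memory lookup. It assumes a word-level edit-distance fuzzy match score, which is a common choice in translation memory tools but is not stated in this text; the function names and the toy memory are hypothetical.

```python
def edit_distance(a, b):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def fuzzy_match_score(source, candidate):
    """Assumed score: 1 - edit_distance / length of the longer sentence."""
    s, c = source.split(), candidate.split()
    longest = max(len(s), len(c))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(s, c) / longest


def best_tm_match(source, memory):
    """Return the (tm_source, tm_target) pair whose source is most similar."""
    return max(memory, key=lambda pair: fuzzy_match_score(source, pair[0]))


# Hypothetical toy translation memory (English to French).
TOY_MEMORY = [
    ("open the file menu", "ouvrir le menu fichier"),
    ("close the window", "fermer la fenêtre"),
]

match = best_tm_match("open the edit menu", TOY_MEMORY)
```

In this toy case the retrieved target still says "fichier" (file) rather than "edit": the memory returns the translation of a similar sentence, which is why manual post-editing is needed.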



Examples


Embodiment Construction

[0020] The present invention will be described in detail below in conjunction with the accompanying drawings. It should be pointed out that the described examples are only considered for the purpose of illustration and not limitation of the present invention.

[0021] All code of the present invention is implemented in the C++ programming language, and the development platform is Ubuntu Linux 8.04. Because the program does not use any platform-specific code, the implementation can also run on the Windows operating system.

[0022] The basic idea of the present invention is to fully and appropriately exploit translation memory information on top of the phrase translation model, and to propose a translation method that integrates the translation memory with the phrase translation model in order to improve the translation quality of statistical machine translation.

[0023] Figure 1 shows the flow chart of the translation method of the fusion of translation memory and phrase t...


Abstract

The invention discloses a statistical machine translation method that integrates translation memory with a phrase translation model. The method comprises: a first step of using a training set to obtain bilingual sentence pairs segmented into phrases; a second step of finding the corresponding translation memory phrase pairs in the translation memory according to the obtained segmented sentence pairs, and extracting relevant features of those translation memory phrase pairs; and a third step of integrating the phrase translation model with the extracted translation memory features to obtain the target translation of the sentence currently being translated. Building on a traditional phrase translation model, the method fully and appropriately exploits the information provided by the translation memory to improve statistical machine translation quality.
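
The third step can be pictured with a small sketch. It is not the patented method: it assumes a log-linear score (the usual way to combine features in phrase-based systems), a single hypothetical translation memory feature that fires when a candidate agrees with the memory, and monotone phrase-by-phrase choice with no language model or reordering. The phrase table, memory phrases, weights, and function names are all invented for illustration.

```python
import math

# Hypothetical phrase table: source phrase -> {target phrase: probability}.
PHRASE_TABLE = {
    "open": {"ouvrir": 0.7, "ouvert": 0.3},
    "the menu": {"la carte": 0.55, "le menu": 0.45},
}

# Hypothetical phrase pairs taken from the best-matching memory sentence.
TM_PHRASES = {"open": "ouvrir", "the menu": "le menu"}


def tm_agreement(src_phrase, tgt_phrase, tm_phrases):
    """Assumed feature: 1.0 if the candidate equals the memory's target phrase.

    Returns 0.0 when the memory has no entry for the source phrase.
    """
    return 1.0 if tm_phrases.get(src_phrase) == tgt_phrase else 0.0


def score(src_phrase, tgt_phrase, weights, tm_phrases):
    """Log-linear score: weighted phrase-model log-probability plus TM feature."""
    log_p = math.log(PHRASE_TABLE[src_phrase][tgt_phrase])
    agree = tm_agreement(src_phrase, tgt_phrase, tm_phrases)
    return weights["phrase_model"] * log_p + weights["tm_agree"] * agree


def translate(src_phrases, weights, tm_phrases):
    """Pick the best-scoring target for each source phrase, left to right."""
    chosen = []
    for src in src_phrases:
        best = max(PHRASE_TABLE[src],
                   key=lambda tgt: score(src, tgt, weights, tm_phrases))
        chosen.append(best)
    return " ".join(chosen)


baseline = translate(["open", "the menu"],
                     {"phrase_model": 1.0, "tm_agree": 0.0}, TM_PHRASES)
with_tm = translate(["open", "the menu"],
                    {"phrase_model": 1.0, "tm_agree": 1.0}, TM_PHRASES)
```

With the memory feature switched off, the toy phrase table prefers "la carte" for "the menu"; with it switched on, the agreement bonus tips the choice to "le menu". A real system would tune the weights on held-out data rather than set them by hand.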

Description

Technical field

[0001] The invention relates to the technical field of natural language processing, in particular to a statistical machine translation method integrating translation memory and a phrase-based translation model.

Background art

[0002] Statistical machine translation is a technology that automatically learns translation rules from bilingual parallel corpora and uses these rules to translate new sentences automatically. Statistical machine translation mainly includes word-based models, phrase-based models and translation models based on syntax tree structures. Among them, the phrase-based translation model and the syntax-tree-based translation model are the current mainstream methods of machine translation.

[0003] After more than two decades of development, statistical machine translation has made great progress, and the quality of translation has been continuously improved. Between some special language...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/28, G06F17/27
Inventor: 汪昆, 宗成庆, 苏克毅
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI