Statistical machine translation method and system

A statistical machine translation method and translation device, applicable in fields such as instruments, computing, and special data processing applications. It addresses problems such as data sparseness: it improves utilization, alleviates data sparseness, and achieves high translation quality.

Inactive. Publication Date: 2008-10-22
INST OF COMPUTING TECH CHINESE ACAD OF SCI
0 Cites, 26 Cited by

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to overcome the data sparsity problem that existing phrase-based statistical machine translation systems face when the bilingual corpus is limited. It provides a phrase-based statistical machine translation method and system that can output high-quality translations even when bilingual corpora are limited.

Examples

Embodiment Construction

[0052] A phrase-based statistical machine translation method first obtains a bilingual phrase table and then translates the source language sentence. In the prior art, translating the source language sentence includes the following steps. First, the source language sentence to be translated, ${F'}_{1}^{H} = f'_{1} \ldots f'_{H}$ (where $f'_{j}$, $j = 1 \ldots H$, denotes a source language word), is divided into phrases, giving the phrase sequence ${F'}_{1}^{H} = {\tilde{F}'}_{1}^{K} = {f'}_{1}^{X} \cdots {f'}_{K}^{Y}$ (where ${f'}_{1}^{X}$ denotes a source language phrase containing $X$ source language words; ...
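The phrase division step described above can be illustrated with a minimal Python sketch. It enumerates every way of splitting a word sequence into contiguous phrases. The function name `segmentations` and the `max_len` limit on phrase length are assumptions made for illustration; the text does not say how the division is chosen or bounded.

```python
from typing import Iterator, List, Sequence, Tuple

Phrase = Tuple[str, ...]


def segmentations(words: Sequence[str], max_len: int = 3) -> Iterator[List[Phrase]]:
    """Yield every split of `words` into contiguous phrases.

    `max_len` (the longest phrase allowed, in words) is a hypothetical
    parameter, not something specified in the source text.
    """
    if not words:
        yield []  # one way to segment an empty sentence: no phrases
        return
    # Choose the length of the first phrase, then segment the remainder.
    for x in range(1, min(max_len, len(words)) + 1):
        head = tuple(words[:x])
        for rest in segmentations(words[x:], max_len):
            yield [head] + rest


if __name__ == "__main__":
    for seg in segmentations(["f1", "f2", "f3"], max_len=2):
        print(seg)
```

Each yielded list is one candidate phrase sequence; concatenating its phrases always reproduces the original sentence.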

Abstract

The invention discloses a statistical machine translation method and a statistical machine translation system. The method comprises the following steps: first, the source language sentence is divided into phrases, and bilingual phrases are looked up in a bilingual phrase table according to the divided phrases; second, the degree of matching between the divided phrases and the bilingual phrases is examined: on a complete match, the bilingual phrase is added to a candidate phrase table and the fourth step is executed, while on a partial match the third step is executed; third, a translation template is constructed from the divided phrase and the bilingual phrase, and the translations of the words in the divided phrase that differ from the bilingual phrase are filled into the template to generate a new bilingual phrase, which is then added to the candidate phrase table; fourth, the source language sentence to be translated is translated according to the candidate phrase table. The method and system can effectively improve translation quality when bilingual corpora are limited and solve the data sparseness problem of statistical machine translation systems.
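The four steps in the abstract can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the patented implementation: a "partial match" is assumed here to mean a table phrase of the same length that differs in exactly one word, the phrase table is assumed to store a word alignment, and the final step uses simple monotone concatenation in place of a real decoder. All names and the toy romanized data are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Phrase = Tuple[str, ...]
# Hypothetical table layout: source phrase -> (target phrase, {src index: tgt index})
PhraseTable = Dict[Phrase, Tuple[Phrase, Dict[int, int]]]

TOY_TABLE: PhraseTable = {
    ("wo", "xihuan", "pingguo"): (("I", "like", "apples"), {0: 0, 1: 1, 2: 2}),
}
TOY_LEXICON: Dict[str, str] = {"xiangjiao": "bananas"}  # word translations


def find_partial_match(src: Phrase, table: PhraseTable) -> Optional[Tuple[Phrase, int]]:
    """Assumed definition: same length, exactly one differing word position."""
    for cand in table:
        if len(cand) != len(src):
            continue
        diff = [i for i, (a, b) in enumerate(zip(src, cand)) if a != b]
        if len(diff) == 1:
            return cand, diff[0]
    return None


def build_candidates(phrases: List[Phrase], table: PhraseTable,
                     lexicon: Dict[str, str]) -> Dict[Phrase, Phrase]:
    candidates: Dict[Phrase, Phrase] = {}
    for src in phrases:
        if src in table:  # step 2, complete match: copy the entry
            candidates[src] = table[src][0]
            continue
        match = find_partial_match(src, table)
        if match is None:
            continue
        cand, i = match
        tgt, align = table[cand]
        if i not in align or src[i] not in lexicon:
            continue  # the template slot cannot be filled
        # Step 3: the aligned target slot acts as the template variable;
        # fill it with the translation of the differing source word.
        j = align[i]
        candidates[src] = tgt[:j] + (lexicon[src[i]],) + tgt[j + 1:]
    return candidates


def translate(phrases: List[Phrase], candidates: Dict[Phrase, Phrase]) -> str:
    """Step 4 stand-in: monotone concatenation; unknown phrases pass through."""
    out: List[str] = []
    for p in phrases:
        out.extend(candidates.get(p, p))
    return " ".join(out)
```

With the toy data, the phrase `("wo", "xihuan", "xiangjiao")` is not in the table, but it partially matches the stored entry, so a new bilingual phrase is generated from the template rather than being left untranslated. This is how the approach reuses existing phrase pairs when the corpus is sparse.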

Description

Technical field

[0001] The invention relates to the technical field of machine translation, in particular to a phrase-based statistical machine translation method and system.

Background art

[0002] With the continuous progress of society and the rapid development of the economy, international exchange and cooperation are becoming increasingly close, which places higher demands on translation between different languages. Translating between natural languages with the help of the powerful storage and computing capabilities of computers (known as machine translation) can greatly reduce translation costs and improve work efficiency. In addition, the booming Internet and the growing number of multilingual documents provide a large supply of parallel corpora, laying a solid foundation for statistical machine translation.

[0003] Statistical machine translation is a corpus-based translation method. Its main idea is to construct a mathematical model for the translation process, ...

Application Information

IPC(8): G06F17/28, G06F17/30
Inventor: 何中军, 刘群, 林守勋
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI