Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A machine translation method

A technology of machine translation and translation probability, applied in the field of natural language processing, which can solve the problems of high translation error rate, useless exchange structure, and no consideration of the influence of the target language side.

Inactive Publication Date: 2011-12-07
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF3 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method based on hierarchical phrases extracts phrases with intervals from word-aligned bilingual sentences, and uses the expansion of intervals to obtain a hierarchical syntactic analysis tree. In the process of sentence structure generation, this method only considers the input The information of the source language sentence does not consider the impact of the target language on translation
The method based on reverse transcription grammar allows two forms of word position exchange (preservation and reverse order), and the number of words in each exchange is limited to two. Therefore, the generated sentence structure is expressed in the form of a binary tree. The shortcoming of the method is that the position exchange of words is limited to only between two nodes, and there may be too many useless exchange structures in actual translation, which leads to the problem of high translation error rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A machine translation method
  • A machine translation method
  • A machine translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] The present invention will be described in more detail below through specific embodiments in conjunction with the accompanying drawings.

[0073] Such as figure 1 as shown, figure 1 The implementation flow chart of the overall technical solution of the machine translation method based on the improved bilingual syntax tree structure provided by the present invention, the method includes the following steps:

[0074] Step 1) convert the bilingual sentence of word alignment into an improved bilingual syntax tree structure;

[0075] This example shows how to generate an improved bilingual syntax tree structure ( figure 2 ).

[0076] Given word-aligned bilingual sentences:

[0077]

[0078] Generate the corresponding bilingual syntax tree structure, the specific process is described as follows:

[0079] a) Express the word alignment relationship of bilingual sentences in the form of an alignment matrix, as follows

[0080]

[0081] Among them, ● indicates that t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a machine translation method and belongs to the technical field of natural language processing. The method of the present invention is: 1) converting the bilingual sentence of word alignment into a bilingual syntax tree structure; 2) extracting phrases with structural attributes at each layer of the bilingual syntax tree, and calculating the phrase translation probability to form a phrase translation table; 3) According to the phrase translation table, use the search algorithm to translate the bilingual sentences to be translated; among them, the tree nodes of the bilingual syntax tree are bilingual word pairs or bilingual phrase pairs that are mutually translated, and the source language end of the parent node of the syntax tree is owned by the parent node The order-preserving combination of the source language end of the child node is obtained, and the target language end is obtained by combining the target language ends of all the son nodes of the parent node in the set word combination order, and the combination of the nodes in the adjacent upper and lower layers in the syntax tree at the target language end The order is reversed; the combination order includes order preservation or reverse order. The invention achieves the effect of improving the translation quality by improving the internal structure of the translation candidate.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, in particular, the invention relates to a machine translation method. Background technique [0002] The analysis of sentence structure in statistical machine translation methods can improve the quality of translations. At present, there are two main methods for sentence structure analysis, one is the linguistic syntax method that uses a syntax analyzer that conforms to the linguistic meaning to analyze the sentence structure (refer to K.Yamada and K.Knight.2001.A Syntax- based Statistical Translation Model.in Proceedings of ACL.p.523-530. and Y.Liu, Q.Liu, and S.Lin.2006.Tree-to-String AlignmentTemplate for Statistical Machine Translation.in Proceedings of ACL.p.609 -616.), the other is a formalized syntax method that does not require a clear syntax analysis process (refer to D. Wu, Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora. Computat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
Inventor 张大鲲孙乐李文波
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products