Translation rule extraction method and translation method based on dependency grammar tree

A technology that depends on syntax trees and rules, and is applied in special data processing applications, instruments, and electrical digital data processing. Linguistic meaning and other issues

Inactive Publication Date: 2012-11-28
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But the existing dependency syntax tree to string model (reference 1: Deyi Xiong, Qun Liu, and Shouxun Lin. ADependency Treelet String Correspondence Model for Statistical Machine Translation. In Proceedings of Second Workshop on Statistical Machine Translation. 2007.) is based on the source Arbitrary connected subgraphs in the language-dependent syntactic tree are used as the basic structure of translation rules. This kind of translation rules has no clear linguistic meaning. More importantly, this kind of translation rules cannot express all order relations. Order model to constrain the word order of translated strings to complete the entire translation process
In addition, even if heuristics or ordering models are introduced to constrain the word order of translation results, the performance of existing dependency syntax tree-to-string models still lags behind mainstream component tree-to-string models (Reference 2: Yang Liu, Qun Liu, and Shouxun Lin.2006. Tree-to-String Alignment Template for Statistical Machine Translation. In Proceedings of COLING / ACL 2006, pages 609-616, Sydney, Australia, July.)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Translation rule extraction method and translation method based on dependency grammar tree
  • Translation rule extraction method and translation method based on dependency grammar tree
  • Translation rule extraction method and translation method based on dependency grammar tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below through specific embodiments in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0048] In one embodiment of the present invention, a method for extracting translation rules based on a dependency syntax tree is provided. This method extracts translation rules from a corpus containing triples, which are source language-dependent syntax trees, target language strings, and word alignment relationships between source language and target language, that is, (source language-dependent syntax trees, target language string, alignment). In this embodiment, the alignment relationship between the source language and the target language is determined by the alignment tool GIZA...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a translation rule extraction method and a translation method based on a dependency grammar tree. A translation sequence adjusting relationship is directly expressed in the translation rule that a source end is used as a head word and a dependency grammar tree fragment and a target end consisting of modifiers of the head word are used as strings, and thus the translation rule can be used for definitely guiding the translation process. According to the translation rule extracted by the method, the performance of the translation method based on the dependency grammar tree can be improved. On a data set of 1.54 million of parallel bilingual corpus, the performance of a dependency grammar tree to a string translation model is improved by 1.68 BLEU (Bilingual Evaluation Understudy) points compared with that of a component tree to the string model.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and in particular relates to a statistical machine translation method based on a dependency syntax tree. Background technique [0002] Dependency grammars are one of the most popular grammars in natural language processing. Compared with the phrase structure grammar, the dependency grammar has both grammatical and semantic information, and has the following characteristics: the dependency structure has the best phrasal cohesion properties; the dependency edge gives the semantic information. Therefore, dependency grammars are very attractive resources in the field of machine translation. But the existing dependency syntax tree to string model (reference 1: Deyi Xiong, Qun Liu, and Shouxun Lin. ADependency Treelet String Correspondence Model for Statistical Machine Translation. In Proceedings of Second Workshop on Statistical Machine Translation. 2007.) is based on the source ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
Inventor 谢军米海涛刘群
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products