Translation rule extraction method and translation method based on dependency syntax tree

A technology based on dependency syntax trees and translation rules, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses issues such as ordering relationships.

Status: Inactive. Publication Date: 2011-11-16
INST OF COMPUTING TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

However, the existing dependency syntax tree-to-string model (reference 1: Deyi Xiong, Qun Liu, and Shouxun Lin. A Dependency Treelet String Correspondence Model for Statistical Machine Translation. In Proceedings of the Second Workshop on Statistical Machine Translation. 2007.) uses arbitrary connected subgraphs of the source-language dependency syntax tree as the basic structure of its translation rules. Translation rules of this kind have no clear linguistic meaning. More importantly, they cannot express all reordering relationships, so ...


Embodiment Construction

[0047] To make the objects, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below through specific embodiments with reference to the accompanying drawings. It should be understood that the specific embodiments described here serve only to explain the present invention, not to limit it.

[0048] In one embodiment of the present invention, a method for extracting translation rules based on a dependency syntax tree is provided. The method extracts translation rules from a corpus of triples, each consisting of a source-language dependency syntax tree, a target-language string, and the word alignment between the source language and the target language, that is, (source-language dependency syntax tree, target-language string, alignment). In this embodiment, the alignment between the source language and the target language is obtained with the alignment tool GIZA++ (...
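As a minimal sketch of the corpus entry described in [0048], one triple could be represented as follows. The names `DepNode`, `Triple`, `parse_alignment`, and `target_span`, the head-index encoding of the tree, and the `i-j` alignment text format are all illustrative assumptions and are not taken from the patent. The `i-j` format is a common convention for storing word alignments, not the native output of GIZA++.

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple


@dataclass
class DepNode:
    """One word of the source-language dependency tree (hypothetical layout)."""
    index: int  # 0-based position of the word in the source sentence
    word: str
    head: int   # index of the head word, or -1 for the root


@dataclass
class Triple:
    """(source dependency tree, target string, alignment) corpus entry."""
    tree: List[DepNode]
    target: List[str]
    alignment: Set[Tuple[int, int]]  # (source index, target index) pairs


def parse_alignment(text: str) -> Set[Tuple[int, int]]:
    """Parse whitespace-separated 'i-j' pairs into a set of index pairs."""
    pairs = set()
    for item in text.split():
        src, tgt = item.split("-")
        pairs.add((int(src), int(tgt)))
    return pairs


def target_span(triple: Triple, src_index: int) -> Optional[Tuple[int, int]]:
    """Return (min, max) target positions aligned to a source word, or None."""
    positions = [t for s, t in triple.alignment if s == src_index]
    return (min(positions), max(positions)) if positions else None


# Toy entry with placeholder tokens; word 1 is the root of the tree.
example = Triple(
    tree=[DepNode(0, "s0", 1), DepNode(1, "s1", -1), DepNode(2, "s2", 1)],
    target=["t0", "t1", "t2"],
    alignment=parse_alignment("0-0 1-2 2-1"),
)
```

Storing the alignment as a set of index pairs keeps lookups such as `target_span(example, 1)` simple, which is the kind of query a rule extractor would need when mapping a source word to target positions.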


Abstract

The invention provides a translation rule extraction method and a translation method based on a dependency syntax tree. Reordering relationships are expressed directly in translation rules whose source side is a dependency syntax tree fragment consisting of a head and all of its dependents, and whose target side is a string, so the rules can explicitly guide the translation process. Translation rules extracted by this method can improve the performance of the translation method based on the dependency syntax tree. On a bilingual parallel corpus of 1.54 million entries, the dependency syntax tree-to-string translation model outperforms a constituent tree-to-string model by 1.68 BLEU points.
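The source side of such a rule, a head together with all of its dependents, can be enumerated from a dependency tree with a sketch like the one below. It assumes the tree is encoded as a list of head indices (a hypothetical encoding, with -1 marking the root), and the function name `head_dependent_fragments` is illustrative. It covers only the enumeration of fragments, not the patent's full extraction procedure.

```python
from typing import Dict, List, Tuple


def head_dependent_fragments(heads: List[int]) -> List[Tuple[int, List[int]]]:
    """Return (head, [dependents]) for every word that has dependents.

    heads[i] is the index of word i's head, or -1 if word i is the root.
    Dependents are listed in sentence order.
    """
    children: Dict[int, List[int]] = {}
    for word, head in enumerate(heads):
        if head >= 0:
            children.setdefault(head, []).append(word)
    # Sort by head index so the output order is deterministic.
    return sorted(children.items())


# Toy tree: word 1 is the root; words 0 and 3 depend on it; word 2 depends on 3.
fragments = head_dependent_fragments([1, -1, 3, 1])
```

Each returned pair groups one head with its complete set of dependents, so the relative order of the head and its dependents on the target side can be attached to a single rule.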

Description

Technical field

[0001] The invention belongs to the technical field of natural language processing, and in particular relates to a statistical machine translation method based on a dependency syntax tree.

Background

[0002] Dependency grammar is one of the most popular grammars in natural language processing. Compared with phrase structure grammar, dependency grammar carries both grammatical and semantic information and has the following characteristics: dependency structures have the best phrasal cohesion properties, and dependency edges convey semantic information. Dependency grammars are therefore a very attractive resource for machine translation. However, the existing dependency syntax tree-to-string model (reference 1: Deyi Xiong, Qun Liu, and Shouxun Lin. A Dependency Treelet String Correspondence Model for Statistical Machine Translation. In Proceedings of the Second Workshop on Statistical Machine Translation. 2007.) with the source langua...

Application Information

IPC(8): G06F17/28
Inventors: 米海涛 (Mi Haitao), 刘群 (Liu Qun)
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI