Generalized reordering statistic translation method and device based on non-continuous phrase

A statistical translation, discontinuous technology, applied in the field of generalized reordering statistical translation methods and devices, can solve the problems of translation model accuracy limitation and dependence, and achieve the effect of large generalization ability
CN101685441AInactive Publication Date: 2010-03-31INST OF AUTOMATION CHINESE ACAD OF SCI

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
INST OF AUTOMATION CHINESE ACAD OF SCI
Publication Date
2010-03-31
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a generalized reordering statistic translation method and a device based on non-continuous phrases. The device consists of a word alignment module, a language model module, a phrase extraction module, a maximum entropy classifier training module, a minimum error training module and a decoder, provides a generalized reordering module for statistical machine translation basedon phrases, introduces non-continuous phrases, combines continuous phrases and non-continuous phrases by using regulations for any continuous series in a specified script to be translated so as to acquire continuous target translations as more as possibly, and combines the reordering model with a reordering sub model simultaneously to realize local and global reordering of the phrases so as to acquire final target translations for sentences in the source language. The model can grasp local and global reordering knowledge of the phrases, and can acquire the generalization capability of the phrases through non-continuous phrases. Experiment results prove that the model improves the BLUE rating of the reordering model based on the maximum entropy and a translation model based on hierarchicalphrases by about 1.54 percent and 0.66 percent.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of natural language processing, and is a new generalized reordering statistical translation method and device based on discontinuous phrases. Background technique

[0002] In statistical machine translation, phrase-based translation models have improved word-based translation models. In the phrase-based translation model, a phrase is any continuous substring without syntactic constraints, which can learn some local knowledge, such as local ordering, or translation of multi-word expressions, and the insertion and insertion of words related to local contexts. delete. However, in phrase-based translation models, key issues such as lack of discontinuous phrases, weak phrase reordering ability, and generalization ability are still not effectively addressed.

[0003] In order to improve phrase-based translation models, two issues must be addressed. One is the type of phrase, which must include both continuous p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More