Generalized reordering statistic translation method and device based on non-continuous phrase
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- INST OF AUTOMATION CHINESE ACAD OF SCI
- Publication Date
- 2010-03-31
- Estimated Expiration
- Not applicable · inactive patent
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The invention relates to the technical field of natural language processing, and is a new generalized reordering statistical translation method and device based on discontinuous phrases. Background technique
[0002] In statistical machine translation, phrase-based translation models have improved word-based translation models. In the phrase-based translation model, a phrase is any continuous substring without syntactic constraints, which can learn some local knowledge, such as local ordering, or translation of multi-word expressions, and the insertion and insertion of words related to local contexts. delete. However, in phrase-based translation models, key issues such as lack of discontinuous phrases, weak phrase reordering ability, and generalization ability are still not effectively addressed.
[0003] In order to improve phrase-based translation models, two issues must be addressed. One is the type of phrase, which must include both continuous p...