Sequence classification for machine translation

a machine translation and sequence technology, applied in the field of sequence classification, can solve the problems of insufficient independence assumption, excessive computational requirements for training models, and insufficient independence assumption, in fact incorrect assumptions
US7783473B2Inactive Publication Date: 2010-08-24NUANCE COMM INC

Patent Information

Authority / Receiving Office
US · United States
Patent Type
Patents(United States)
Current Assignee / Owner
NUANCE COMM INC
Publication Date
2010-08-24
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

Classification of sequences, such as the translation of natural language sentences, is carried out using an independence assumption. The independence assumption is an assumption that the probability of a correct translation of a source sentence word into a particular target sentence word is independent of the translation of other words in the sentence. Although this assumption is not a correct one, a high level of word translation accuracy is nonetheless achieved. In particular, discriminative training is used to develop models for each target vocabulary word based on a set of features of the corresponding source word in training sentences, with at least one of those features relating to the context of the source word. Each model comprises a weight vector for the corresponding target vocabulary word. The weights comprising the vectors are associated with respective ones of the features; each weight is a measure of the extent to which the presence of that feature for the source word makes it more probable that the target word in question is the correct one.
Need to check novelty before this filing date? Find Prior Art

Description

BACKGROUND

[0001] The present invention relates to sequence classification such as required when carrying out machine translation of natural language sentences.

[0002] In machine translation, the objective is to translate a source sentence such as the English sentence

[0003] I need to make a collect callinto a target sentence, such as the Japanese version of that sentence This task is a special case of the more general problem known as sequence classification.

[0004] Stated in more general terms, the natural language translation problem can be understood as a specific case of taking a source symbol sequence and classifying it as being a particular target symbol sequence. For convenience, the discussion herein uses the terms “word,”“sentence,” and “translation” rather than “symbol,”“sequence” and “classification,” respectively. It is to be understood, however, that the invention is applicable to the more general case of translating one sequence of symbols into another. It will also be apprec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More