Training method and device of a machine translation model

A training method for a machine translation model, applied in the field of electronic technology, that addresses low translation accuracy by reducing over-learning of punctuation marks and avoiding over-fitting.
CN109977424A · Active · Publication Date: 2019-07-05 · BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Publication Date
2019-07-05

Abstract

The invention discloses a training method for a machine translation model. The method comprises: obtaining a first bilingual sentence pair; deleting punctuation marks from the first bilingual sentence pair according to a preset probability to obtain a second bilingual sentence pair; and training a machine translation model using the second bilingual sentence pair. This achieves the technical effects of reducing the machine translation model's over-learning of punctuation marks and improving translation accuracy. The invention further discloses a training device for the machine translation model.
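
The three steps named in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the punctuation set `FINAL_PUNCT`, the choice to strip only sentence-final punctuation, the single deletion decision applied to both sides of a pair, and the example probability are all hypothetical. The source states only that punctuation marks are deleted according to a preset probability.

```python
import random

# Assumed set of sentence-final punctuation marks (not specified in the source).
FINAL_PUNCT = "。．.!?！？"

def strip_final_punct(sentence: str) -> str:
    """Remove trailing whitespace and sentence-final punctuation."""
    return sentence.rstrip().rstrip(FINAL_PUNCT).rstrip()

def make_second_pair(pair, prob, rng):
    """With probability `prob`, delete the final punctuation on both sides."""
    src, tgt = pair
    if rng.random() < prob:
        return strip_final_punct(src), strip_final_punct(tgt)
    return src, tgt

def make_second_pairs(pairs, prob=0.3, seed=0):
    """Build the 'second' bilingual sentence pairs from the 'first' ones."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    return [make_second_pair(p, prob, rng) for p in pairs]

# Toy 'first' bilingual sentence pairs (illustrative only).
first_pairs = [
    ("今天天气很好。", "The weather is nice today."),
    ("你好吗？", "How are you?"),
]
second_pairs = make_second_pairs(first_pairs, prob=0.5)
# second_pairs would then be passed to whatever training routine the model uses.
```

Setting `prob` to 0 leaves every pair unchanged, and setting it to 1 strips the final punctuation from every pair; intermediate values give the model a mix of punctuated and unpunctuated examples.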

Description

Technical field

[0001] The invention relates to the field of electronic technology, in particular to a training method and device for a machine translation model.

Background art

[0002] With the accumulation of a large number of bilingual sentence pairs, the improvement of computer hardware computing power, and the advancement of machine translation algorithms, the performance of machine translation has been greatly improved.

[0003] For some languages, bilingual sentence pairs generally end with sentence-final punctuation. For example, most Chinese sentences end with "。", and most English sentences end with ".". Because this punctuation occurs so frequently in source sentences and always at the end of the sentence, it forms a very clear pattern. When such bilingual sentence pairs are used to train a machine translation model, the model often over-learns the sentence-final punctuation....
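
To get a feel for how strong this pattern is in a given corpus, one could measure the share of sentences that end in punctuation. The sketch below is illustrative only: the function name `final_punct_ratio`, the punctuation set, and the toy sentences are assumptions, not data or methods from the source.

```python
# Assumed set of sentence-final punctuation marks (not specified in the source).
FINAL_PUNCT = "。．.!?！？"

def final_punct_ratio(sentences):
    """Fraction of non-empty sentences whose last character is final punctuation."""
    stripped = [s.rstrip() for s in sentences if s.strip()]
    if not stripped:
        return 0.0
    return sum(s[-1] in FINAL_PUNCT for s in stripped) / len(stripped)

# Toy source sentences (illustrative only): two end in punctuation, one does not.
toy_sources = ["今天天气很好。", "你好吗？", "谢谢"]
ratio = final_punct_ratio(toy_sources)
```

A ratio close to 1 on real training data would indicate the kind of near-universal pattern the paragraph describes.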

Claims
