Translation model generation method and device

A translation model generation technique, applied in the computer field, that addresses the high cost of manual translation, the small amount of available colloquial corpus, and the limited improvement such a corpus yields, and that achieves high robustness.

Active Publication Date: 2020-04-17
BEIJING BAIDU NETCOM SCI & TECH CO LTD
Cites: 10; cited by: 2

AI Technical Summary

Problems solved by technology

[0005] However, data mining yields only a very small amount of corpus, and manual translation is extremely expensive. Moreover, if only a small amount of colloquial bilingual corpus can be obtained, the resulting improvement is relatively limited.

Embodiment Construction

[0027] The present disclosure is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the related invention, not to limit it. It should also be noted that, for convenience of description, the drawings show only the parts related to the invention.

[0028] It should be noted that, in the case of no conflict, the embodiments in the present disclosure and the features in the embodiments can be combined with each other. The present disclosure will be described in detail below with reference to the accompanying drawings and embodiments.

[0029] Figure 1 shows an exemplary system architecture 100 to which an embodiment of the method for generating a translation model or the apparatus for generating a translation model of the present disclosure can be applied.

[0030] As shown in Figure 1, the system architec...

Abstract

The embodiment of the invention discloses a translation model generation method and device. One specific embodiment of the method comprises: acquiring a set of original corpus pairs, wherein each original corpus pair comprises a sentence to be translated and its translation; for each original corpus pair in the set, segmenting the sentence to be translated into words, randomly editing those words at least once to generate at least one new corpus, and pairing each new corpus with the translation of the original corpus pair to form at least one new corpus pair; calculating a translation score for each new corpus pair using a pre-trained initial translation model; for each original corpus pair in the set, selecting, from the new corpus pairs generated from it, the pair that has the highest translation score, provided that score is higher than a preset threshold, as a spoken language corpus pair; and training the initial translation model with the spoken language corpus pairs to obtain a spoken language translation model. This embodiment improves the robustness of the translation system to colloquial input.
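The data-generation steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: whitespace splitting stands in for word segmentation, the three edit operations (insert a filler word, delete a word, repeat a word) and the filler list are hypothetical because the text above does not specify them, and `score_fn` is a hypothetical callable standing in for the pre-trained initial translation model's scoring of a (source, translation) pair.

```python
import random
from typing import Callable, List, Tuple

CorpusPair = Tuple[str, str]  # (sentence to be translated, translation)

# Hypothetical filler words; the source does not list any.
FILLERS = ["um", "uh", "well"]


def random_edit(words: List[str], rng: random.Random) -> List[str]:
    """Apply one random edit to a copy of the word list.

    The edit types here (insert, delete, repeat) are assumptions.
    """
    words = list(words)
    op = rng.choice(["insert", "delete", "repeat"])
    if op == "insert":
        words.insert(rng.randint(0, len(words)), rng.choice(FILLERS))
    elif op == "delete" and len(words) > 1:
        del words[rng.randrange(len(words))]
    elif words:
        # Repeat a word; also the fallback when deleting would empty the list.
        i = rng.randrange(len(words))
        words.insert(i, words[i])
    return words


def select_spoken_pairs(
    pairs: List[CorpusPair],
    score_fn: Callable[[str, str], float],  # hypothetical model scorer
    threshold: float,
    n_candidates: int = 5,
    seed: int = 0,
) -> List[CorpusPair]:
    """For each original pair, keep the best-scoring edited candidate,
    but only if its score is higher than the threshold."""
    rng = random.Random(seed)
    selected: List[CorpusPair] = []
    for source, translation in pairs:
        words = source.split()  # assumption: whitespace segmentation
        candidates = [
            " ".join(random_edit(words, rng)) for _ in range(n_candidates)
        ]
        scores = [score_fn(c, translation) for c in candidates]
        best = max(range(n_candidates), key=scores.__getitem__)
        if scores[best] > threshold:
            selected.append((candidates[best], translation))
    return selected
```

The pairs returned by `select_spoken_pairs` correspond to the spoken language corpus pairs; the final step described above, training the initial translation model on them, is left out because it depends on the model framework in use.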

Description

Technical field

[0001] The embodiments of the present disclosure relate to the field of computer technology, and in particular to a method and device for generating a translation model.

Background

[0002] As speech recognition and machine translation technologies continue to mature, many speech-oriented translation products have emerged, such as translators and conference interpretation. Speech translation differs from text translation, however, in that its input is heavily colloquial.

[0003] Machine translation needs to learn translation rules from a large bilingual corpus, so that for a given source-language sentence the translation model can automatically produce a suitable translation. Because most of the bilingual corpora that can be collected consist of sentences with standardized expressions, the trained translation model is better suited to translating source-language sentences with standardized...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/51, G06F40/58
Inventor: 曲宇涛, 张睿卿, 熊皓, 何中军, 李芝
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD