A Mongolian-Chinese neural machine translation method based on a triangular architecture

A neural machine translation technique, applied in the field of machine translation, that addresses the scarcity of Mongolian-Chinese parallel corpora and, more generally, limited parallel data, with the effect of improving translation quality and stabilizing the training process.

Pending Publication Date: 2019-03-08
INNER MONGOLIA UNIV OF TECH
3 Cites, 4 Cited by

AI Technical Summary

Problems solved by technology

[0005] To overcome the shortcomings of the prior art described above, the object of the present invention is to provide a Mongolian-Chinese neural machine translation method based on a triangular architecture. The method mainly targets the problem of limited parallel corpora for low-resource languages, in particular the scarcity of Mongolian-Chinese parallel corpora. Mongolian (z) is introduced into the translation between English (x) and Chinese (y) as an intermediate hidden variable, so that the translation between English and Chinese is decomposed into two steps via Mongolian.
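The two-step decomposition can be illustrated with a toy calculation. The sketch below is not the patented model: it assumes that y depends on x only through z, and it replaces the neural models with small hand-made probability tables. The function name `pivot_translate_prob` and all sentence labels and numbers are hypothetical.

```python
# Toy illustration only: tiny hand-made probability tables stand in for the
# neural models p(z|x) and p(y|z). All names and numbers are hypothetical.


def pivot_translate_prob(p_z_given_x, p_y_given_z, y):
    """Marginalize over the pivot z: p(y|x) = sum_z p(z|x) * p(y|z).

    Assumes y is conditionally independent of x given z.
    """
    return sum(
        p_z * p_y_given_z[z].get(y, 0.0)
        for z, p_z in p_z_given_x.items()
    )


# p(z|x) for one fixed English sentence x: two candidate Mongolian sentences.
p_z_given_x = {"z1": 0.7, "z2": 0.3}

# p(y|z) for each Mongolian candidate: two candidate Chinese sentences.
p_y_given_z = {
    "z1": {"y1": 0.9, "y2": 0.1},
    "z2": {"y1": 0.2, "y2": 0.8},
}

prob_y1 = pivot_translate_prob(p_z_given_x, p_y_given_z, "y1")
prob_y2 = pivot_translate_prob(p_z_given_x, p_y_given_z, "y2")
```

In a real system the sum over z is intractable, because z ranges over all Mongolian sentences, which is why an approximation such as an EM-style lower bound is needed.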



Examples


Embodiment Construction

[0051] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0052] Problem description: In the Mongolian-Chinese neural machine translation method based on the triangular architecture, a unified bidirectional EM algorithm is used to jointly optimize the Mongolian translation model.

[0053] Let z represent Mongolian, y represent Chinese, and x represent English. The unified two-way generalized EM process is as follows:

[0054] The training of the x→y translation is decomposed into two stages that train two translation models. The first model, x→z, generates a latent translation in z from the input sentence in x; the second model, z→y, generates the final translation in y from that latent translation. Both models use an end-to-end encoder-decoder structure. In addition, following the steps of the standard EM algorithm and Jensen's inequality, the lower bound of p(y|x) on t...
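The bound itself is cut off in this excerpt. As a hedged reminder of the standard form only, assume that y is conditionally independent of x given z (an assumption not stated in the visible text) and let Q(z) be any distribution over Mongolian sentences. Jensen's inequality then gives:

```latex
\begin{aligned}
\log p(y \mid x)
  &= \log \sum_{z} p(z \mid x)\, p(y \mid z) \\
  &= \log \sum_{z} Q(z)\, \frac{p(z \mid x)\, p(y \mid z)}{Q(z)} \\
  &\ge \sum_{z} Q(z)\, \log \frac{p(z \mid x)\, p(y \mid z)}{Q(z)} .
\end{aligned}
```

Equality holds when Q(z) equals the posterior p(z | x, y). The choice of Q(z) made in the patent is not visible in this excerpt, so the symbol Q is a placeholder here.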


Abstract

To improve the relatively underdeveloped state of machine translation for low-resource languages, the invention discloses a Mongolian-Chinese neural machine translation method based on a triangular architecture. First, compared with existing purely end-to-end neural machine translation methods, the method takes full account of the limited parallel corpora available for low-resource languages, in particular the scarcity of Mongolian-Chinese parallel corpora, and improves the quality of Mongolian-Chinese translation despite that scarcity. Second, a unified bidirectional EM algorithm is used to jointly optimize the Mongolian translation model. Finally, the pseudo-samples generated by the model x→z or z→y are mixed with real bilingual samples in the same mini-batch at a ratio of 1:1 to stabilize the training process.
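The 1:1 mixing step can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function `mixed_batches`, the "real"/"pseudo" tags, and the placeholder sentence pairs are all hypothetical, and the batch size is assumed to be even.

```python
import random


def mixed_batches(real_pairs, pseudo_pairs, batch_size, seed=0):
    """Yield mini-batches holding equal numbers of real and pseudo pairs.

    Hypothetical helper: each item is ((source, target), tag), where tag is
    "real" or "pseudo". Leftover pairs that cannot fill a batch are dropped.
    """
    if batch_size % 2 != 0:
        raise ValueError("batch_size must be even for a 1:1 mix")
    half = batch_size // 2
    rng = random.Random(seed)
    real = list(real_pairs)
    pseudo = list(pseudo_pairs)
    rng.shuffle(real)
    rng.shuffle(pseudo)
    n_batches = min(len(real), len(pseudo)) // half
    for i in range(n_batches):
        batch = [(pair, "real") for pair in real[i * half:(i + 1) * half]]
        batch += [(pair, "pseudo") for pair in pseudo[i * half:(i + 1) * half]]
        rng.shuffle(batch)  # interleave real and pseudo samples in the batch
        yield batch


# Placeholder data: real bilingual pairs and model-generated pseudo pairs.
real = [(f"src{i}", f"tgt{i}") for i in range(6)]
pseudo = [(f"src{i}", f"generated{i}") for i in range(8)]
batches = list(mixed_batches(real, pseudo, batch_size=4))
```

Keeping both kinds of sample in every mini-batch means each gradient step sees real data alongside the noisier generated data, which is one plausible reading of why the mix stabilizes training.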

Description

Technical field

[0001] The invention belongs to the technical field of machine translation, and in particular relates to a Mongolian-Chinese neural machine translation method based on a triangular architecture.

Background technique

[0002] Machine translation, which uses computers to automatically translate one language into another, is one of the most powerful means of overcoming language barriers. In recent years, many large search companies and service providers, such as Google and Baidu, have conducted large-scale research on machine translation and made important contributions to improving its quality, so translation between major languages is approaching human level, and millions of people communicate across language barriers using online translation systems and mobile apps. In the recent wave of deep learning, machine translation has become a top priority and an important part of prom...


Application Information

IPC(8): G06F17/28
CPC: G06F40/42, G06F40/58
Inventor: 苏依拉, 孙晓骞, 王宇飞, 高芬, 张振, 牛向华, 赵亚平
Owner: INNER MONGOLIA UNIV OF TECH