Check patentability & draft patents in minutes with Patsnap Eureka AI!

Multi-language-based neural machine translation model

A machine translation and translation model technology, applied in the field of neural machine translation, can solve problems such as data imbalance, high similarity, and weak language correlation, and achieve the effect of improving quality and reducing the number of parameters

Pending Publication Date: 2021-06-29
XINJIANG UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The multilingual corpus in the network is very rare, and most of the multilingual parallel corpus has the problem of data imbalance, and the language correlation in the multilingual parallel corpus is not strong, so the Russian-Russian-Uzbek-Uyghur language - English-Chinese multilingual parallel corpus, the similarity between the five languages ​​is high and the amount of data is equal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language-based neural machine translation model

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0028] like figure 1 As shown, the multilingual-based neural machine translation model includes a multilingual data set construction process, a multilingual translation system, and a multilingual translation model operation process. Artificially construct multilingual parallel corpus and multilingual neural machine translation model; the former mainly constructs Russian-Uzbek-Uyghur-English-Chinese multilingual parallel corpus, with the help of existing bilingual data, and translates it using translation tools such as Mavericks and Google The other 3 languages, and build multilingual parallel corpus by calculating similarity and manual screening; the latter builds a multilingual neural machine translation model based on the transformer framework, and uses the constructed data set to train the multilingual translation model.

[0029] The multilingual dataset construction process includes the following steps:

[0030] (1) English-Russian, Chinese-Russian, English-Chinese, Chine...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-language-based neural machine translation model, and relates to the field of neural machine translation, and the multi-language-based neural machine translation model comprises a multi-language data set construction process, a multi-language translation system and a multi-language translation model operation process. A multi-language parallel corpus and a multi-language neural machine translation model are manually constructed; the multi-language parallel corpus construction method mainly constructs multi-language parallel corpora of Russian-Uzburk-Uygur-English-Chinese, by means of existing bilingual data, other three languages are obtained through translation by means of translation tools such as xiaoniu and google, and the multi-language parallel corpora are constructed through similarity calculation and manual screening. The multi-language neural machine translation model is constructed on the basis of a transform framework, and the constructed data set is used for training the multi-language translation model.

Description

technical field [0001] The invention relates to the field of neural machine translation, in particular to a multilingual-based neural machine translation model. Background technique [0002] Nowadays, there are artificially constructed multilingual parallel corpora and multilingual neural machine translation models; the former mainly constructs Russian-Uzbek-Uyghur-English-Chinese multilingual parallel corpus, with the help of existing bilingual data, using Mavericks and Google, etc. The translation tool translates the other three languages, and constructs multilingual parallel corpora through calculation of similarity and manual screening; the latter builds a multilingual neural machine translation model based on the transformer framework, and uses the constructed data set to train the multilingual translation model. The multilingual corpus in the network is very rare, and most of the multilingual parallel corpus has the problem of data imbalance, and the language correlati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/58G06F40/126G06N3/04
CPCG06F40/58G06F40/126G06N3/045
Inventor 艾山·吾买尔刘婉月帕力旦·吐尔逊早克热·卡德尔宜年汪烈军刘胜全
Owner XINJIANG UNIVERSITY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More