Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mongolian-Chinese neural machine translation method based on multiple constraint terms

A machine translation and multi-constraint technology, applied in the field of machine translation, can solve problems such as inconsistency in evaluation indicators, semantic loss, and training sample distribution deviation in the training and reasoning phase, and achieve the effect of improving readability

Pending Publication Date: 2022-07-29
INNER MONGOLIA UNIV OF TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the translation model of the prior art, during the training process, due to the lack of Mongolian resources, it is easy to generate training sample distribution deviation and semantic loss, the model training efficiency is not high, and it is easy to cause training sample distribution deviation and inconsistent evaluation indicators in the training inference stage question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mongolian-Chinese neural machine translation method based on multiple constraint terms
  • Mongolian-Chinese neural machine translation method based on multiple constraint terms
  • Mongolian-Chinese neural machine translation method based on multiple constraint terms

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The embodiments of the present invention will be described in detail below with reference to the accompanying drawings and examples.

[0068] First, a brief introduction to the Mongolian-Chinese neural machine translation process:

[0069] (1) Mongolian-Chinese neural machine translation

[0070] Principle: Mongolian-Chinese neural machine translation uses a deep learning training model to convert Mongolian (or Cyrillic Mongolian) text into Chinese text. The principle is to preprocess the Mongolian text corpus data through denoising and segmentation. The corresponding embedding vectorized representation is better represented and feature extraction in the model. The model is guided by the selected machine learning algorithm to establish the data mapping relationship (model parameters) based on the bilingual (Mongolian-Chinese), and then This mapping relationship is gradually accurate through multiple iterative training, and finally produces a good generalization ability...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A Mongolian-Chinese neural machine translation method based on multiple constraint terms comprises the steps that firstly, a model training process based on reinforcement learning is constructed for Mongolian-Chinese neural machine translation tasks, and then constraint conditions are further improved for a training optimization target on the basis of a reinforcement model; comprising the following steps: adding a semantic constraint module to relieve the problem of poor translation fluency caused by a single BLEU value evaluation system; performing parameter constraint on the training process to improve the efficiency of model training; and performing word list constraint on the corpus to reduce the number of unregistered words in the translation. According to the method, by adjusting the overall constraint mode, the problems that in a low-resource Mongolian-Chinese machine translation task, a model is poor in sequence structure analysis capacity and low in training efficiency are solved, and meanwhile the problem that variance is large due to intensive training is solved; according to the method, a mean value reward and pruning column search method is adopted to effectively relieve the negative influence caused by the enhanced training.

Description

technical field [0001] The invention belongs to the technical field of machine translation, in particular to a Mongolian-Chinese neural machine translation method based on multiple constraints. Background technique [0002] When using a deep learning model to process Mongolian texts, other indicators other than the standard BLEU value need to be trained and constrained to ensure the quality of Mongolian translations with scarce resources. [0003] In the translation model of the prior art, in the training process, due to the lack of Mongolian language resources, it is easy to generate training sample distribution deviation and semantic loss, the model training efficiency is not high, and it is easy to lead to inconsistent training sample distribution deviation and evaluation indicators in the training inference stage. question. SUMMARY OF THE INVENTION [0004] In order to overcome the shortcomings of the above-mentioned prior art, the purpose of the present invention is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06N3/04G06N3/08
CPCG06F40/58G06N3/049G06N3/08G06N3/045
Inventor 吉亚图李佳根巴音图师磊樊静苏依拉仁庆道尔吉
Owner INNER MONGOLIA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products