Mongolian-Chinese non-autoregressive machine translation method based on knowledge graph

A knowledge graph and machine translation technology, applied in the field of Mongolian-Chinese non-autoregressive machine translation based on knowledge graph, can solve problems such as the inability of the decoder to decode in parallel, the lack of target sequence dependencies, and the translation effect not reaching the ideal state of researchers.

Pending Publication Date: 2021-11-16
INNER MONGOLIA UNIV OF TECH
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the Transformer model itself has: Exposure Bias (error transitivity) and the situation that the decoder cannot decode in parallel, which greatly wastes the computing resources of the GPU, and makes the translation effect not reach the ideal state of the researcher.
[0003] Considering that the existing autoregressive machine translation model cannot make full use of the computing power of the GPU, based on this, a non-autoregressive machine translation model (NAT) is proposed, which can make full use of the computing power of the GPU and increase the translation speed and efficiency by 7.2 about times
However, the non-autoregressive machine translation model itself also causes continuous repeated translation and missed translation due to the excessively fast translation. inter-sequence dependencies

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mongolian-Chinese non-autoregressive machine translation method based on knowledge graph
  • Mongolian-Chinese non-autoregressive machine translation method based on knowledge graph
  • Mongolian-Chinese non-autoregressive machine translation method based on knowledge graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Embodiments of the present invention will be described in detail below with reference to the accompanying drawings and examples.

[0026] First, the present invention is to come up with a pair of parallel sentences used in the corpus "Two villages across the river" as an example the following process.

[0027] like figure 1 , The present invention provides a non-Mongolian and Chinese-based mapping knowledge autoregressive machine translation method, comprising the steps of:

[0028] Step 1, Mongolian and Chinese bilingual named entity constructing semantic web by mapping knowledge triples, where the unknown word is a named entity represents a portion of the knowledge map triples. The present invention is directed to named entities alignment problems introduced triplet mapping knowledge build named entity named entity formed in the center of the Semantic Web context, use can be well aligned named entity context information.

[0029] On the basis of mutual information feat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A Mongolian-Chinese non-autoregressive machine translation method based on a knowledge graph comprises the following steps: constructing a Mongolian-Chinese bilingual named entity semantic web through a knowledge graph triple, and constructing a named entity dictionary based on multi-feature alignment; then representing Mongolian rare word composition distribution by means of a knowledge graph triple, performing rare word segmentation, performing interactive enhanced generative adversarial training in the process, and adding interactive information to guide generative adversarial to obtain a comprehensive rare word knowledge graph triple set; performing knowledge distillation on the processed Mongolian-Chinese parallel corpus, and constructing a knowledge graph of the Mongolian-Chinese parallel corpus when the teacher model extracts corpora for the student model through knowledge distillation; when non-autoregressive machine translation is carried out, adopting a knowledge graph as a projection matrix training set, obtainng a projection matrix through bidirectional embedding mapping joint training, and carrying out decoding information retouching. According to the method, the translation quality of machine translation can be improved on the premise that the translation rate is improved.

Description

Technical field [0001] The present invention belongs to the technical field of machine translation, Mongolian and Chinese in particular, it relates to non-knowledge based machine translation map autoregressive method. Background technique [0002] Machine translation (MT) is the use of the computer will automatically translate one language into another language, and the conversion process to maintain the same meaning. Research existing machine translation model based on multi-regression translations from machine translation model (AT) study, for example, it is now the hotspot model Transformer, appears the model of its efficient coding efficiency, the effect of superimposed layers of the attention of quality improved significantly. But the Transformer model itself, there are: Exposure Bias (transitive error) and the decoder can not decode the parallel case, which greatly waste of GPU computing resources, so that the translation effect of less than ideal state researchers. [0003...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/58G06F40/44G06F16/36G06F40/242G06F40/289G06N5/02
CPCG06F40/58G06F40/44G06F16/367G06F40/242G06F40/289G06N5/02Y02D10/00
Inventor 苏依拉程永坤王涵张妍彤仁庆道尔吉吉亚图
Owner INNER MONGOLIA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products