Unlock instant, AI-driven research and patent intelligence for your innovation.

A Mongolian-Chinese machine translation method for placeholder disambiguation based on pointer generation network

A technology of machine translation and network implementation, applied in the field of machine translation

Active Publication Date: 2022-03-18
INNER MONGOLIA UNIV OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] With the emergence of the Transformer model, BERT also appeared. Compared with the previous word embedding method represented by word2vec, the BERT model further increases the generalization ability of the word vector model, fully describing the character level, word level, sentence level and even between sentences. Relational features can model polysemy to a certain extent, but it requires a large amount of data sets, and it has a great impact on Mongolian-Chinese translation, a language with a small corpus

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Mongolian-Chinese machine translation method for placeholder disambiguation based on pointer generation network
  • A Mongolian-Chinese machine translation method for placeholder disambiguation based on pointer generation network
  • A Mongolian-Chinese machine translation method for placeholder disambiguation based on pointer generation network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0015] refer to figure 1 , the present invention is a Mongolian-Chinese neural machine translation method based on a pointer generation network to realize placeholder disambiguation, based on an encoder-decoder architecture, and an auxiliary network and a backbone network are added. The auxiliary network generates a binary gate for each input source word position, dynamically selects the words to focus on, and the backbone network is a pointer generation network with gating mechanism K for attention. The backbone network jointly dynamically selects the auxiliary network that the child elements focus on. When translating, the binary gates generated by the auxiliary network are used to dynamically select the sub-elements of interest to avoid unnecessary weight assignment calculations.

[0016] In the encoding stage, the source text is encoded in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method for Mongolian-Chinese machine translation based on pointer generation network to realize placeholder disambiguation, based on encoder-decoder architecture, is characterized in that it also includes an auxiliary network and a backbone network, and the auxiliary network is a lexical position for each input source Generate a binary gate to dynamically select the vocabulary to be focused on. The backbone network is a pointer generation network with a gating mechanism for attention; in the encoding stage, the source text is encoded into a hidden layer through the word embedding layer of the encoder State, and then the gating mechanism determines whether the information from the current state flows in or is replaced by placeholders; in the decoding stage, the ability to copy source text and generate new vocabulary is used to generate network pointers, and the placeholder context is used for decoding. Before the final data output, additional modules are used to perform linguistic checks to detect possible translation anomalies and mark them, and adjust relevant parameters to achieve the best translation effect.

Description

technical field [0001] The invention belongs to the technical field of machine translation, in particular to a Mongolian-Chinese machine translation method for realizing placeholder disambiguation based on a pointer generation network. Background technique [0002] With the rapid economic development in various regions of the world, more and more attention has been paid to the communication between different languages, and machine translation has emerged accordingly. The development of the modern Internet has driven the upsurge of machine translation research, but the current machine translation has not yet achieved the effect of human translation. [0003] After the three stages of rule-based translation, statistical machine translation, and neural network machine translation, the translation effect has become more and more remarkable, but the problems that have always existed have not been really solved, such as polysemy, grammatical problems, etc., so for improving the ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/58G06N3/04G06N3/08
CPCG06F40/58G06N3/084G06N3/044G06N3/045
Inventor 苏依拉程永坤崔少东张妍彤仁庆道尔吉石宝
Owner INNER MONGOLIA UNIV OF TECH