Neural Network Mongolian-Chinese
Machine Translation Method Based on
Encoder-Decoder is provided. The method comprises the following steps of using an
encoder e and two-layer decoders d1 and d2, encoding the Mongolian source language into a vector
list by the
encoder E, Then, at the
hidden layer of the
encoder, adopting a retrospective step with attention mechanism, In the decoding process, obtaining the implied state before softmax and the draft
sentence by the decoder D1, and then taking the implied state of the encoder E and the decoder D1 as the input of the decoder D2 to obtain the secondchannel sequence, i.e. The final translation. At first, that Chinese corpus is divided into words in the preprocess stage, The Mongolian-Chinese
bilingual corpus is segmented into stem, affixes and cases, and the Mongolian-Chinese
bilingual corpus is segmented into word segments (BPE), which can effectively refine the translation
granularity and reduce the number of unknown words, and then the Mongolian-
Chinese word vector is constructed by Word2vec. For unknown words, a Mongolian-Chinese dictionary of proprietary vocabulary is also constructed, which can effectively improve the quality of translation.