Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Neural machine translation method based on pre-training bilingual word vector

A machine translation and word vector technology, applied in the field of neural machine translation, can solve problems such as the poor initial quality of the word vector matrix, and achieve the effect of improving the effect of machine translation

Inactive Publication Date: 2021-08-24
HARBIN INST OF TECH
View PDF2 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] (2) The word vector matrix of the neural machine translation model is often initialized randomly, and as the training process progresses, the parameters in the word vector matrix are learned and updated. The initial quality of the word embedding matrix is ​​poor and it is easier to overfit during the training update process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural machine translation method based on pre-training bilingual word vector
  • Neural machine translation method based on pre-training bilingual word vector
  • Neural machine translation method based on pre-training bilingual word vector

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0029] The following describes a neural machine translation method based on pre-trained bilingual word vectors according to an embodiment of the present invention with reference to the accompanying drawings.

[0030] figure 1 It is a flowchart of a neural machine translation method based on pre-trained bilingual word vectors according to an embodiment of the present invention.

[0031] Such as figure 1 As shown, the neural machine translation method based on pre-trained bilingual word vectors includes the f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a neural machine translation method based on a pre-training bilingual word vector. The method comprises the following steps of: performing'source language-target language 'splicing on labeled and aligned parallel corpora as input of an XLM (X-Language Model) for pre-training; training: taking a bilingual word vector matrix obtained by pre-training to initialize a translation model; inputting a source language into an encoder, inputting vector representation of source language encoding and a corresponding target language into a decoder, outputting a prediction sequence, comparing the prediction sequence with a corresponding target sequence, calculating a loss value, and inputting the loss value into an optimizer to optimize translation model parameters; predicting: in a certain time step, inputting a source language into an optimized encoder, outputting corresponding vector representation by the encoder, inputting the vector representation and a target language word translated by the previous time step into a decoder, outputting a target word of the time step by the decoder, splicing the target words translated by different time steps according to a time sequence, and obtaining a source language translation result. Machine translation effect of low-resource languages is improved.

Description

technical field [0001] The invention relates to the technical field of neural machine translation, in particular to a neural machine translation method based on deep learning and bilingual word vectors. Background technique [0002] Neural machine translation (Neural machine translation, NMT) is a machine translation technology that introduces artificial neural networks for translation. Compared with traditional statistical machine translation (Statistical Machine Translation, SMT), neural machine translation uses an end-to-end "encoder-decoder" architecture. Specifically, this architecture can be divided into the following three categories: [0003] (1) Neural machine translation model based on recurrent neural network. This model uses Long Short-Term Memory (LSTM) or Gated Recurrent Units (GRU) as the unit to build the "encoder-decoder" architecture. At the same time, it introduces an attention mechanism, so that when each target word is generated, the decoder can focus...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/216G06F40/284G06F40/58G06N3/04G06N3/08
CPCG06F40/58G06F40/216G06F40/284G06N3/08G06N3/047G06N3/048G06N3/045
Inventor 朱聪慧赵铁军刘哲宁曹海龙杨沐昀徐冰
Owner HARBIN INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products