Unlock instant, AI-driven research and patent intelligence for your innovation.

Multidirectional recurrent neural network machine translation model training method and device

A technology of cyclic neural network and machine translation, which is applied in the field of machine translation, can solve problems such as semantic differences and achieve the effect of improving performance

Pending Publication Date: 2021-06-29
XINJIANG UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is a deviation between the semantics contained in the sentences obtained by this method and the features predicted by the decoder.
In addition, the size of the data, the noise contained in the data, and the modeling ability of the model may make the model unable to fully fit the training data, resulting in the semantics contained in the target sentence translated by the model and the semantics contained in the real target sentence. difference

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multidirectional recurrent neural network machine translation model training method and device
  • Multidirectional recurrent neural network machine translation model training method and device

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0016] combine figure 1 and figure 2 , the multi-directional recurrent neural network machine translation model consists of three parts: initial translation, recurrent source translation and recurrent target translation. During training, the characteristics of parallel data are used to regenerate the source sentence sequence and Target sentence sequence and optimize some parameters of the initial translation model by calculating the loss of the reconstructed sentence sequence to improve the performance of the initial translation model. For the context vector of the sentence sequence obtained by different reconstructions and the source of the initial translation model output The sentence context vector at the end or the sentence context vector at the target end calculates the similarity.

[0017] The multi-directional cyclic neural network machine translation model training method comprises the following steps:

[0018] (1) Preprocessing parallel data: word segmentation, bpe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-directional recurrent neural network machine translation model training method and device, and relates to the field of machine translations, the multi-directional recurrent neural network machine translation model comprises initial translation, recurrent source end translation and recurrent target end translation, during training, the characteristics of parallel data are utilized, in a training stage, a source end sentence sequence and a target end sentence sequence are regenerated through a translation model, and part of parameters of an initial translation model are optimized by calculating loss of the sentence sequence generated through reconstruction so as to improve performance in the initial translation model. calculating the similarity between the context vector of the sentence sequence obtained by different reconstructions and the context vector of the source end sentence or the context vector of the target end sentence output by the initial translation model.

Description

technical field [0001] The invention relates to the field of machine translation, in particular to a multi-directional recurrent neural network machine translation model training method and device. Background technique [0002] The current training scheme of the translation model is to provide each sentence with a real target sequence, and use the cross-entropy loss method to approximate the probability distribution of the generated characters to the probability distribution of the characters in the real target sentence, and then use various search strategies to obtain the probability distribution of the characters The probability distribution is processed to get the target sentence. However, there is a deviation between the semantics contained in the sentences obtained by this method and the features predicted by the decoder. In addition, the size of the data, the noise contained in the data, and the modeling ability of the model may make the model unable to fully fit the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/53G06F40/289G06N3/04
CPCG06F40/58G06F40/53G06F40/289G06N3/044
Inventor 艾山·吾买尔宜年早克热·卡德尔张大任韩越刘婉月买合木提·买买提吐尔根·依布拉音汪烈军刘胜全
Owner XINJIANG UNIVERSITY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More