Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Zero parallel corpus multi-modal neural machine translation method

A machine translation and parallel corpus technology, applied in the field of machine translation, can solve the problems of low-quality training data, limited effect of picture description models, affecting the quality of neural machine translation models, etc., and achieve the effect of accurate translation results.

Active Publication Date: 2019-09-17
UNIV OF SCI & TECH OF CHINA
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the above method can realize the training of the neural machine translation model by constructing parallel corpus, due to the limited effect of the picture description model, the wrong description generated by it will lead to the formation of low-quality parallel corpus training data, thereby affecting the trained neural machine translation model quality, resulting in less accurate translation results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Zero parallel corpus multi-modal neural machine translation method
  • Zero parallel corpus multi-modal neural machine translation method
  • Zero parallel corpus multi-modal neural machine translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0015] An embodiment of the present invention provides a zero-parallel corpus multimodal neural machine translation method, such as figure 1 As shown, it mainly includes the following steps:

[0016] Step 11, pre-training the pre-built neural machine translation model by using the monolingual corpus of the source language and the target language with corresponding picture information.

[0017] In the embodiment of the present invention, the monoli...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a zero parallel corpus multi-modal neural machine translation method, which comprises the following steps: pre-training a pre-constructed neural machine translation model by utilizing a source language with corresponding picture information and a target language monolingual corpus; translating sentences in the source language into sentences in a target language by using a pre-trained neural machine translation model; calculating a reward value of a sentence level according to an internal relation between the sentence obtained by translation and a corresponding picture, and updating pre-trained neural machine translation model parameters by adopting a strategy gradient reinforcement learning method by taking the maximum expected total reward value as an optimization target so as to obtain a trained neural machine translation model; and translating the given source language sentence by using the trained neural machine translation model. According to the zero parallel corpus multi-modal neural machine translation method, a neural machine translation model of a zero resource language pair can be established by utilizing the internal relation between pictures and characters.

Description

technical field [0001] The invention relates to the technical field of machine translation, in particular to a zero-parallel corpus multi-modal neural machine translation method. Background technique [0002] End-to-end neural machine translation (hereinafter referred to as neural machine translation) has achieved rapid development both at home and abroad in recent years. Neural machine translation abandons the cumbersome structure and complex feature design in statistical machine translation, and directly sends parallel corpus to the neural network to complete the training of a complete translation system. [0003] In neural machine translation, even the simplest single-layer model has a large number of parameters (usually tens of millions of parameters) to optimize, while training a complex model with a large number of parameters in machine learning requires a lot of training data. As the model design of neural machine translation becomes more and more complex, and the d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/28G06N3/04G06N3/08
CPCG06N3/04G06N3/08G06F40/58
Inventor 陈恩红刘淇王怡君魏天心
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products