
Multimodal machine translation method, device, electronic device and storage medium

A multimodal machine translation technology in the computer field that addresses the lack of interpretability in translation performance improvements, with the effect of improving interpretability, reducing complexity, and producing good translation results.

Active Publication Date: 2021-07-27
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a multimodal machine translation method, device, electronic device and storage medium. They address the prior-art defect that improvements in translation performance lack interpretability, and make the improvement in translation quality interpretable.




Embodiment Construction

[0040] To make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0041] To give the multimodal translation model better interpretability while improving translation quality, each embodiment of the present application abandons the sentence-level multimodal fusion method adopted in the prior art and introduces a multimodal machine translation method with cross-modal information fusion at the entity level, which only integrates the corr...



Abstract

The present invention provides a multimodal machine translation method, device, electronic device and storage medium. The method includes: determining a source language text to be translated; and inputting the source language text into a translation model to obtain the target language text output by the translation model. The translation model is trained jointly with a reconstruction model on sample source language text, sample target language text, and sample images matching the sample source language text. The translation model and the reconstruction model share a feature coding layer. During model training, the feature coding layer encodes a first sequence and a second sequence; the translation model translates based on the encoding of the first sequence, and the reconstruction model reconstructs based on the encoding of the second sequence. The first sequence is determined based on the sample source language text, and the second sequence is determined based on the region images, in the sample image, of the entities in the sample source language text together with the non-entities of the sample source language text. The method makes the improvement in translation quality more interpretable and reduces the complexity of translation.
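The relationship between the two training sequences can be illustrated with a small sketch. It is not the patented implementation: the names `RegionImage`, `build_sequences` and `shared_encode` are hypothetical, entities are assumed to be single tokens with a known bounding box, the first sequence is assumed to be the plain token sequence, and the "encoder" is a stand-in that only tags each element with its modality.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels, assumed format


@dataclass(frozen=True)
class RegionImage:
    """Hypothetical placeholder for the image region matched to one entity."""
    entity: str
    box: Box


Element = Union[str, RegionImage]


def build_sequences(
    tokens: List[str], entity_regions: Dict[str, Box]
) -> Tuple[List[str], List[Element]]:
    """Build the two training sequences from one sample.

    first:  the source tokens themselves (assumption: text only).
    second: the same positions, but each entity token is replaced by its
            region image, while non-entity tokens stay as text.
    """
    first = list(tokens)
    second: List[Element] = [
        RegionImage(tok, entity_regions[tok]) if tok in entity_regions else tok
        for tok in tokens
    ]
    return first, second


def shared_encode(sequence: List[Element]) -> List[Tuple[str, Element]]:
    """Stand-in for the shared feature coding layer.

    A real layer would produce vectors; here each element is only tagged
    with its modality so the data flow is visible.
    """
    return [
        ("image" if isinstance(el, RegionImage) else "text", el)
        for el in sequence
    ]


tokens = ["a", "dog", "chases", "a", "ball"]
regions = {"dog": (10, 20, 110, 140), "ball": (150, 90, 190, 130)}
first, second = build_sequences(tokens, regions)

translation_input = shared_encode(first)       # consumed by the translation model
reconstruction_input = shared_encode(second)   # consumed by the reconstruction model
```

In this sketch both sequences pass through the same encoding function, mirroring the shared feature coding layer, and only entity positions carry visual information, which reflects the entity-level rather than sentence-level fusion described in the text.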

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a multimodal machine translation method, device, electronic device and storage medium.

Background art

[0002] Multimodal machine translation refers to using modal information other than text to assist text translation, for example using images to help improve the translation quality of image descriptions. The premise of this approach is that images contain more complete information than single sentences.

[0003] General multimodal machine translation models are designed for a multimodal setting that combines the text modality and the static image modality. The data consist of an image, a one-sentence description of the image, and the translation of that description. The semantic fusion methods adopted can generally be divided into the following two categories: one is to input visual information into the translation system as the context of the sente...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/58, G06F40/295, G06F40/126, G06K9/00
CPC: G06F40/58, G06F40/295, G06F40/126, G06V30/40
Inventor: 宗成庆, 黄鑫, 张家俊, 周玉
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI