Multi-modal machine translation method and device, electronic equipment and storage medium

A multimodal machine translation technology, applied in the computer field, that addresses the lack of interpretability in translation performance gains and achieves the effect of improving interpretability, improving translation performance, and reducing complexity.

Active Publication Date: 2021-05-14
INST OF AUTOMATION CHINESE ACAD OF SCI
Cites: 9. Cited by: 8.

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a multi-modal machine translation method, device, electronic equipment and storage medium. They address a defect of the prior art, in which improvements in translation performance lack interpretability, and make the improvement in translation quality interpretable.

Examples

Embodiment Construction

[0040] To make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some of the embodiments of the present invention, but not all of them. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0041] So that the multimodal translation model is more interpretable while also improving translation quality, each embodiment of the present application abandons the sentence-level multimodal fusion adopted in the prior art and introduces a multi-modal machine translation method based on entity-level cross-modal information fusion, which only integrates the corr...

Abstract

The invention provides a multi-modal machine translation method and device, electronic equipment and a storage medium. The method comprises: determining a source language text to be translated; and inputting the source language text into a translation model to obtain the target language text output by the translation model. The translation model is trained jointly with a reconstruction model on a sample source language text, a sample target language text, and a sample image matched with the sample source language text. The translation model and the reconstruction model share a feature coding layer, which encodes a first sequence and a second sequence during model training. The translation model translates based on the encoding of the first sequence, and the reconstruction model reconstructs based on the encoding of the second sequence. The first sequence is determined from the sample source language text. The second sequence is determined from the region image, within the sample image, of each entity in the sample source language text, together with the non-entity parts of the sample source language text. This makes the improvement in translation quality more interpretable and reduces translation complexity.
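
As a rough illustration of the two training sequences described in the abstract, the following Python sketch builds a text-only first sequence and a mixed second sequence in which entity tokens are replaced by their image regions. It is a minimal sketch under stated assumptions, not the patented implementation: the names `TextToken`, `ImageRegion`, `build_first_sequence` and `build_second_sequence` are hypothetical, each entity is assumed to be a single token, and region features are assumed to be precomputed vectors supplied by the caller.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union


@dataclass(frozen=True)
class TextToken:
    """A source-language token kept in textual form (hypothetical type)."""
    text: str


@dataclass(frozen=True)
class ImageRegion:
    """Image-region feature standing in for an entity token (hypothetical type)."""
    entity: str
    feature: Tuple[float, ...]


SequenceItem = Union[TextToken, ImageRegion]


def build_first_sequence(tokens: List[str]) -> List[SequenceItem]:
    """First sequence: the sample source language text only."""
    return [TextToken(t) for t in tokens]


def build_second_sequence(
    tokens: List[str],
    entity_regions: Dict[int, Tuple[float, ...]],
) -> List[SequenceItem]:
    """Second sequence: entity positions carry their region feature,
    non-entity positions keep the original text token.

    `entity_regions` maps a token index to a region feature. How entities
    are detected and grounded in the image is outside this sketch.
    """
    sequence: List[SequenceItem] = []
    for index, token in enumerate(tokens):
        if index in entity_regions:
            sequence.append(ImageRegion(entity=token, feature=entity_regions[index]))
        else:
            sequence.append(TextToken(token))
    return sequence


# Example: "dog" and "ball" are assumed to be grounded entities.
tokens = ["a", "dog", "chases", "a", "ball"]
regions = {1: (0.1, 0.9), 4: (0.7, 0.2)}  # made-up feature vectors
first = build_first_sequence(tokens)
second = build_second_sequence(tokens, regions)
```

In such a setup, both sequences would pass through the shared feature coding layer during training; only the first sequence is needed at translation time, which is consistent with the abstract's statement that translation takes the source language text alone as input.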

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a multimodal machine translation method, device, electronic equipment and storage medium.

Background technique

[0002] Multimodal machine translation refers to the use of modal information other than text to help text translation, such as using images to help improve the translation quality of image descriptions. The premise of this approach is that images contain more complete information than single sentences.

[0003] The general multi-modal machine translation model is designed for a multi-modal setting that combines the text modality and the static image modality. The data take the form of a picture, a one-sentence description of the picture, and the translation of that description. The semantic fusion methods adopted can generally be divided into the following two categories: one is to input visual information into the translation system as the context of the sente...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/58, G06F40/295, G06F40/126, G06K9/00
CPC: G06F40/58, G06F40/295, G06F40/126, G06V30/40
Inventor: 宗成庆, 黄鑫, 张家俊, 周玉
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI