Multi-domain neural machine translation method based on self-attention mechanism

A machine translation and attention technology, applied in the field of neural machine translation, which can solve problems such as ambiguity, translation performance being limited by the match between training and test data, and high maintenance costs, and achieves the effect of improving translation quality.

Pending Publication Date: 2019-07-26
SUZHOU UNIV

AI Technical Summary

Problems solved by technology

[0005] The first problem is a well-known problem in domain adaptation for MT: translation performance is limited by how well the training data matches the test data.
[0006] The second problem is more practical: because the model is inflexible, multi-domain translation scenarios require multiple domain-specific systems, and each time a new domain is involved, dedicated domain-specific data must be used to retrain a model. This lacks architectural scalability and incurs high maintenance costs.




Embodiment Construction

[0050] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments, so that those skilled in the art can better understand the present invention and implement it, but the examples given are not intended to limit the present invention.

[0051] Background: Transformer model for neural machine translation based on self-attention

[0052] The Transformer is a fully attention-based encoder-decoder model. It has two significant advantages over attention-based recurrent neural network models for neural machine translation: 1) it completely avoids recurrence and maximizes parallel training; 2) by exploiting the attention mechanism, it can capture long-distance dependencies.
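As a hedged illustration of these two advantages, the following minimal sketch computes single-head scaled dot-product self-attention in PyTorch: every position attends to every other position in one matrix product, with no recurrence. The shapes and weight matrices are illustrative and not taken from the patent.

```python
import torch
import torch.nn.functional as F

def self_attention(x, W_q, W_k, W_v):
    # x: (batch, seq_len, d_model); W_*: (d_model, d_model) projection matrices
    Q, K, V = x @ W_q, x @ W_k, x @ W_v
    d_k = K.size(-1)
    # every position attends to every other position in one matrix product,
    # so long-distance dependencies are modeled directly and in parallel
    scores = Q @ K.transpose(-2, -1) / d_k ** 0.5   # (batch, seq_len, seq_len)
    weights = F.softmax(scores, dim=-1)             # attention distribution
    return weights @ V                              # context vectors

d_model = 8
x = torch.randn(2, 5, d_model)                      # 2 sentences, 5 tokens each
W_q, W_k, W_v = (torch.randn(d_model, d_model) for _ in range(3))
out = self_attention(x, W_q, W_k, W_v)              # (2, 5, 8)
```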

[0053] 1. Encoder

[0054] In the encoder, several identical layers are stacked on top of each other. Each layer contains two sublayers, a multi-head attention layer and a position-wise feed-forward layer, which are connected with residual connections followed by layer normalization.
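For concreteness, a minimal sketch of one such encoder layer follows, built from standard PyTorch modules; the hyperparameters (d_model=512, 8 heads, feed-forward size 2048) are the usual Transformer defaults and are assumptions, not values taken from the patent.

```python
import torch
import torch.nn as nn

class EncoderLayer(nn.Module):
    def __init__(self, d_model=512, n_heads=8, d_ff=2048):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(                    # position-wise feed-forward
            nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x):
        # sublayer 1: multi-head self-attention, residual connection + layer norm
        attn_out, _ = self.self_attn(x, x, x)
        x = self.norm1(x + attn_out)
        # sublayer 2: position-wise feed-forward, residual connection + layer norm
        return self.norm2(x + self.ffn(x))

layer = EncoderLayer()
h = layer(torch.randn(2, 10, 512))                   # (batch, seq_len, d_model)
```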



Abstract

The invention discloses a multi-domain neural machine translation method based on a self-attention mechanism. The method makes two important changes to the Transformer. The first change is a domain-aware self-attention mechanism: a domain representation is added to the key and value vectors of the original self-attention mechanism, so that the attention weight is the degree of correlation between the query and the domain-aware keys. The second change is to add a domain representation learning module that learns a domain vector. The beneficial effects of the method are that a domain-aware NMT model architecture is provided on the basis of the Transformer, the neural network architecture currently representing the state of the art, and a domain-aware self-attention mechanism is proposed for multi-domain translation. To the best of our knowledge, this is the first attempt at multi-domain NMT based on a self-attention mechanism. Experiments and analysis also verify that the model can significantly improve the translation quality of each domain and can learn the domain information of the training data.
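A hedged sketch of the domain-aware self-attention idea summarized above follows: a learned domain vector is added to the key and value vectors, so that the attention weights reflect the correlation between the queries and the domain-aware keys. The domain representation learning module is modeled here as a simple embedding table, which is an assumption; the exact formulation in the patent may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DomainAwareSelfAttention(nn.Module):
    def __init__(self, d_model=512, num_domains=4):
        super().__init__()
        self.W_q = nn.Linear(d_model, d_model)
        self.W_k = nn.Linear(d_model, d_model)
        self.W_v = nn.Linear(d_model, d_model)
        # domain representation learning module, modeled here as an
        # embedding table with one learnable vector per domain (assumption)
        self.domain_embed = nn.Embedding(num_domains, d_model)

    def forward(self, x, domain_id):
        # x: (batch, seq_len, d_model); domain_id: (batch,) long tensor
        d = self.domain_embed(domain_id).unsqueeze(1)   # (batch, 1, d_model)
        Q = self.W_q(x)
        K = self.W_k(x) + d                             # domain-aware keys
        V = self.W_v(x) + d                             # domain-aware values
        # attention weight = correlation of the query with the domain-aware keys
        scores = Q @ K.transpose(-2, -1) / K.size(-1) ** 0.5
        return F.softmax(scores, dim=-1) @ V

attn = DomainAwareSelfAttention()
out = attn(torch.randn(2, 7, 512), torch.tensor([0, 2]))  # (2, 7, 512)
```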

Description

Technical field

[0001] The invention relates to the field of neural machine translation, and in particular to a multi-domain neural machine translation method based on a self-attention mechanism.

Background technique

[0002] With the improvement of computer computing power and the application of big data, deep learning has been further applied, and neural machine translation (NMT) based on deep learning has attracted more and more attention. In the field of NMT, the current state-of-the-art neural network architecture, the Transformer, is an attention-based encoder-decoder model. The main idea is to encode the sentence to be translated (hereinafter referred to as the "source sentence") into a vector representation through an encoder, and then use a decoder to decode the vector representation of the source sentence into its corresponding translation (hereinafter referred to as the "target sentence").

[0003] Neural machine translation has made significant...
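To illustrate the encoder-decoder idea described in [0002], the following minimal sketch encodes a source sentence into vector representations and then generates a target sentence token by token with greedy decoding. It uses PyTorch's generic nn.Transformer, assumes a start-of-sentence id of 0, and omits positional encodings and training; it is a simplified illustration, not the patent's implementation.

```python
import torch
import torch.nn as nn

d_model, vocab_size = 512, 32000
embed = nn.Embedding(vocab_size, d_model)
model = nn.Transformer(d_model=d_model, batch_first=True)  # encoder-decoder
out_proj = nn.Linear(d_model, vocab_size)

src = torch.randint(0, vocab_size, (1, 9))       # source sentence (token ids)
memory = model.encoder(embed(src))               # encode source into vectors

tgt = torch.zeros(1, 1, dtype=torch.long)        # assumed start-of-sentence id 0
for _ in range(20):                              # greedy decoding, step by step
    dec = model.decoder(embed(tgt), memory)
    next_id = out_proj(dec[:, -1]).argmax(-1, keepdim=True)
    tgt = torch.cat([tgt, next_id], dim=1)       # append the predicted token
```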


Application Information

IPC(8): G06F17/28, G06F17/22, G06F16/33
CPC: G06F16/334, G06F40/12, G06F40/58, Y02D10/00
Inventor: 熊德意, 张诗奇
Owner: SUZHOU UNIV