Neural machine translation decoding acceleration method based on non-autoregression

A machine translation and autoregressive technology, applied in the field of neural machine translation inference acceleration, can solve the problem that the decoding speed is difficult to meet the real-time response requirements

Active Publication Date: 2020-07-07
沈阳雅译网络技术有限公司
View PDF7 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] Aiming at the problems of neural machine translation model error propagation during inference and the decoding speed in actual use is difficult to meet the real-time response requirements, the technical problem to be solved by the present invention is to provide a non-autoregressive neural machine translation decoding acceleration method, which can make Large-scale neural machine translation models can have higher response speed and better practical application, while the translation quality of the model can still be guaranteed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural machine translation decoding acceleration method based on non-autoregression
  • Neural machine translation decoding acceleration method based on non-autoregression
  • Neural machine translation decoding acceleration method based on non-autoregression

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0045] The present invention will optimize the reasoning speed of the neural machine translation system based on the attention mechanism from the perspective of non-autoregressive decoding, aiming at greatly improving the decoding speed of the machine translation system while only losing relatively small model performance.

[0046] The present invention proposes a non-autoregressive neural machine translation decoding acceleration method, comprising the following steps:

[0047] 1) Use the Transformer model based on the self-attention mechanism to construct an autoregressive neural machine translation model including an encoder-decoder;

[0048] 2) Construct parallel corpus for training, perform word segmentation and word segmentation preprocessing process, obtain source language sequence and target language sequence, generate machine translation vo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a neural machine translation decoding acceleration method based on non-autoregression. The neural machine translation decoding acceleration method comprises the steps of: constructing an autoregression neural machine translation model through employing a Transformer model based on an autoattention mechanism; constructing training parallel corpora, generating a machine translation word list, and training the two models from left to right and from right to left until convergence; constructing a non-autoregression machine translation model; acquiring codec attention and hidden layer states of a left-to-right autoregression translation model and a right-to-left autoregression translation model; calculating the difference between the output and the output corresponding to the autoregressive model, and taking the difference as additional loss for model training; extracting source language sentence information, and predicting a corresponding target language sentence bya decoder; and calculating the loss of predicted distribution and real data distribution, decoding translation results with different lengths, and further acquiring an optimal translation result. According to the neural machine translation decoding acceleration method, knowledge in the regression model is fully utilized, and 8.6 times of speed increase can be obtained under the condition of low performance loss.

Description

technical field [0001] The invention relates to a neural machine translation inference acceleration method, in particular to a non-autoregressive neural machine translation decoding acceleration method. Background technique [0002] Machine translation is the technique of translating one natural language into another. Machine translation is a branch of natural language processing, one of the ultimate goals of artificial intelligence, and has important scientific research value. At the same time, with the rapid development of Internet technology, machine translation technology has played an increasingly important role in people's daily life and work. [0003] From the rule-based method in the 1970s to the instance-based method in the 1980s, the statistical method in the 1990s, and the neural network-based method today, machine translation technology has achieved good results after years of development. more widely used in daily life. [0004] Although the traditional rule-...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06N3/08
CPCG06N3/08Y02D10/00
Inventor 杨木润朱靖波肖桐张春良
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products