Neural machine translation decoding acceleration method based on discrete variables

A machine translation, discrete technology, applied in the field of neural machine translation decoding acceleration based on discrete variables, can solve problems such as the inability to take advantage of low-precision numerical calculations

Active Publication Date: 2020-07-07
沈阳雅译网络技术有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In view of the fact that the machine translation method in the prior art relies too much on single-precision and double-precision floating point and cannot take advantage of low-precision numerical operations, the technical problem to be solved by the present invention is to provide a neural machine translation decoding acceleration method based on discrete variables , making full use of the natural advantages of low computational complexity of fixed-point numbers, based on the latest implementation of fast reasoning, and on the premise of almost no decline in model performance, the real-time response speed can be improved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural machine translation decoding acceleration method based on discrete variables
  • Neural machine translation decoding acceleration method based on discrete variables
  • Neural machine translation decoding acceleration method based on discrete variables

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0050] A kind of neural machine translation decoding acceleration method based on discrete variables of the present invention comprises the following steps:

[0051] 1) Construct a training parallel corpus and a neural machine translation model based on the attention mechanism, use the parallel corpus to generate a machine translation vocabulary, the decoder decodes and generates target sentences according to the extracted information, and continuously updates the model parameters so that the generated target sentences are consistent with the real The translation results are closer, and the neural machine translation model training process is completed; the model parameters after training convergence are used as the baseline system;

[0052] 2) In the baseline system, by scaling the single-precision floating-point parameters in the model, the parame...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a neural machine translation decoding acceleration method based on discrete variables. The neural machine translation decoding acceleration method comprises the steps of: constructing a neural machine translation model which is used for training parallel corpora and based on an attention mechanism, and taking a model parameter after training convergence as a baseline system; acquiring a scaling factor (scale) of each tensor in the baseline system through adopting a quantization method, and initializing a unified scaling factor (base_scale) for the whole model; carryingout operation on a neural machine translation model; calculating to obtain a common scaling factor for the scaling factors from different operations before carrying out an addition operation, so as toensure that the magnitudes of the parameters are consistent; and performing inverse quantization on the output of the neural machine translation model, sending a model output result to a normalization function, and acquiring a final translation result. By utilizing the natural advantage of low operation complexity of a fixed point number, the real-time corresponding speed is improved on the premise that the model performance is not degraded on the basis of the latest implementation of rapid reasoning.

Description

technical field [0001] The invention relates to a neural machine translation decoding acceleration technology, in particular to a neural machine translation decoding acceleration method based on discrete variables. Background technique [0002] Machine translation (Machine Translation) is the use of computer programs to translate a natural language into another natural language, which belongs to the category of computational linguistics. In 1949, Warren Weaver published a memorandum titled "Translation", which marked that machine translation based on modern computers officially entered the stage of history. Machine translation not only involves human cognition of its own language and way of thinking, but also involves many fields such as artificial intelligence, information theory, knowledge engineering, and software engineering. It is a discipline that intersects multiple technologies in depth. In the past ten years, the research and industrialization of machine translatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/42G06F40/58G06N3/04
CPCG06N3/044Y02D10/00
Inventor 杜权朱靖波肖桐张春良
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products