Unlock instant, AI-driven research and patent intelligence for your innovation.

Machine translation method applied to stable deep machine translation model

A machine translation and model technology, applied in the field of machine translation model training, can solve the problems of aggravating gradient disappearance and gradient explosion, affecting the accuracy of calculation, and achieve the effect of reducing uncertainty, easy reproducibility, and good versatility

Pending Publication Date: 2022-05-24
沈阳雅译网络技术有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the use of half-precision number calculations will inevitably affect the calculation accuracy to a certain extent, and because there are calculation errors in the calculation of half-precision numbers, when using half-precision numbers for training, it will also aggravate the neural network during reverse calculation. Vanishing gradient and exploding gradient problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine translation method applied to stable deep machine translation model
  • Machine translation method applied to stable deep machine translation model
  • Machine translation method applied to stable deep machine translation model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention is further elaborated below in conjunction with the accompanying drawings of the specification.

[0036] A machine translation method of the present invention applied to a stable deep machine translation model, comprising the following steps:

[0037] 1) In the training process of the deep model, when performing each backpropagation calculation according to the input source statement sub-X, set two matrix μ and σ for the gradient matrix G of the encoder part of the parameters in the deep model;

[0038] 2) Using the matrix μ and σ represent the mean and standard deviation of gradient matrix G respectively, and calculate the values of matrix μ and σ according to gradient matrix G;

[0039] 3) In each backpropagation process during model training, the gradient matrix G is standardized using matrix μ and σ;

[0040] 4) Control the use stage and position of this standardization through preset strategies and prior knowledge, realize the rapid and accurate t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a machine translation method applied to a stable deep machine translation model, which comprises the following steps of: in the training process of a deep model, setting two matrixes mu and sigma for a gradient matrix G of partial parameters of an encoder in the deep model when each back propagation calculation is carried out according to an input source statement X; respectively representing a mean value and a standard deviation of the gradient matrix G by using matrixes mu and sigma, and calculating values of the matrixes mu and sigma according to the gradient matrix G; standardizing the gradient matrix G by using matrixes mu and sigma in each back propagation process during model training; the standardized use stage and position are controlled through a preset strategy and priori knowledge, rapid and accurate training of a machine translation model is carried out, and finally the translation quality of an output translation during decoding is improved. According to the method, the training process can be more stable and is not limited by specific models and tasks, the method has good universality, and the effect of the model can be more easily reproduced on different devices.

Description

Technical field [0001] The present invention relates to a machine translation model training method, specifically a machine translation method applied to a stable deep machine translation model. Background [0002] Machine Translation is a classic task in the field of natural language processing, the goal of which is to translate the source language (the language to be translated) into the target language (the translated language) through a computer. In today's interconnected world, the meaning of language is particularly important, including politics, business, education, medical and other fields, language is an irreplaceable medium for information transmission. In the face of the massive amount of data in today's Internet, it is difficult for humans to analyze this information by themselves, so machine translation technology as a powerful means of translation has been widely accepted. [0003]Although as early as the 17th century, scholars have proposed the idea of using machin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/58G06N3/04G06N3/08
CPCG06F40/58G06N3/08G06N3/045
Inventor 刘兴宇姜炎宏
Owner 沈阳雅译网络技术有限公司