Speech translation model training method and system based on model enhancement and speech translation method and device

A speech translation and model training technology, applied in the field of speech translation, that addresses the large data requirement of existing speech translation models. It deliberately increases the alignment difficulty between the encoding and decoding parts, which simplifies implementation and improves the performance of speech translation (ST) models.

Active Publication Date: 2021-10-15
PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU +1

AI Technical Summary

Problems solved by technology

[0005] To address the problem that existing speech translation models require a large amount of data, the present invention proposes a ModAugment-based speech translation model training method and system, and a speech translation method and device. By increasing the alignment difficulty between the encoding and decoding parts, the overfitting problem is transformed into an underfitting problem. After a longer training period, an overall more robust model can be obtained, and better results can be achieved with a smaller hyperparameter tuning burden.

Examples

Embodiment Construction

[0035] The present invention is further explained below with reference to the accompanying drawings and specific embodiments:

[0036] As shown in Figure 1, in one aspect the present invention proposes a speech translation model training method based on model enhancement, comprising:

[0037] Step S101: collecting a speech translation data set composed of a plurality of speech-translation-transcription triplets;

[0038] Step S102: training a speech recognition model using the speech-transcription data pairs in the speech translation data set, and training a machine translation model using the transcription-translation data pairs in the speech translation data set;

[0039] Step S103: initializing the encoding layer of the speech translation model with the speech recognition model, and initializing the decoding layer of the speech translation model with the machine translation model;

[0040] Step S104: masking the hidden...
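Steps S101 to S103 can be illustrated with a minimal sketch. It only shows how one set of triplets yields both kinds of training pairs and how the two pretrained models seed the speech translation model. The `Triplet` class, the helper names, and the dictionary-based "models" are hypothetical stand-ins introduced for illustration; the source does not specify any data structures or framework.

```python
import copy
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Triplet:
    """One speech-translation-transcription triplet (field types are assumed)."""
    speech: str          # hypothetical: an identifier or path for the audio
    transcription: str   # source-language text
    translation: str     # target-language text


def asr_pairs(data: List[Triplet]) -> List[Tuple[str, str]]:
    """Speech-transcription pairs, used to train the speech recognition model (S102)."""
    return [(t.speech, t.transcription) for t in data]


def mt_pairs(data: List[Triplet]) -> List[Tuple[str, str]]:
    """Transcription-translation pairs, used to train the machine translation model (S102)."""
    return [(t.transcription, t.translation) for t in data]


def init_st_model(asr_model: Dict[str, list], mt_model: Dict[str, list]) -> Dict[str, list]:
    """Seed the ST model: encoder from the ASR model, decoder from the MT model (S103).

    Models are represented as plain dicts of parameter lists purely for illustration.
    """
    return {
        "encoder": copy.deepcopy(asr_model["encoder"]),
        "decoder": copy.deepcopy(mt_model["decoder"]),
    }


if __name__ == "__main__":
    data = [Triplet("utt1.wav", "guten morgen", "good morning")]
    asr_model = {"encoder": [0.1, 0.2], "decoder": [0.3]}
    mt_model = {"encoder": [0.4], "decoder": [0.5, 0.6]}
    st_model = init_st_model(asr_model, mt_model)
    print(asr_pairs(data), mt_pairs(data), st_model)
```

In a real system the dictionaries would be replaced by the parameter containers of whichever deep learning framework is used; the copy step keeps later ST training from modifying the pretrained models in place.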

Abstract

The invention discloses a speech translation model training method and system based on model enhancement, and a speech translation method and device. The training method comprises: collecting a speech translation data set composed of a plurality of speech-translation-transcription triplets; training a speech recognition model using the speech-transcription data pairs in the speech translation data set, and training a machine translation model using the transcription-translation data pairs in the speech translation data set; initializing the encoding layer of the speech translation model with the speech recognition model, and initializing the decoding layer of the speech translation model with the machine translation model; masking the output of a hidden layer of the speech translation model, and training the speech translation model on the speech translation data set in combination with a loss function; and, after the speech translation model is trained, removing the mask and fine-tuning the trained speech translation model. According to the invention, the recognition performance of the speech translation model is improved, and the efficiency and quality of speech translation can be effectively improved.
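The mask-then-fine-tune idea in the abstract can be sketched as follows. The source does not state how the mask is constructed, so this sketch assumes the simplest variant: each position of the hidden-layer output is zeroed independently with probability `mask_prob` during the first training phase, and the mask is switched off for fine-tuning. The function name, the `enabled` flag, and the list-of-lists representation are all hypothetical.

```python
import random
from typing import List

Hidden = List[List[float]]  # one vector per time step (assumed layout)


def mask_hidden(hidden: Hidden, mask_prob: float, rng: random.Random,
                enabled: bool = True) -> Hidden:
    """Return a copy of `hidden` with randomly chosen time steps zeroed.

    Assumption: whole positions are masked independently with probability
    `mask_prob`. With enabled=False (the fine-tuning phase) the hidden
    output passes through unchanged.
    """
    if not enabled or mask_prob <= 0.0:
        return [row[:] for row in hidden]
    masked = []
    for row in hidden:
        if rng.random() < mask_prob:
            masked.append([0.0] * len(row))  # masked position
        else:
            masked.append(row[:])            # kept position
    return masked


if __name__ == "__main__":
    rng = random.Random(0)
    hidden = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    # Phase 1: train with the mask applied to the hidden-layer output.
    print(mask_hidden(hidden, mask_prob=0.5, rng=rng, enabled=True))
    # Phase 2: remove the mask and fine-tune on the unmasked output.
    print(mask_hidden(hidden, mask_prob=0.5, rng=rng, enabled=False))
```

Zeroing part of the hidden output makes it harder for the decoder to align with the encoder, which is the stated mechanism for trading overfitting for underfitting; the second phase then lets the model adapt to the unmasked representations it will see at inference time.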

Description

Technical field

[0001] The invention belongs to the technical field of speech translation, and in particular relates to a speech translation model training method and system based on model enhancement, and a speech translation method and device.

Background

[0002] Speech translation uses an end-to-end model that converts speech in one language into text in another language. That is, it skips the step in the traditional model of first converting the source-language speech into text, and directly converts source-language speech into target-language text. It is currently a hot research topic. In terms of model selection, the Transformer model proposed by Google (A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, "Attention is all you need," in Proc. NIPS, California, USA, 2017, pp. 5998–6008), due to the effectiveness of its self-attention mechanism modeling, the efficiency of parallel processing and the simplicity of the model stru...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/58, G06N3/04, G06N3/08, G10L15/06, G10L15/22
CPC: G06F40/58, G06N3/08, G10L15/063, G10L15/22, G06N3/045
Inventor: 屈丹, 张昊, 杨绪魁, 张文林, 闫红刚, 牛铜, 何振华, 陈琦
Owner: PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU