Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech translation model training method and system based on model enhancement, and speech translation method and device

A speech translation and model training technology, which is applied in the field of speech translation, can solve the problems of large data volume requirements and increase the difficulty of coding alignment, and achieve the effect of simple implementation, increased alignment difficulty, and improved ST model performance

Active Publication Date: 2022-05-06
PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU +1
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problem that the existing speech translation model requires a large amount of data, the present invention proposes a speech translation model training method, system, and speech translation method and device based on ModAugment, by increasing the alignment difficulty of the encoding and decoding parts , transforming the overfitting problem into an underfitting problem, after a longer training period, an overall more robust model can be obtained, and better results can be achieved with a smaller hyperparameter adjustment burden

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech translation model training method and system based on model enhancement, and speech translation method and device
  • Speech translation model training method and system based on model enhancement, and speech translation method and device
  • Speech translation model training method and system based on model enhancement, and speech translation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The present invention will be further explained below in conjunction with accompanying drawing and specific embodiment:

[0036] like figure 1 As shown, the present invention proposes a kind of speech translation model training method based on model enhancement on the one hand, comprising:

[0037] Step S101: collecting a speech translation data set, the speech translation data set is composed of a plurality of speech-translation-transcription triplets;

[0038] Step S102: Utilize the speech-transcription data pair in the speech translation data set to train the speech recognition model, and utilize the transcription-translation data pair in the speech translation data set to train the machine translation model;

[0039] Step S103: Initialize the encoding layer of the speech translation model with the speech recognition model, and initialize the decoding layer of the speech translation model with the machine translation model;

[0040] Step S104: masking the hidden la...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech translation model training method and system based on model enhancement, and a speech translation method and device. The training method includes: collecting a speech translation data set, and the speech translation data set consists of multiple speech-translation-transcription triplets Composition; use the speech-transcription data in the speech translation data set to train the speech recognition model, use the transcription-translation data in the speech translation data set to train the machine translation model; use the speech recognition model to initialize the coding layer of the speech translation model, and use the machine translation The model initializes the decoding layer of the speech translation model; masks the output of the hidden layer of the speech translation model, uses the speech translation data set and combines the loss function to train the speech translation model; after the speech translation model is trained, remove the mask, and The trained speech translation model is fine-tuned. The invention improves the recognition performance of the speech translation model, and can effectively improve the speech translation efficiency and quality.

Description

technical field [0001] The invention belongs to the technical field of speech translation, and in particular relates to a speech translation model training method and system based on model enhancement, and a speech translation method and device. Background technique [0002] Speech translation is an end-to-end model that converts speech in one language into text in another language, that is, skips the step of converting the source language into text in the traditional model, and directly converts the speech in the source language into the target Language text is a hot research topic at present. In terms of model selection, the Transformer model proposed by Google (A.Vaswani, N.Shazeer, N.Parmar, J.Uszkoreit, L.Jones, A.N.Gomez, L.Kaiser, and I.Polosukhin, "Attention is all you need, "in Proc.NIPS, California, USA, 2017, pp.5998–6008.) Due to the effectiveness of its self-attention mechanism modeling, the efficiency of parallel processing and the simplicity of the model stru...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/58G06N3/04G06N3/08G10L15/06G10L15/22
CPCG06F40/58G06N3/08G10L15/063G10L15/22G06N3/045
Inventor 屈丹张昊杨绪魁张文林闫红刚牛铜何振华陈琦
Owner PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU