End-to-end speech recognition model training method and system

A speech recognition model and training method technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as high professional requirements, high labor costs, and long time, so as to avoid hyperparameter tuning and reduce training time Effect

Active Publication Date: 2021-09-17
AISPEECH CO LTD
View PDF16 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the disadvantage of these technologies is that a large number of hyperparameters need to be tuned for a specific model, which is not universal, and requires a long time of training, which is very time-consuming.
If these similar technologies want to achieve good results, users need to have rich experience in parameter adjustment, high professional requirements, and high labor costs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • End-to-end speech recognition model training method and system
  • End-to-end speech recognition model training method and system
  • End-to-end speech recognition model training method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0026] It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict.

[0027] The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generall...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a training method for an end-to-end speech recognition model. The end-to-end speech recognition model includes an encoder and a decoder. The method includes: obtaining an acoustic model and a cross-entropy language model through pre-training; Model initialization of the encoder of the end-to-end speech recognition model; initialization of the decoder of the end-to-end speech recognition model according to the cross-entropy language model; training of the end-to-end speech recognition model after initialization. The multi-stage pre-training method avoids the long and slow learning phase in the early stage of model training, thereby greatly reducing the model training time. At the same time, this strategy has no hyperparameters to tune. Compared with the existing technology, it avoids a lot of tedious hyperparameter tuning.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, in particular to a training method and system for an end-to-end speech recognition model. Background technique [0002] In order to train a good end-to-end system in the prior art, there are preheating and super-long learning rate scheduling strategies, but they are all equivalent to slowly increasing the learning intensity of the model. Continue training in the normal way. [0003] Essentially, warm-up and super-long learning rate scheduling are all about giving the model a good initial state. However, the disadvantage of these techniques is that a large number of hyperparameter tuning is required for a specific model, which is not universal, and requires a long time of training, which is very time-consuming. If these similar technologies want to achieve good results, users need to have rich experience in parameter adjustment, high professional requirements, and high labor c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06
CPCG10L15/063
Inventor 俞凯钱彦旻黄明坤卢怡宙王岚
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products