Pronunciation bias error detection method and device and storage medium
A pronunciation error detection technology, applied in speech analysis, speech recognition, instruments, and related fields, that addresses problems such as the difficulty of building a pronunciation error detection system and achieves improved performance.
Embodiment Construction
[0037] Below, the present invention will be described in detail with reference to the accompanying drawings.
[0038] In the transfer learning theory of deep learning, the goal of pre-training on a given kind of signal, such as images, text, or speech, is for the model to extract a general representation that reflects the internal structure of the signal, so that different tasks in the same domain can benefit from this common representation. For a specific task, this general representation can be used directly as a feature, or a task-specific module can be added and the pre-trained model fine-tuned as a whole. A pre-training model is one that learns a general representation from a large-scale corpus for use in downstream tasks.
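The first of the two routes described above, using the general representation directly as a feature, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: `make_frozen_encoder` is a hypothetical stand-in for a pre-trained model (a fixed random projection rather than learned weights), the toy labels are invented, and the task-specific module is a plain logistic-regression head. Only the head is trained; the encoder is never updated.

```python
import math
import random

def make_frozen_encoder(in_dim, rep_dim, seed=0):
    """Hypothetical stand-in for a pre-trained encoder: a fixed random
    projection followed by tanh. A real system would load learned weights."""
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0) for _ in range(in_dim)] for _ in range(rep_dim)]

    def encode(x):
        # The weights are never updated, so the representation stays frozen.
        return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]

    return encode

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def train_head(features, labels, lr=0.1, epochs=100):
    """Train a logistic-regression head on fixed features with plain SGD."""
    w = [0.0] * len(features[0])
    b = 0.0
    for _ in range(epochs):
        for f, y in zip(features, labels):
            grad = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b) - y
            w = [wi - lr * grad * fi for wi, fi in zip(w, f)]
            b -= lr * grad
    return w, b

def predict(w, b, f):
    """Return the predicted class (0 or 1) for one feature vector."""
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) + b > 0 else 0

# Toy downstream task (invented for illustration): label is 1 when the
# first input value is positive.
rng = random.Random(1)
inputs = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(50)]
labels = [1 if x[0] > 0 else 0 for x in inputs]

encode = make_frozen_encoder(in_dim=4, rep_dim=8)
features = [encode(x) for x in inputs]   # general representation used as features
w, b = train_head(features, labels)      # only the task-specific head is trained
accuracy = sum(predict(w, b, f) == y for f, y in zip(features, labels)) / len(labels)
print(f"training accuracy of the head on frozen features: {accuracy:.2f}")
```

The second route, overall fine-tuning, would differ in that the encoder weights would also receive gradient updates together with the head; that variant is not shown here.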
[0039] wav2vec2.0 is an open-source pre-training model belonging to the wav2vec series. It achieves SOTA (state-of-the-art, i.e. frontier or highest-level) performance on mul...