Pronunciation error detection method, device and storage medium
A technology of misdetection and speech, which is applied in speech analysis, speech recognition, instruments, etc., can solve problems such as difficulty in building a pronunciation error detection system, and achieve the effect of improving performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0037] Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.
[0038]In the transfer learning theory of deep learning, for a certain signal, such as image text and speech, it is hoped that the model can extract a general representation that reflects the inherent structure of the signal through pre-training. In this way, different tasks in the same domain can benefit from this common representation. Specific to a specific task, it is feasible to directly use this general representation as a feature, or add task-specific modules to fine-tune the pre-trained model as a whole. A pretrained model refers to learning a general representation from a large-scale corpus and using it for downstream tasks.
[0039] The pre-training model wav2vec2.0 is an open-source pre-training model that belongs to the wav2vec series. The pre-trained model wav2vec2.0 achieves SOTA (stata-of-the-art, state-of-the-art or state-of-the-art) performance...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


