Unlock instant, AI-driven research and patent intelligence for your innovation.

Pronunciation error detection method, device and storage medium

A technology of misdetection and speech, which is applied in speech analysis, speech recognition, instruments, etc., can solve problems such as difficulty in building a pronunciation error detection system, and achieve the effect of improving performance

Active Publication Date: 2022-08-02
BEIJING LANGUAGE AND CULTURE UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] However, the error detection method mentioned in the above reference, in the absence of pronunciation training data, although the target language L2 corpus is obtained through a small-scale speech corpus, such as about 150 hours of pre-training, the error detection performance is improved. , but for pronunciation features such as Chinese learning foreign languages, with a large span of speech levels and significant acoustic differences, it is still difficult to build a robust and good pronunciation error detection system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pronunciation error detection method, device and storage medium
  • Pronunciation error detection method, device and storage medium
  • Pronunciation error detection method, device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

[0038]In the transfer learning theory of deep learning, for a certain signal, such as image text and speech, it is hoped that the model can extract a general representation that reflects the inherent structure of the signal through pre-training. In this way, different tasks in the same domain can benefit from this common representation. Specific to a specific task, it is feasible to directly use this general representation as a feature, or add task-specific modules to fine-tune the pre-trained model as a whole. A pretrained model refers to learning a general representation from a large-scale corpus and using it for downstream tasks.

[0039] The pre-training model wav2vec2.0 is an open-source pre-training model that belongs to the wav2vec series. The pre-trained model wav2vec2.0 achieves SOTA (stata-of-the-art, state-of-the-art or state-of-the-art) performance...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a pronunciation error detection method, device and storage medium. The method includes constructing a voice pre-training model, and pre-training the voice pre-training model based on an unmarked voice corpus; A layer of randomly initialized fully connected layer is added to the model to obtain a fine-tuning pre-training model, and the fine-tuning pre-training model is trained using the labeled pronunciation bias data to obtain a pronunciation bias detection model; The detection model detects the learner's speech to obtain pronunciation bias information. The pronunciation error detection method, device and storage medium of the present invention, by constructing a speech pre-training model, fine-tuning the pre-training model, and using the pronunciation error detection model to detect the learner's speech to obtain pronunciation error information, make In the absence of pronunciation training data, the performance of the pronunciation error detection system can still be effectively improved.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, and in particular, to a method, device and storage medium for detecting pronunciation errors. Background technique [0002] With the development of speech technology and the promotion of online learning, Computer-Aided Pronunciation Training (CAPT) has been used more and more in language teaching. Among them, automatic pronunciation error detection, as an important part of computer-assisted pronunciation teaching, is mainly used to detect learners' pronunciation errors, so as to help learners find and correct pronunciation problems in time in the process of second language learning. [0003] The main principle of pronunciation error detection technology is to obtain a pronunciation error detection system including all the phoneme sets in the target language L2 by training a large number of target language L2 (Second / Targetlanguage, L2) speech corpora. During detection, the cor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/02G10L15/16G10L15/187G10L25/30G10L25/51
CPCG10L15/063G10L15/02G10L15/187G10L15/16G10L25/30G10L25/51G10L2015/025
Inventor 张劲松彭霖铠付凯奇解焱陆柯登峰
Owner BEIJING LANGUAGE AND CULTURE UNIVERSITY