Pronunciation error detection method, device and storage medium

A pronunciation error detection technology, applied in speech analysis, speech recognition, instruments, etc., that addresses the difficulty of building a pronunciation error detection system and improves detection performance.

Active Publication Date: 2022-08-02

BEIJING LANGUAGE AND CULTURE UNIVERSITY

Problems solved by technology

[0007] However, the error detection method in the above reference has a limitation. In the absence of pronunciation training data, pre-training on a small-scale target language (L2) speech corpus, for example about 150 hours, does improve error detection performance. But for learner pronunciation, such as in Chinese foreign-language learning, where proficiency levels span a wide range and acoustic differences are significant, it is still difficult to build a robust pronunciation error detection system.

Embodiment Construction

[0037] Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

[0038] In the transfer learning theory of deep learning, for a given type of signal, such as images, text, or speech, the hope is that pre-training lets the model extract a general representation that reflects the inherent structure of the signal. Different tasks in the same domain can then benefit from this common representation. For a specific task, one can either use the general representation directly as a feature, or add task-specific modules and fine-tune the pre-trained model as a whole. A pre-trained model is one that learns a general representation from a large-scale corpus and is then used for downstream tasks.
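The two ways of using a pre-trained representation described in [0038] differ mainly in which parameters are updated during task training. The following is a minimal sketch of that distinction only; `ParamGroup`, `select_trainable`, and the group names `encoder` and `task_head` are hypothetical names chosen for illustration and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class ParamGroup:
    name: str
    pretrained: bool          # True if the weights come from pre-training
    trainable: bool = False


def select_trainable(groups, strategy):
    """Mark which parameter groups get updated under the given strategy."""
    if strategy not in ("feature_extraction", "fine_tune"):
        raise ValueError(f"unknown strategy: {strategy}")
    for group in groups:
        # Feature extraction freezes pre-trained weights;
        # fine-tuning updates the whole model, including the new module.
        group.trainable = strategy == "fine_tune" or not group.pretrained
    return [group.name for group in groups if group.trainable]


# Hypothetical model: a pre-trained encoder plus a newly added task module.
groups = [
    ParamGroup("encoder", pretrained=True),
    ParamGroup("task_head", pretrained=False),
]
frozen_features = select_trainable(groups, "feature_extraction")
whole_model = select_trainable(groups, "fine_tune")
```

With feature extraction only the task-specific module is trained, while fine-tuning as a whole also adapts the pre-trained weights to the downstream task.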

[0039] wav2vec2.0 is an open-source pre-trained model in the wav2vec series. It achieves SOTA (state-of-the-art) performance...

Abstract

The present invention provides a pronunciation error detection method, device and storage medium. The method includes: constructing a speech pre-training model and pre-training it on an unlabeled speech corpus; adding a randomly initialized fully connected layer to the model to obtain a fine-tuning pre-training model, and training the fine-tuning pre-training model with labeled pronunciation error data to obtain a pronunciation error detection model; and using the pronunciation error detection model to detect the learner's speech and obtain pronunciation error information. By constructing a speech pre-training model, fine-tuning the pre-training model, and using the resulting pronunciation error detection model on the learner's speech, the method, device and storage medium of the present invention can still effectively improve the performance of a pronunciation error detection system in the absence of pronunciation training data.
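The step of adding a randomly initialized fully connected layer and training it on labeled data can be sketched in plain Python. This is an illustration under stated assumptions, not the patent's implementation: the pre-trained encoder is replaced by a single hand-written 4-dimensional output frame, the number of classes and the learning rate are arbitrary, and only the new layer is updated here, whereas the abstract describes training the fine-tuning model as a whole. All function names are hypothetical.

```python
import math
import random


def init_fc_layer(in_dim, num_classes, rng):
    """Randomly initialise a fully connected layer: weights plus zero biases."""
    bound = 1.0 / math.sqrt(in_dim)
    weights = [[rng.uniform(-bound, bound) for _ in range(in_dim)]
               for _ in range(num_classes)]
    return weights, [0.0] * num_classes


def softmax(logits):
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def forward(frame, layer):
    """Class probabilities for one encoder output frame."""
    weights, biases = layer
    logits = [sum(w * x for w, x in zip(row, frame)) + b
              for row, b in zip(weights, biases)]
    return softmax(logits)


def cross_entropy(frame, target, layer):
    return -math.log(forward(frame, layer)[target])


def sgd_step(frame, target, layer, lr=0.1):
    """One gradient step on the new layer only (the encoder is a stub here)."""
    weights, biases = layer
    probs = forward(frame, layer)
    for k, p in enumerate(probs):
        grad = p - (1.0 if k == target else 0.0)  # d(loss)/d(logit_k)
        biases[k] -= lr * grad
        for j, x in enumerate(frame):
            weights[k][j] -= lr * grad * x


# Hypothetical data: one encoder output frame labelled as class 2.
rng = random.Random(0)
layer = init_fc_layer(in_dim=4, num_classes=3, rng=rng)
frame, target = [0.5, -0.2, 0.1, 0.9], 2
loss_before = cross_entropy(frame, target, layer)
sgd_step(frame, target, layer)
loss_after = cross_entropy(frame, target, layer)
```

In a real system the frames would come from the pre-trained speech model, the classes would be phoneme or error labels, and the gradient would also flow back into the pre-trained weights.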

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition, and in particular to a method, device and storage medium for detecting pronunciation errors.

Background art

[0002] With the development of speech technology and the spread of online learning, Computer-Aided Pronunciation Training (CAPT) is used more and more in language teaching. Automatic pronunciation error detection, an important part of computer-assisted pronunciation teaching, is mainly used to detect learners' pronunciation errors, helping learners find and correct pronunciation problems in time during second language learning.

[0003] The main principle of pronunciation error detection technology is to train on a large amount of target language (second/target language, L2) speech corpora to obtain a pronunciation error detection system that covers the full phoneme set of the target language. During detection, the cor...
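Because the description above is truncated, the exact comparison step is not available here. A common way to flag pronunciation errors, shown below as an assumption rather than as the patent's method, is to align the phoneme sequence recognized from the learner's speech with the canonical sequence and report the differences. The function name `detect_errors` and the pinyin-style phoneme labels are hypothetical; the alignment uses Python's standard `difflib.SequenceMatcher`.

```python
from difflib import SequenceMatcher


def detect_errors(canonical, recognized):
    """List the spans where the recognized phonemes differ from the canonical ones.

    Each entry is (tag, canonical_span, recognized_span), where tag is
    'replace' (substitution), 'delete' (missing phoneme) or 'insert'
    (extra phoneme).
    """
    matcher = SequenceMatcher(a=canonical, b=recognized, autojunk=False)
    errors = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            errors.append((tag, canonical[i1:i2], recognized[j1:j2]))
    return errors


# Hypothetical example: the learner produces "z" where "zh" is expected.
canonical = ["zh", "ong", "g", "uo"]
recognized = ["z", "ong", "g", "uo"]
errors = detect_errors(canonical, recognized)
# errors == [("replace", ["zh"], ["z"])]
```

The recognized sequence would in practice come from the trained detection model; here it is written by hand.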

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L15/06, G10L15/02, G10L15/16, G10L15/187, G10L25/30, G10L25/51

CPC: G10L15/063, G10L15/02, G10L15/187, G10L15/16, G10L25/30, G10L25/51, G10L2015/025

Inventor: 张劲松, 彭霖铠, 付凯奇, 解焱陆, 柯登峰

Owner: BEIJING LANGUAGE AND CULTURE UNIVERSITY
