Method for performing pronunciation error detecting based on holding vector machine
A support vector machine and error detection technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of sparse manual annotation, achieve good generalization, solve the problem of sparse manual annotation, and achieve good performance.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] Join shown in accompanying drawing 1~6.
[0051] The specific implementation steps of the method for mispronunciation detection using support vector machines are:
[0052] 1. The construction of the speech recognition system, the steps are as follows:
[0053] (1) Collect and train recognizer speech: according to the application needs of language learning, collect or record targeted standard pronunciation corpus in advance, and save them as recognizer training speech files, such as recording standard Mandarin speakers for the Mandarin Chinese proficiency test Putonghua proficiency test corpus;
[0054] (2) Data annotation: Pinyin annotation is performed on the collected standard corpus, so that the collected corpus is targeted for speech evaluation;
[0055] (3) Model training: use HTK to train HMM-based phoneme-level (27 initials, including zero initials, 37 finals) speech recognizer model based on collected standard corpus
[0056] (4) preservation: the model is pr...
Embodiment 2
[0088] The specific implementation steps of the method for mispronunciation detection using support vector machines are:
[0089] 1. The steps to build the speech recognition system are as follows:
[0090] (1) Collect or record standard pronunciation corpus in advance, and save it as a recognizer training voice file;
[0091] (2) Carry out pinyin labeling for the collected standard corpus;
[0092] (3) Model training: train the phoneme-level speech recognizer model according to the collected standard corpus;
[0093] (4) Save the speech recognizer in the library of the computer-aided language learning system.
[0094] 2. Pronunciation error detection feature extraction, the steps are: firstly use the text of the evaluated corpus to segment the pronunciation and calculate the logarithmic likelihood of the target text, denoted as likelihood T , and then, on the boundary obtained by segmentation, calculate the logarithmic likelihood of this segment to all other models in the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com