Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

End-to-end intelligent voice reading evaluation method

A technology of intelligent speech and evaluation methods, applied in speech analysis, speech recognition, neural learning methods, etc., can solve problems such as the relationship between evaluation quality and recognition results, difficulty in evaluation errors, and application limitations, and achieve accurate extraction of evaluation-related features, Simplified training process and accurate evaluation results

Active Publication Date: 2022-06-21
浙江大学绍兴微电子研究中心 +2
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current technology related to speech evaluation needs to align the speech to be recognized with the text of interest, and often needs to extract features based on the hidden Markov chain model, which undoubtedly makes the deployment of the speech evaluation technology more complicated and makes the technology applications are limited
However, the current deep learning-based speech recognition (ASR) evaluation method needs to first recognize the evaluated speech, and then align the recognition result with the text to be evaluated to obtain the evaluation result, resulting in a relationship between the evaluation quality and the recognition result.
In such a case, the user's accent has a great influence on the evaluation results; on the other hand, the recognition results need to be aligned with the evaluation text, and it is impossible to achieve a complete end-to-end output error position. The recognition results can only be optimized indirectly, and cannot be directly optimized. evaluation results
Therefore, some evaluation errors that are not caused by recognition (such as pitch, rhythm, etc.) are difficult to be correctly judged based on this method of recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • End-to-end intelligent voice reading evaluation method
  • End-to-end intelligent voice reading evaluation method
  • End-to-end intelligent voice reading evaluation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] In order to better illustrate the innovation of this method, the following is a brief description of the existing speech evaluation methods. Existing speech evaluation methods include methods based on hidden Markov chains (HMM) and methods based on deep learning speech recognition.

[0018] Align the recognition model based on the hidden Markov chain, and then judge the reading situation combined with the state in the model. The training and reasoning process usually includes the following steps:

[0019] - Extract speech signal features from training samples of (speech, text), including noise reduction preprocessing, normalization and other processes, usually extracting multi-dimensional Mel Frequency Cepstrum Coefficient (MFCC) or Mel scale Filter bank (Melscale Filter Bank, FBank) and other characteristics.

[0020] - Convert each word in the text into a phoneme and build a Hidden Markov Model (HMM) model according to a language-dependent dictionary. These phonemes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an end-to-end intelligent voice reading evaluation method, which includes: collecting the data processing flow of reading voice, target pronunciation, error code, and speaker information used for training and evaluating the neural network; Auxiliary training; according to the target pronunciation to be evaluated and the pronunciation to be evaluated, the evaluation results are directly output from the neural network end-to-end, from the input of the target pronunciation and the pronunciation to be evaluated to the output of the evaluation results, the entire process can be differentiated, and can be directly targeted at the evaluation indicators optimization. The present invention directly constructs an end-to-end evaluation mode in which the input is speech and text to be evaluated, and the output is feedback results. It can be jointly trained with each module in the method to make the overall effect better. In addition, the auxiliary tasks constructed by the method can more accurately extract Evaluate related features to make feedback evaluation results more accurate.

Description

technical field [0001] The present invention relates to the technical field of voice evaluation, in particular to a voice evaluation method and system. Background technique [0002] With the development of speech recognition technology in artificial intelligence, people have begun to focus on how to use speech recognition technology in education-related scenarios. However, the traditional recognition evaluation method based on the hidden Markov chain model requires that the language to be recognized includes many stages of engineering processing, and requires more complex modeling. The traditional recognition method needs to split the word into phonemes and consider the context of the phoneme to form a triphone model, then construct the Markov chain corresponding to these triphone models, and estimate the path corresponding to the hidden Markov chain model from the sample probability, and finally get the recognition result. To improve these processes, domain experts need t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/51G10L15/26G10L15/02G10L15/16G10L15/06G06N3/04G06N3/08
CPCG10L25/51G10L15/02G10L15/16G10L15/063G06N3/084G10L2015/025G06N3/045
Inventor 张展王曰海
Owner 浙江大学绍兴微电子研究中心
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products