Standardized sampling method for extracting pathological speech MFCC features for artificial intelligence analysis
A technology of artificial intelligence and voice, applied in the field of intelligent recognition, to achieve the effect of improving objectivity and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0053]In speech recognition (Speech Recognition) and voiceprint recognition (Voice Print Recognition), the most commonly used speech feature is Mel-scale Frequency Cepstral Coefficients (MFCC). The human ear has different hearing sensitivities to sound waves of different frequencies. Speech signals from 200Hz to 5000Hz have the greatest impact on speech intelligibility. The critical bandwidth due to sound masking in the low frequency domain is smaller at higher frequencies. Therefore, 28 band-pass filters are arranged from dense to sparse according to the critical bandwidth from low frequency to high frequency to filter the input signal. Taking the signal energy output by each bandpass filter as the basic feature of the signal, this acoustic feature based on the characteristics of the human ear is MFCC. The shape of the human vocal tract can be presented in the form of a short-term power spectrum envelope, and MFCC can accurately represent this envelope, that is, use acousti...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com