Extracting method of MFCC coefficients of voice signal, device and Mel filtering method
A speech signal and coefficient technology, applied in speech analysis, speech recognition, instruments, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] refer to figure 1 , is a flowchart of a method for extracting MFCC coefficients of a speech signal described in Embodiment 1.
[0057] S101, when performing Mel filtering, increase the number of subbands of the Mel filter bank, perform Mel filtering in the frequency range, and obtain a Mel filtering output corresponding to each subband;
[0058] That is, the original dimension of the Mel filter (that is, the number of subbands) is extended, and then the signal in the full frequency band is filtered. In this way, according to the mapping relationship between the Mel frequency and the linear frequency, the number of sub-bands in the low-frequency range on the signal frequency band (ie, the linear frequency band) is correspondingly increased, thereby ensuring sufficient frequency resolution accuracy for low-frequency signals. But at the same time, the number of sub-bands in the high-frequency range also increases accordingly. Since high-frequency signals are susceptible t...
Embodiment 2
[0070] The present invention is mainly applied to broadband signal processing with a frequency range of 0-16kHz, because the 16kHz broadband signal can basically meet the feature information required for speech recognition. The following will take a 16kHz broadband signal as an example to describe in detail. Among them, 0-8k is the low frequency range, and 8k-16k is the high frequency range. Of course, the present invention is not limited to the frequency range of 0-16 kHz.
[0071] refer to figure 2 , is a flowchart of a method for extracting MFCC coefficients of a speech signal described in Embodiment 2.
[0072] S201, voice enhancement processing;
[0073] In this embodiment, speech enhancement processing is performed on signals in the range of 16 kHz at the same time. The purpose of speech enhancement is to extract the original speech as pure as possible from the noisy speech signal. Currently, there are many enhancement algorithms commonly used, such as spectral subt...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 