Speech recognition method, device and terminal
A speech recognition and terminal technology, applied in the electronic field, can solve the problems of insufficient formant characteristics, low computational complexity, and failure to consider the characteristics of human hearing, and achieve the effect of improving the anti-noise performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0047] An embodiment of the present invention provides a speech recognition method, including:
[0048] S101. Acquire a frame of speech signal, and extract d-dimensional MFCC parameters from the speech signal; the value range of d is a positive integer, and generally d=24;
[0049] S102. Perform cepstrum calculation on the d-dimensional MFCC parameters to obtain d-dimensional cepstrum MFCC parameters;
[0050] S103. Perform iterative processing on the cepstrum MFCC parameters in each dimension according to the preset number of iterations to obtain d-dimensional iterative cepstrum MFCC parameters;
[0051] S104. Identify the speech signal based on the d-dimensional iterative cepstrum MFCC parameters.
[0052] The embodiment of the present invention realizes enhancing the anti-noise performance of speech recognition in the feature space, and iterates the traditional MFCC parameters through cepstrum calculation to obtain the dynamic change trajectory of the MFCC parameter featur...
Embodiment 2
[0101] The present invention provides a voice recognition device, which is the device embodiment of Embodiment 1, including:
[0102] Parameter extraction module 30, is used for obtaining a frame speech signal, extracts d dimension MFCC parameter from described speech signal;
[0103] The cepstrum module 32 is used for performing cepstrum calculation to the d-dimensional MFCC parameters to obtain d-dimensional cepstrum MFCC parameters;
[0104] The iterative module 34 is used to iteratively process the cepstrum MFCC parameters of each dimension according to the preset number of iterations to obtain d-dimensional iterative cepstrum MFCC parameters;
[0105] The identification module 36 is configured to identify the speech signal based on the d-dimensional iterative cepstrum MFCC parameters.
[0106] The embodiment of the present invention realizes enhancing the anti-noise performance of speech recognition in the feature space, and iterates the traditional MFCC parameters throu...
Embodiment 3
[0120] An embodiment of the present invention provides a terminal, where the terminal includes the speech recognition device described in Embodiment 2. The terminal in the embodiment of the present invention specifically refers to a terminal with a voice recognition function, including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a notebook computer, and the like.
[0121] In the specific implementation process of the embodiments of the present invention, refer to Embodiments 1 and 2, which have the technical effects of Embodiments 1 and 2, and will not be repeated here.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


