Speech recognition method and device and terminal
A speech recognition and speech signal technology, applied in the electronic field, can solve problems such as insufficient formant characteristics, low computational complexity, and no consideration of human auditory characteristics
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0047] An embodiment of the present invention provides a speech recognition method, including:
[0048] S101. Acquire a frame of speech signal, and extract d-dimensional MFCC parameters from the speech signal; the value range of d is a positive integer, and generally d=24;
[0049] S102. Perform cepstrum calculation on the d-dimensional MFCC parameters to obtain d-dimensional cepstrum MFCC parameters;
[0050] S103. Perform iterative processing on the cepstrum MFCC parameters in each dimension according to the preset number of iterations to obtain d-dimensional iterative cepstrum MFCC parameters;
[0051] S104. Identify the speech signal based on the d-dimensional iterative cepstrum MFCC parameters.
[0052] The embodiment of the present invention realizes enhancing the anti-noise performance of speech recognition in the feature space, and iterates the traditional MFCC parameters through cepstrum calculation to obtain the dynamic change trajectory of the MFCC parameter featur...
Embodiment 2
[0101] The present invention provides a voice recognition device, which is the device embodiment of Embodiment 1, including:
[0102] Parameter extraction module 30, is used for obtaining a frame speech signal, extracts d dimension MFCC parameter from described speech signal;
[0103] The cepstrum module 32 is used for performing cepstrum calculation to the d-dimensional MFCC parameters to obtain d-dimensional cepstrum MFCC parameters;
[0104] The iterative module 34 is used to iteratively process the cepstrum MFCC parameters of each dimension according to the preset number of iterations to obtain d-dimensional iterative cepstrum MFCC parameters;
[0105] The identification module 36 is configured to identify the speech signal based on the d-dimensional iterative cepstrum MFCC parameters.
[0106] The embodiment of the present invention realizes enhancing the anti-noise performance of speech recognition in the feature space, and iterates the traditional MFCC parameters throu...
Embodiment 3
[0120] An embodiment of the present invention provides a terminal, where the terminal includes the speech recognition device described in Embodiment 2. The terminal in the embodiment of the present invention specifically refers to a terminal with a voice recognition function, including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a notebook computer, and the like.
[0121] In the specific implementation process of the embodiments of the present invention, refer to Embodiments 1 and 2, which have the technical effects of Embodiments 1 and 2, and will not be repeated here.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com