Deep neural network (DNN) voice enhancement model based on MEE optimization criteria
A deep neural network and speech enhancement technology, applied in biological neural network models, speech analysis, speech recognition, etc., can solve problems such as unsatisfactory non-stationary noise effect, and achieve the effect of solving noisy speech noise reduction.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0034] Hereinafter, the preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.
[0035] Select 4620 pure speech and white noise, pink noise, Volvo noise and car noise from the TIMIT data set to add -5db, noisy speech with 5db signal-to-noise ratio as the training set. Another 200 pure voices are mixed with babble noise and factory noise under the same signal-to-noise ratio as the test set.
[0036] In the training stage, features are extracted from the training set speech, feature selection logarithmic power spectrum, and input to the MSE-DNN network and the MEE-DNN network proposed by the present invention for training.
[0037] After the network training is completed, extract the logarithmic power spectrum of the test set speech as well, and input it into two different DNN networks again to obtain the estimation of the logarithmic power spectrum of the pure speech, and use the overlap-addition method to reconstruct th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com