Speech de-noising method and speech de-noising device
A voice denoising, non-speech technology, applied in voice analysis, instruments, etc., can solve problems such as poor results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] figure 1 The flow chart of the voice denoising method provided in Embodiment 1 of the present invention, this embodiment can be used for voice denoising, the method can be executed by a voice denoising device, and the device can be implemented by software and / or hardware, the The device can be integrated into any intelligent terminal that provides voice denoising function. In specific implementation, the intelligent terminal can include: mobile terminals such as tablets, mobile phones, and e-readers. The above terminals are only examples, not exhaustive, including but not exhaustive. Limited to the smart terminals mentioned above.
[0024] see figure 1 , the method for speech denoising, comprising:
[0025] S110. Perform speech detection on the noisy speech signal to distinguish speech frames from non-speech frames.
[0026] The speech signal received by the smart terminal is a non-stationary time-varying noisy speech signal formed after being disturbed by the enviro...
Embodiment 2
[0044] figure 2 The flow chart of the speech denoising method provided by Embodiment 2 of the present invention. On the basis of the above embodiments, this embodiment proposes an effective speech feature combination when performing speech detection on noisy speech signals, which can more accurately distinguish Separate noisy speech frames and non-speech frames.
[0045] see figure 2 , the method for speech denoising, comprising:
[0046] S210. Extract speech features of the noisy speech signal.
[0047] The extracted noisy speech features include Mel cepstral coefficient MFCC, linear predictive coding residual and spectral centroid Centroid. Human beings have different perceptual abilities to different frequencies of speech: below 1kHz, it has a linear relationship with the frequency, and above 1kHz, it has a logarithmic relationship with the frequency. The higher the frequency, the worse the perception. In applications, only low-frequency MFCCs are often used, while m...
Embodiment 3
[0077] image 3 It is a flow chart of the speech denoising method provided by Embodiment 3 of the present invention. On the basis of the foregoing embodiments, this embodiment performs stationary noise suppression, non-speech noise suppression, and non-stationary noise suppression on noisy speech signals.
[0078] see image 3 , the method for speech denoising, comprising:
[0079] S310. Perform speech detection on the noisy speech signal to distinguish speech frames from non-speech frames.
[0080] S320. Perform noise estimation on the speech frame and the non-speech frame respectively, to obtain a fusion estimation value of the noise power spectrum.
[0081] S330. Perform stationary noise suppression, non-speech noise suppression, and non-stationary noise suppression on the noisy speech signal according to the fused estimated value of the noise power spectrum.
[0082] Exemplarily, after the fused estimated value of the noise power spectrum is obtained, denoising processi...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com