Speech emotion recognition method combining CGAN spectrogram denoising and bilateral filtering spectrogram enhancement
A speech emotion recognition technique using bilateral filtering, applied in speech analysis, instruments, and related fields. It addresses the degradation of voice quality and emotional information, and balances the enhancement of fine details against the enhancement of strong edges.
Embodiment Construction
[0065] The technical solutions of the present invention will be further explained below through specific examples.
[0066] As shown in Figure 1, the speech emotion recognition method combining CGAN spectrogram denoising and bilateral filtering spectrogram enhancement in this embodiment of the present invention includes the following steps:
[0067] S1. Collect a speech emotion data set and preprocess it to obtain a spectrogram data set of the clean speech; also add noise to the clean speech to obtain a noise-added spectrogram data set, that is, the spectrogram data set in a noisy environment;
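The noise-adding part of step S1 can be sketched as follows. This is a minimal illustration, not the patent's procedure: the source does not say which noise type or noise level is used, so the white Gaussian noise, the target signal-to-noise ratio in dB, and the helper names `add_noise` and `measure_snr_db` are all assumptions made for the example.

```python
import math
import random


def add_noise(clean, target_snr_db, seed=0):
    """Return the clean signal plus white Gaussian noise at the requested SNR.

    Assumption: white Gaussian noise and an SNR target in dB; the source
    only states that noise is added to the clean speech.
    """
    rng = random.Random(seed)
    # Average power of the clean signal.
    signal_power = sum(s * s for s in clean) / len(clean)
    # Noise power implied by SNR_dB = 10 * log10(P_signal / P_noise).
    noise_power = signal_power / (10 ** (target_snr_db / 10))
    raw = [rng.gauss(0.0, 1.0) for _ in clean]
    raw_power = sum(v * v for v in raw) / len(raw)
    # Rescale the drawn noise so its measured power equals noise_power.
    scale = math.sqrt(noise_power / raw_power)
    return [s + scale * v for s, v in zip(clean, raw)]


def measure_snr_db(clean, noisy):
    """Measure the SNR in dB of a noisy signal against its clean reference."""
    signal_power = sum(s * s for s in clean) / len(clean)
    noise_power = sum((y - s) ** 2 for s, y in zip(clean, noisy)) / len(clean)
    return 10 * math.log10(signal_power / noise_power)


# Hypothetical stand-in for a speech recording: a short sine tone.
clean = [math.sin(2 * math.pi * 5 * n / 200) for n in range(200)]
noisy = add_noise(clean, target_snr_db=10)
```

Applying the same spectrogram computation to `clean` and to `noisy` would then give a paired clean and noise-added sample of the kind S1 describes.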
[0068] Specifically, each speech signal in the speech emotion data set is preprocessed by framing and windowing, and a short-time discrete Fourier transform is then applied to obtain the spectrum X(k):
[0069] $$X(k) = \sum_{n=0}^{N-1} x(n)\, w(n)\, e^{-j 2\pi k n / N}, \quad 0 \le k \le N-1$$
[0070] where N is the window length, x(n) is the speech signal, and w(n) is the Hamming window func...
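The framing, windowing, and short-time discrete Fourier transform described above can be sketched as follows. This is a direct, unoptimized transcription of a windowed DFT over each frame, intended only to make the symbols N, x(n), w(n), and X(k) concrete. The frame length, hop size, the textbook Hamming form `0.54 - 0.46*cos(2*pi*n/(N-1))`, and the function names are assumptions for illustration; the source does not give them, and a practical implementation would normally use an FFT routine instead of the O(N²) sum.

```python
import cmath
import math


def hamming(N):
    """Textbook Hamming window w(n), n = 0..N-1 (assumed form)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (N - 1)) for n in range(N)]


def frame_spectrum(frame, window):
    """X(k) = sum over n of x(n) * w(n) * exp(-j*2*pi*k*n/N), for k = 0..N-1."""
    N = len(frame)
    return [
        sum(frame[n] * window[n] * cmath.exp(-2j * math.pi * k * n / N)
            for n in range(N))
        for k in range(N)
    ]


def spectrogram(signal, frame_len=64, hop=32):
    """Split the signal into overlapping frames and return |X(k)| per frame.

    frame_len and hop are hypothetical values chosen for the example.
    """
    window = hamming(frame_len)
    starts = range(0, len(signal) - frame_len + 1, hop)
    frames = [signal[i:i + frame_len] for i in starts]
    # For real input, bins above N/2 mirror the lower ones, so keep 0..N/2.
    return [
        [abs(X) for X in frame_spectrum(f, window)[: frame_len // 2 + 1]]
        for f in frames
    ]


# Hypothetical test tone with exactly 8 cycles per 64-sample frame.
signal = [math.sin(2 * math.pi * 8 * n / 64) for n in range(256)]
spec = spectrogram(signal)
```

Each row of `spec` is one time frame and each column one frequency bin, which is the two-dimensional layout a spectrogram image is drawn from.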


