Speech enhancement method based on time-frequency domain generative adversarial network
A speech enhancement, time-frequency domain technology, applied in speech analysis, instruments, etc., can solve the problem of ignoring the frequency domain characteristics of speech and noise
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach
[0111] Further, the specific implementation plan is as follows:
[0112] The training of the generated confrontation network is a cross-training process, which is divided into:
[0113] Step 4.1, the voice of the training set, the voice of the training set includes clean original voice and noisy original voice, the voice of the training set is divided into frames and samples to obtain clean voice x and noisy voice x c . Wherein, the frame length of sub-framing is N=16384, the frame shift is M=10ms, and the sampling rate is S=16kHz;
[0114] Step 4.2, short-time Fourier transform (STFT) is performed on the speech of the training set to obtain the frequency-domain amplitude spectrum X and X of the clean speech and the noisy speech c . Among them, the window function adopted by STFT is Hamming window, the window length is N, and the sampling rate is S. The standard short-time Fourier transform is shown in Equation 4.
[0115]
[0116] Among them, n is time, x(n) is time d...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


