End-to-end Blind Speech Enhancement with Atrous Causal Convolutional Generative Adversarial Networks
A network-side, convolutional technology, applied in speech analysis, instruments, etc., can solve problems such as poor hearing and perception, lack of high-frequency components in bone conduction speech, etc., to achieve increased computing costs, easy reconstruction, and good performance enhancement Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0034] The technical solution to realize the object of the present invention is: a kind of generation confrontational network structure based on hole causal convolution for end-to-end speech enhancement, which is different from the existing large Most enhancement methods are different. It directly uses bone conduction original audio sampling points as input data, pure air conduction original audio as the output target of training, and constructs and trains the hole causal convolution to generate an adversarial enhancement network.
[0035] The atrous causal convolution generation adversarial network includes a generator and a discriminator. The generator uses atrous causal convolution to perform deep meaning feature extraction and feature transformation on the input data of the network, and outputs enhanced samples; the discriminator is an input The original audio data and the enhanced speech samples generated by the generator use the convolution layer in the discriminator to e...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


