A Wavenet-Based Bone Conduction Speech Enhanced Waveform Generation Method
A waveform generation and voice enhancement technology, applied in the field of bone conduction, can solve problems such as unsatisfactory conversion effect, difficult modeling, voice problems, etc., to achieve good spectrum expansion function and improve quality.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0100] The following is a detailed introduction to WaveNet-based waveform generation:
[0101] WaveNet is a fully probabilistic autoregressive generation model. By constructing a special deep convolutional neural network structure, it realizes the direct modeling of the speech waveform level. It usually needs to give additional input conditions to guide the generation of speech waveforms with specific properties. .
[0102] Let the speech waveform sequence be x={x 1 ,···,x t-1}, then its joint probability density distribution under the conditional feature λ can be expressed as the product of the following conditional probabilities:
[0103]
[0104] WaveNet uses PixelCNN to realize the calculation of the probability distribution of formula (1) by stacking well-designed convolutional layers, and uses deep residual network and parameterized skip connection to construct a deeper network structure and achieve fast convergence of the model.
[0105] The method of the present ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


