Many-to-many voice conversion method based on beta-vae
A voice conversion, many-to-many technology, applied in voice analysis, instruments, etc., can solve the problems of insufficient representation ability and difficult expansion, achieve high-quality voice conversion, improve voice quality, and improve the effect of insufficient representation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0031] Such as figure 1 As shown, the present embodiment provides a many-to-many voice conversion method based on beta-VAE under a non-parallel text condition, which is divided into two steps of training and conversion:
[0032] 1. Speaker voice training stage
[0033] 1.1 Obtain non-parallel training corpus. The speech library used here is VCC2018, which contains 8 source speakers (SF1, SF2, SM1, SM2, SF3, SF4, SM3, SM4) and 4 target speakers (TF1 , TF2, TM1, TM2). The non-parallel training corpus selected in this paper is 4 source speakers: SF3, SF4, SM3, SM4, and 4 target speakers TF1, TF2, TM1, TM2. Among them, S (source) represents the source speaker, T (target) represents the target speaker, F (female) represents female, and M (male) represents male. Since the goal of this paper is non-parallel speech conversion, the selected training corpus is also non-parallel. There are 4 source speakers: SF3, SF4, SM3, SM4, and 4 target speakers TF1, TF2, TM1, TM2. The content of...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


