Sequence-to-sequence speech synthesis method and system for double-layer autoregressive decoding
A speech synthesis and autoregressive technology, applied in speech synthesis, speech analysis, neural learning methods, etc., can solve problems such as unstoppable, unsatisfactory attention mechanism robustness, and insufficient long-term correlation modeling ability, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0053] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
[0054] According to an embodiment of the present invention, a sequence-to-sequence speech synthesis system with double-layer autoregressive decoding is proposed, including an encoder and a decoder. The structure of the encoder is the same as that of the Tacotraon2 model, and its decoder includes three modules of phoneme-level representation, phoneme-level prediction and frame-level prediction. Additionally, a total of four loss functions are propos...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


