Training method, device and electronic equipment for speech spectrum generation model
A technology for generation models and training methods, applied in speech synthesis, biological neural network models, speech analysis, and related fields. It addresses problems such as blurred spectra, modeling that fails to reflect the nature of the spectrum, and inconsistency between vocoder training and judgment. The effect is a clearer spectrum sequence.
Embodiment Construction
[0030] Exemplary embodiments of the present application are described below with reference to the accompanying drawings. The description includes various details of the embodiments to facilitate understanding, and these details should be regarded as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the application. Likewise, descriptions of well-known functions and structures are omitted from the following description for clarity and conciseness.
[0031] Spectrum generation is a key part of speech synthesis. It converts a text sequence into a spectrum sequence, and the spectrum sequence serves as a bridge linking the input text sequence to the final synthesized audio.
[0032] The spectrum generation technology in the prior art usually uses the Tacotron model. The Tac...


