Voice synthesis method and device, computer readable medium and electronic equipment
A technology of speech synthesis and speech features, applied in speech synthesis methods, computer-readable media, electronic equipment, and devices, which can solve problems such as increasing the difficulty of text content, not smooth enough language conversion, and inability to stop
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
preparation example Construction
[0042] figure 2 It is a flowchart of a speech synthesis method according to another exemplary embodiment. Such as figure 2 As shown, the above method may further include the following step 103 .
[0043] In step 103, the prosodic representation of the target speaker is obtained.
[0044] In the present disclosure, the above-mentioned target reader may be a default reader, or may be a reading value set by the user. The prosodic representations described above can be used to indicate pitch and volume changes. Moreover, the prosody representation of the target reader can be obtained in the following manner: first, obtain the first Mel spectrum feature information corresponding to any second audio information read by the target reader; after that, the first Mel spectrum feature information Input into the preset Variational Auto-Encoder (VAE) model to obtain the prosodic representation of the target speaker. Wherein, the above-mentioned VAE model is trained based on the Mel ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com