A method, system, device and storage medium for converting text into speech
A technology for converting text to speech, applied in speech analysis, speech synthesis, instruments, etc., can solve the problems of high cost, restricting the development of text-to-speech technology, scarcity, etc., and achieve the effect of low cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0049] In this embodiment, the principle of the method for converting text into speech is as follows figure 1 shown. The basic process is as follows: perform preprocessing such as amplitude normalization, trimming silence and spectral conversion of the speech to be migrated to obtain the linear spectrum or mel spectrum of the speech to be migrated, and then input the linear spectrum or mel spectrum into the speech style encoder. On the other hand, after preprocessing the test text such as sentence segmentation and word segmentation, it is input into the attention-based auto-encoding model, and the output from the auto-encoding model is obtained. Pronunciation coding: The style coding and pronunciation coding are spliced and input into the speech decoder, and the spectrum output by the speech decoder is obtained after processing, and then the spectrum is converted into the obtained speech.
[0050] refer to figure 1 , the voice style encoder is composed of a multi-layer two...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


