Method for improving naturalness of speech synthesis
A technique for natural-sounding speech synthesis, applied in speech synthesis, speech analysis, instruments, and related fields. It addresses the problems of fewer models and the loss of naturalness in synthesized speech, and achieves pronunciation close to that of a real human voice, reduced complexity, and savings in computing and deployment costs.
Embodiment Construction
[0018] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
[0019] In the embodiment shown in Figure 1, a method for improving the naturalness of speech synthesis includes the following steps:
[0020] (1) Text encoding: the text is converted into its corresponding phonemes by a grapheme-to-phoneme tool. All of the phonemes then form a phoneme dictionary, and the size of the phoneme dictionary is used as the dimension of the embedding layer that represents the phonemes of the text; that is, an embedding layer, as used in deep learning, maps each phoneme to a feature vector;
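The text-encoding step can be sketched roughly as follows. This is a minimal illustration and not the patent's implementation: the toy lexicon stands in for a real grapheme-to-phoneme tool, the embedding vectors are random rather than trained, and every name (`TOY_LEXICON`, `text_to_phonemes`, `build_phoneme_dictionary`, `make_embedding_table`, `embed`) as well as the vector length of 4 is an assumption made for the example. The sketch interprets the size of the phoneme dictionary as the number of rows in the embedding table.

```python
import random

# Hypothetical toy lexicon standing in for a real grapheme-to-phoneme tool.
TOY_LEXICON = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def text_to_phonemes(text):
    """Convert text to a flat phoneme list using the toy lexicon."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(TOY_LEXICON[word])
    return phonemes


def build_phoneme_dictionary(lexicon):
    """Collect every distinct phoneme and give each a stable integer index."""
    symbols = sorted({p for pron in lexicon.values() for p in pron})
    return {p: i for i, p in enumerate(symbols)}


def make_embedding_table(num_phonemes, embed_dim, seed=0):
    """One randomly initialised vector per phoneme (training would update these)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(embed_dim)]
            for _ in range(num_phonemes)]


def embed(phonemes, phoneme_to_id, table):
    """Map each phoneme to its feature vector by row lookup."""
    return [table[phoneme_to_id[p]] for p in phonemes]


phoneme_to_id = build_phoneme_dictionary(TOY_LEXICON)
# The dictionary size fixes how many rows the embedding table has.
table = make_embedding_table(len(phoneme_to_id), embed_dim=4)
phonemes = text_to_phonemes("hello world")
features = embed(phonemes, phoneme_to_id, table)
print(phonemes)
print(len(features), len(features[0]))
```

Because the lookup is by index, the two occurrences of the phoneme `L` in this example receive the same feature vector; any context dependence has to come from the later encoding stage.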
[0021] (2) The represented features are encoded by the CBHG module. Here, the represented features are the feature vectors produced by the embedding layer, and encoding means mapping them to another feature vector through the CBHG module; the CBHG module consists of a one-dimensional convolution...
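The source text is cut off before it finishes describing the CBHG module, so the following is only a rough sketch of one-dimensional convolution as it is commonly used in the convolution-bank part of CBHG (convolution bank, highway network, bidirectional GRU) in Tacotron-style encoders. It is an assumption that the patent follows this layout. The highway and GRU parts are omitted, the weights are random rather than trained, and the names `conv1d_same` and `conv_bank` and all sizes are hypothetical.

```python
import random


def conv1d_same(seq, weights):
    """1-D convolution over time, zero-padded so the output has as many frames as the input.

    seq:     list of T frames, each a list of D floats
    weights: list of C filters; each filter is a list of k taps, each tap a list of D floats
    """
    num_frames, dim = len(seq), len(seq[0])
    width = len(weights[0])
    left = (width - 1) // 2  # frames of padding on the left; the rest falls on the right
    zero = [0.0] * dim
    out = []
    for t in range(num_frames):
        frame = []
        for filt in weights:
            acc = 0.0
            for j in range(width):
                src = t + j - left
                x = seq[src] if 0 <= src < num_frames else zero
                acc += sum(w * v for w, v in zip(filt[j], x))
            frame.append(acc)
        out.append(frame)
    return out


def conv_bank(seq, max_width, channels, seed=0):
    """Apply filters of width 1..max_width and concatenate their outputs per frame."""
    rng = random.Random(seed)
    dim = len(seq[0])
    outputs = []
    for width in range(1, max_width + 1):
        weights = [[[rng.uniform(-0.1, 0.1) for _ in range(dim)]
                    for _ in range(width)]
                   for _ in range(channels)]
        outputs.append(conv1d_same(seq, weights))
    return [sum((o[t] for o in outputs), []) for t in range(len(seq))]


# Five frames of three-dimensional phoneme features (made-up numbers).
frames = [[0.1 * (t + d) for d in range(3)] for t in range(5)]
encoded = conv_bank(frames, max_width=4, channels=2)
print(len(encoded), len(encoded[0]))
```

Filters of different widths look at different amounts of neighbouring context, which is why a bank of them is used: each output frame combines views of the phoneme sequence at several context sizes.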
