Text information-based waveform concatenation voice synthesizing method
A waveform splicing and text information technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as unsatisfactory stability of synthesized speech, poor speech rhythm performance, and lack of consideration of the influence of text information, so as to enhance real-time performance. , high naturalness, the effect of reducing the number of
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] Such as figure 1 As shown, the flow chart of the waveform splicing speech synthesis method for text information, the method includes the following steps:
[0036] Step S1: Extract the acoustic parameters and text parameters of all primitives in the original audio through segment segmentation, and train the weight prediction model and duration prediction model according to the extracted parameters;
[0037] The model training module performs model training according to the text parameters and acoustic parameters of the training text and the corresponding audio extraction primitive, and obtains the time length prediction model and the weight prediction model required for the calculation of the target cost in the hierarchical preselection;
[0038] Such as figure 2 As shown, the training duration prediction model includes the following steps:
[0039] Step S11: Carry out segment segmentation (primary segmentation) on the original sound bank, and segment it into the mini...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
