Unlock instant, AI-driven research and patent intelligence for your innovation.

Sound selection method and device for waveform splicing speech synthesis

A technology of speech synthesis and waveform splicing, which is applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of cumbersome project implementation, full consideration of information synthesis, and high code complexity, so as to improve the effect of pre-selection.

Active Publication Date: 2019-07-30
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] (1) Each pre-selection process is independent of each other, and the information is not integrated and fully considered, so it is difficult to obtain a good pre-selection effect;
[0007] (2) The above-mentioned pre-selection process needs to adjust the threshold and weight, and the work of adjusting the threshold and weight requires a lot of meticulous manual work, and it is easy to lose sight of the other. After adjusting the threshold and weight for a sound bank, it is often necessary to readjust these parameter;
[0008] (3) Multi-step pre-selection is required, and the amount of calculation is large (especially KLD pre-selection);
[0009] (4) The engineering implementation of this method is relatively cumbersome, involving the maintenance of a large number of parameters, the code complexity is high, and it is difficult to maintain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound selection method and device for waveform splicing speech synthesis
  • Sound selection method and device for waveform splicing speech synthesis
  • Sound selection method and device for waveform splicing speech synthesis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar modules or modules having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0026] figure 1 It is a schematic flowchart of a sound selection method for waveform splicing speech synthesis proposed by an embodiment of the present invention. see figure 1 , the method includes:

[0027] S11: Obtain annotation information, which is obtained after front-end processing of the text to be synthesized.

[0028] Among them, the front-end processing mainly inc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice selection method and a device used for waveform splicing of voice synthesis. The voice selection method comprises steps of obtaining annotated information which is obtained via front end processing for a to-be-synthesized text; obtaining a pre-generated machine learning model; and according to the annotated information and the robot learning model, carrying out machine learning pre-selection so as to obtain sub-waveform fragments of a candidate voice. Thus, pre-selection effects during voice synthesis can be improved.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a sound selection method and device for waveform splicing speech synthesis. Background technique [0002] Speech synthesis, also known as text-to-speech (Text to Speech) technology, solves the main problem of how to convert text information into audible sound information. [0003] In speech synthesis, it is necessary to perform front-end processing on the input text first, then predict the acoustic parameters to obtain the acoustic parameters, and finally use the acoustic parameters to directly synthesize the sound through the vocoder, or select units from the sound library for waveform splicing. Compared with the voice synthesized by the vocoder, the synthesized voice based on waveform splicing has higher sound quality and better maintains the style of the original speaker. [0004] In the process of constructing a speech synthesis system based on waveform splicing, in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/10G10L13/033
CPCG10L13/033G10L13/10
Inventor 张辉李秀林
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD