Speech synthesis method and device

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A speech synthesis and parameter technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as difficulty in learning fundamental frequency trends, rhythm of synthesized speech, and lack of expressiveness.

Active Publication Date: 2019-09-03

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF8 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] In traditional speech synthesis systems, fundamental frequency modeling uses a multi-space probability distribution hidden Markov model (multi-space probability distribution HMM, MSD-HMM) modeling method, which can be very good for state-level, The fundamental frequency contour (or trend) of the consonant and final level is modeled, but it is difficult to learn the higher-level fundamental frequency trend of words, phrases or sentences, which makes the rhythm and expressiveness of the synthesized speech insufficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0025] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar modules or modules having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0026] figure 1 It is a schematic flowchart of a speech synthesis method proposed by an embodiment of the present invention. The process of this embodiment takes the synthesis process as an example. see figure 1 , the method includes:

[0027] S11: Perform text feature extraction on the text to be synthesized to obtain contextual feature information.

[0028] The process o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice synthesis method and a device. The voice synthesis method comprises steps of performing text characteristic extraction on a text to be synthesized to obtain the context characteristic information, obtaining a pre-generated model, wherein the pre-generated model is generated by training according to the context characteristic information of the training sample and converted acoustic parameter, and the converted acoustic parameters comprise a plurality of rhythm level fundamental frequency parameters, determining the model output parameter corresponding to the context characteristic information according to the model, wherein the model output parameters comprise a plurality of the rhythm level fundamental frequency parameters, performing the fundamental frequency reconstruction on the plurality of rhythm level fundamental frequency parameter, and synthesizing voice according to the parameter after the fundamental frequency reconstruction and the other parameters in the model output parameters. The method can improve the performance result of the synthesized speech.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a speech synthesis method and device. Background technique [0002] Now people are not only satisfied with the clarity and intelligibility of synthesized speech, but also require the synthesized speech to have better naturalness and expressiveness. In natural speech, fundamental frequency is the main factor affecting naturalness and expressiveness, so the accuracy of fundamental frequency modeling directly affects the naturalness and expressiveness of synthesized speech. [0003] In traditional speech synthesis systems, fundamental frequency modeling uses a multi-space probability distribution hidden Markov model (multi-space probability distribution HMM, MSD-HMM) modeling method, which can be very good for state-level, The fundamental frequency contour (or trend) of the consonant and final level is modeled, but it is difficult to learn the higher-level fundamental freq...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L13/02G10L13/033G10L13/047G10L13/10

Inventor盖于涛康永国张少飞

OwnerBAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Speech synthesis method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology