Speech synthesis method and device, equipment and computer readable storage medium

A speech synthesis technology, applied in speech synthesis, speech analysis, instruments, etc. It addresses problems such as the difficulty of defining speech style, the lack of style control, and the inability to synthesize speech with precise expressiveness.

Pending Publication Date: 2019-09-27
PING AN TECH (SHENZHEN) CO LTD
Cites: 0; Cited by: 17

AI Technical Summary

Problems solved by technology

[0003] Style carries a wealth of information, such as intent, emotion, and the intonation and flow of the speaker's voice, so the style of speech is difficult to define precisely. Current TTS systems and end-to-end speech synthesis systems can only learn the average prosodic distribution of the input data; without style control, they cannot synthesize speech with precise expressiveness for longer text sentences.

Method used

Examples

Embodiment Construction

[0027] The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, not all, of the embodiments of the present application. All other embodiments that persons of ordinary skill in the art obtain from these embodiments without creative effort fall within the scope of protection of this application.

[0028] The flow charts shown in the drawings are only illustrations; they need not include all contents and operations/steps, nor must the steps be performed in the order described. For example, some operations/steps can be decomposed, combined, or partly combined, so the actual order of execution may change according to the actual situation.

[0029] Embodiments of the present application provide a speech synthesis method, device, computer equipment, and co...

Abstract

The invention provides a speech synthesis method and device, equipment, and a computer-readable storage medium, and relates to speech synthesis. The speech synthesis method comprises the following steps: a reference speech sequence is determined, and a speech synthesis model and the target text vector corresponding to a to-be-synthesized target text sequence are obtained; the reference speech sequence is encoded by a reference encoder to obtain a target reference embedding vector corresponding to the reference speech sequence; the target reference embedding vector is style-marked by a style mark layer to obtain a target style embedding vector corresponding to the reference speech sequence; and a speech synthesis layer performs the speech synthesis operation based on the target text vector and the target style embedding vector to obtain the target speech. Because the speech is synthesized jointly from the target style embedding vector and the target text vector, the synthesized speech carries the prosody expressed by the target style embedding vector, which effectively improves the accuracy of its expressiveness.
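The data flow in the abstract can be illustrated with a minimal NumPy sketch. It is not the patent's implementation: the function names, the dimensions, the random weights, and the reading of the "style mark layer" as softmax attention over a bank of learned style vectors are all assumptions made for illustration. The speech synthesis layer itself (the decoder) is omitted; the sketch stops at the style-conditioned input such a layer might consume.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
N_MELS, REF_DIM, TOKEN_DIM, N_TOKENS = 80, 128, 64, 10
TEXT_LEN, TEXT_DIM = 12, 256


def reference_encoder(mel, w_ref):
    """Collapse a (frames, n_mels) reference spectrogram into one vector.

    Assumption: mean pooling plus a linear projection stands in for
    whatever encoder network the patent actually uses.
    """
    return np.tanh(mel.mean(axis=0) @ w_ref)


def style_mark_layer(ref_embedding, tokens, w_query):
    """Attend over a bank of style vectors and return their weighted sum.

    Assumption: "style marking" is modelled as softmax attention.
    """
    query = ref_embedding @ w_query                      # (TOKEN_DIM,)
    scores = tokens @ query / np.sqrt(tokens.shape[1])   # (N_TOKENS,)
    weights = np.exp(scores - scores.max())              # stable softmax
    weights /= weights.sum()
    return weights @ tokens, weights


def condition_text(text_vectors, style_embedding):
    """Append the same style embedding to every text position."""
    tiled = np.tile(style_embedding, (text_vectors.shape[0], 1))
    return np.concatenate([text_vectors, tiled], axis=1)


# Random stand-ins for trained parameters and real inputs.
w_ref = rng.normal(scale=0.1, size=(N_MELS, REF_DIM))
w_query = rng.normal(scale=0.1, size=(REF_DIM, TOKEN_DIM))
tokens = rng.normal(size=(N_TOKENS, TOKEN_DIM))
reference_mel = rng.normal(size=(120, N_MELS))      # reference speech sequence
text_vectors = rng.normal(size=(TEXT_LEN, TEXT_DIM))  # target text vector

ref_embedding = reference_encoder(reference_mel, w_ref)
style_embedding, weights = style_mark_layer(ref_embedding, tokens, w_query)
decoder_input = condition_text(text_vectors, style_embedding)

print(decoder_input.shape)
```

Concatenation is only one plausible way to combine the two vectors; the abstract says only that synthesis is based on both the target text vector and the target style embedding vector.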

Description

Technical field

[0001] The present application relates to the technical field of speech synthesis, and in particular to a speech synthesis method, device, equipment and computer-readable storage medium.

Background

[0002] With the rapid development of TTS (Text To Speech) systems, more and more scenarios use them, such as audiobook reading, news reading, and chat assistants. Neural network models can synthesize speech, and expressiveness affects how that speech performs; to synthesize more human-like speech, however, neural network models must learn prosody, which is the combination of a set of phenomena in speech, such as paralinguistic information, intonation, accent, and style.

[0003] Style carries a wealth of information, such as intent, emotion, and the intonation and flow of the speaker's voice, so the style of speech is difficult to define precisely, and the current TTS syst...

Claims

Application Information

IPC(8): G10L13/08, G10L13/02
CPC: G10L13/08, G10L13/02
Inventors: 王健宗, 孙奥兰, 彭话易, 程宁
Owner: PING AN TECH (SHENZHEN) CO LTD