Speech synthesis method, electronic equipment and storage device

A speech synthesis technology, applied in the field of artificial intelligence, that addresses the lack of emotion in synthesized speech, which leaves intelligent customer service unable to empathize with users and degrades the user experience.

Pending Publication Date: 2021-05-11
UNIV OF SCI & TECH OF CHINA +1

AI Technical Summary

Problems solved by technology

Current speech synthesis lacks emotion. For example, in interactive scenarios, intelligent customer service cannot empathize with the user during the interaction, which greatly degrades the user experience.



Examples


Detailed description of embodiments

[0020] Referring to Figure 1, Figure 1 is a schematic flowchart of an embodiment of the speech synthesis method of the present application. Specifically, the method may include the following steps:

[0021] Step S11: Obtain the text to be synthesized and the target emotion type of the text to be synthesized, and obtain the reference audio of the target emotion type.
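The inputs gathered in Step S11 can be pictured as a small record that pairs the text with its target emotion type and the matching reference audio. The following is only a minimal sketch under stated assumptions: the `REFERENCE_AUDIO` registry, the emotion labels, the file paths, and the names `SynthesisRequest` and `build_request` are all hypothetical and do not come from the application.

```python
from dataclasses import dataclass

# Hypothetical registry mapping an emotion type to reference recordings.
# The labels and paths are illustrative only, not taken from the application.
REFERENCE_AUDIO = {
    "happy": ["ref/happy_001.wav", "ref/happy_002.wav"],
    "apologetic": ["ref/apologetic_001.wav"],
}


@dataclass(frozen=True)
class SynthesisRequest:
    """Everything Step S11 collects before synthesis starts."""
    text: str
    target_emotion: str
    reference_audio: tuple


def build_request(text, target_emotion, registry=REFERENCE_AUDIO):
    """Pair the text to be synthesized with its emotion type and reference audio."""
    if not text.strip():
        raise ValueError("text to be synthesized is empty")
    try:
        refs = registry[target_emotion]
    except KeyError:
        raise ValueError(
            f"no reference audio for emotion {target_emotion!r}"
        ) from None
    return SynthesisRequest(text, target_emotion, tuple(refs))
```

Failing early when no reference audio exists for the requested emotion is a design choice of this sketch; the application itself may handle that case differently.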

[0022] In an implementation scenario, the text to be synthesized may be set according to the actual application scenario. For example, in an intelligent customer service scenario, the text to be synthesized can be the reply text to the user's questions, instructions, and so on. For the user instruction "Please help me check this month's phone bill", the corresponding reply text can be "Checking, please wait a moment", and this reply text can then be used as the text to be synthesized. Alternatively, in a novel reading scenario, the text to be synthesized can be the conversation text of a character, such as the conversation t...


Abstract

The invention discloses a speech synthesis method, electronic equipment, and a storage device. The method comprises: obtaining a text to be synthesized and a target emotion type of the text, and obtaining reference audio of the target emotion type; obtaining prosodic (rhythm) features corresponding to the target emotion type based on the data distribution of the reference audio over the prosodic parameters, where the prosodic parameters comprise at least one of fundamental frequency, intensity, and duration; performing feature extraction on the phoneme sequence of the text to be synthesized to obtain its phoneme features; and decoding with the prosodic features and the phoneme features to obtain synthesized audio of the text that incorporates the target emotion type. With this scheme, emotion can be accurately fused into the synthesized audio.
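To make "data distribution of the reference audio" concrete, one simple reading is to summarize each rhythm (prosodic) parameter by its mean and standard deviation. The sketch below assumes exactly that; the visible text does not say which statistics are used, so this choice is an assumption. The function names, the convention that an F0 value of 0 marks an unvoiced frame, and the units are also assumptions made for illustration.

```python
from statistics import fmean, pstdev


def distribution_stats(values):
    """Summarize one parameter by (mean, population standard deviation)."""
    values = list(values)
    if not values:
        raise ValueError("cannot summarize an empty series")
    return fmean(values), pstdev(values)


def prosody_features(f0_hz, intensity_db, durations_s):
    """Build a 6-element vector: mean/std of F0, intensity, and duration.

    Assumption: f0_hz and intensity_db are per-frame values, durations_s
    are per-phoneme durations, and an F0 of 0 marks an unvoiced frame.
    """
    voiced = [f for f in f0_hz if f > 0]  # drop unvoiced frames
    features = []
    for series in (voiced, intensity_db, durations_s):
        features.extend(distribution_stats(series))
    return features
```

A vector like this could then be supplied to the decoder together with the phoneme features; how the two are actually combined is not shown in the visible text.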

Description

Technical field

[0001] The present application relates to the technical field of artificial intelligence, in particular to a speech synthesis method, electronic equipment, and a storage device.

Background technique

[0002] Speech synthesis refers to the technology of converting text into audio, so that the machine can speak according to the text. At present, speech synthesis has been applied in many scenarios such as intelligent customer service, novel reading, and intelligent vehicles. However, the current speech synthesis lacks emotion. For example, in the interaction scene, the intelligent customer service cannot empathize with the user during the interaction process, thus greatly reducing the user experience. In view of this, how to accurately incorporate emotion into synthesized audio has become a topic of great research value.

Contents of the invention

[0003] The main technical problem to be solved in this application is to provide a speech synthesis method, ele...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L15/16, G10L25/63
CPC: G10L13/02, G10L15/16, G10L25/63
Inventor: 王瑾薇, 胡亚军, 江源
Owner: UNIV OF SCI & TECH OF CHINA