Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method and device, electronic equipment and storage medium

A technology of speech synthesis and timbre, applied in the computer field, can solve the problems of poor synthesis effect and heavy mechanical sense of audio.

Active Publication Date: 2021-03-16
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF17 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the speech synthesis technology, the audio synthesized in the related art usually changes little from word to word, resulting in a heavy mechanical sense of the synthesized audio and poor synthesis effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and device, electronic equipment and storage medium
  • Speech synthesis method and device, electronic equipment and storage medium
  • Speech synthesis method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0022] The speech synthesis method, device, electronic device, and storage medium of the embodiments of the present application are described below with reference to the accompanying drawings.

[0023] figure 1 It is a schematic flowchart of a speech synthesis method provided according to the first embodiment of the present application.

[0024] Such as figure 1 A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech synthesis method and device, electronic equipment and a storage medium, and relates to the technical field of artificial intelligence such as deep learning and speechtechnology. The method comprises steps: in a process of performing voice synthesis on a to-be-synthesized text, obtaining timbre characteristics corresponding to a user identifier in combination withthe user identifier in a voice synthesis request, and obtaining at least one group of candidate rhythm characteristics of the to-be-synthesized text in combination with the user identifier; selectingone group from the at least one group of candidate rhythm features as the rhythm feature of the to-be-synthesized text; and performing voice synthesis according to the timbre features, the to-be-synthesized text and the rhythm features to obtain a synthesized audio corresponding to the to-be-synthesized text. Therefore, the synthesized audio of the to-be-synthesized text is synthesized by combining the timbre characteristics corresponding to the user identifier, the to-be-synthesized text and the rhythm characteristics, so that the obtained synthesized audio has the user voice characteristicscorresponding to the user identifier, the synthesized audio is more real and natural, and the voice synthesis effect is improved.

Description

technical field [0001] The present application relates to the field of computer technology, specifically to the field of artificial intelligence technology such as deep learning and speech technology, and in particular to speech synthesis methods, devices, electronic equipment and storage media. Background technique [0002] Speech synthesis (Text to Speech) is one of the important technologies and application directions in the field of artificial intelligence speech. It is the process of converting the text input by users or products into speech. The machine imitates the way humans "speak" and outputs anthropomorphic voices. It is mainly used in audio reading, man-machine dialogue, smart speakers, smart customer service and other scenarios, and is one of the main ways for people to interact naturally with machines. [0003] In the speech synthesis technology, the audio synthesized in the related art usually has little change between words, so that the synthesized audio has ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/10G10L25/30G10L25/27
CPCG10L13/02G10L13/10G10L25/30G10L25/27
Inventor 高占杰陈昌滨刘龙飞
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products