Speech synthesis method and device, model training method and device, equipment and storage medium

A speech synthesis and model technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problem of high cost of high-quality speech data acquisition, and achieve the effect of flexible methods and guaranteed accuracy.

Active Publication Date: 2021-02-12
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF11 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing speech synthesis technology uses a large amount of high-quality speech data to train the corresponding model, but the acquisition cost of high-quality speech data is very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and device, model training method and device, equipment and storage medium
  • Speech synthesis method and device, model training method and device, equipment and storage medium
  • Speech synthesis method and device, model training method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0044] figure 1 is a schematic diagram according to the first embodiment of the present application; as figure 1 As shown, this embodiment provides a speech synthesis method, which may specifically include the following steps:

[0045] S101. Based on the text information, timbre information, and prosody information of the speech to be synthesized, use a pre-trained...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech synthesis method and device, a model training method and device, equipment and a storage medium, and relates to the technical field of artificial intelligence such asmachine learning and intelligent speech. The specific implementation scheme is as follows: based on text information, timbre information and rhythm information of to-be-synthesized speech, generatingacoustic feature information of the to-be-synthesized speech by adopting a pre-trained speech synthesis model; based on the acoustic feature information of the to-be-synthesized voice, synthesizing the corresponding voice by adopting a pre-trained vocoder. By adopting the technical scheme of the invention, when the voice is synthesized, any text information, tone information and rhythm informationcan be randomly combined and synthesized into the desired voice, and the voice synthesis mode is very flexible and convenient.

Description

technical field [0001] The present application relates to the field of computer technology, specifically to the field of artificial intelligence technology such as machine learning and intelligent speech, and in particular to a speech synthesis method, a model training method, a device, a device, and a storage medium. Background technique [0002] In recent years, with the maturity of speech technology, speech synthesis technology is gradually being applied to speech signal processing systems such as speech interaction, sound broadcast, and personalized sound production. In the field of society and business, synthetic sound, as a manifestation of sound, brings convenience and richness to social life, and has potentially broad use value. [0003] The existing speech synthesis technology uses a large amount of high-quality speech data to train the corresponding model, but the acquisition cost of high-quality speech data is very high. Personalized speech synthesis can use a sm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L13/10G10L25/30G10L19/16
CPCG10L13/08G10L13/10G10L25/30G10L19/16
Inventor 王俊超陈昌滨袁俊聂志朋
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products