Speech synthesis method and device, model training method and device, equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech synthesis and model technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problem of high cost of high-quality speech data acquisition, and achieve the effect of flexible methods and guaranteed accuracy.

Active Publication Date: 2021-02-12

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF11 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The existing speech synthesis technology uses a large amount of high-quality speech data to train the corresponding model, but the acquisition cost of high-quality speech data is very high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0043] Exemplary embodiments of the present application are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0044] figure 1 is a schematic diagram according to the first embodiment of the present application; as figure 1 As shown, this embodiment provides a speech synthesis method, which may specifically include the following steps:

[0045] S101. Based on the text information, timbre information, and prosody information of the speech to be synthesized, use a pre-trained...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech synthesis method and device, a model training method and device, equipment and a storage medium, and relates to the technical field of artificial intelligence such asmachine learning and intelligent speech. The specific implementation scheme is as follows: based on text information, timbre information and rhythm information of to-be-synthesized speech, generatingacoustic feature information of the to-be-synthesized speech by adopting a pre-trained speech synthesis model; based on the acoustic feature information of the to-be-synthesized voice, synthesizing the corresponding voice by adopting a pre-trained vocoder. By adopting the technical scheme of the invention, when the voice is synthesized, any text information, tone information and rhythm informationcan be randomly combined and synthesized into the desired voice, and the voice synthesis mode is very flexible and convenient.

Description

technical field [0001] The present application relates to the field of computer technology, specifically to the field of artificial intelligence technology such as machine learning and intelligent speech, and in particular to a speech synthesis method, a model training method, a device, a device, and a storage medium. Background technique [0002] In recent years, with the maturity of speech technology, speech synthesis technology is gradually being applied to speech signal processing systems such as speech interaction, sound broadcast, and personalized sound production. In the field of society and business, synthetic sound, as a manifestation of sound, brings convenience and richness to social life, and has potentially broad use value. [0003] The existing speech synthesis technology uses a large amount of high-quality speech data to train the corresponding model, but the acquisition cost of high-quality speech data is very high. Personalized speech synthesis can use a sm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/08G10L13/10G10L25/30G10L19/16

CPCG10L13/08G10L13/10G10L25/30G10L19/16Y02T10/40

Inventor 王俊超陈昌滨袁俊聂志朋

Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD

Speech synthesis method and device, model training method and device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology