Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech synthesis method, model training method and device

A technology of speech synthesis and model training, applied in the computer field, to achieve the effect of improving user experience, improving training accuracy, and improving flexibility

Pending Publication Date: 2022-05-17
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, current speech synthesis schemes cannot achieve multi-speaker and multi-style speech synthesis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method, model training method and device
  • Speech synthesis method, model training method and device
  • Speech synthesis method, model training method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0043] In the technical solution of this disclosure, the collection, storage, use, processing, transmission, provision, and disclosure of user personal information involved are all in compliance with relevant laws and regulations, and do not violate public order and good customs.

[0044] In the description of the embodiments of the present disclosure, it should be unde...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speech synthesis method and device and a model training method and device, and relates to the technical field of computers, in particular to the technical field of speech synthesis and speech transcription. According to the specific technical scheme, the method comprises the steps of obtaining a to-be-processed text, a speech style identifier and a speaker identifier; performing feature extraction based on the text to obtain text features; performing feature extraction based on the text and the speech style identifier to obtain style features; performing feature extraction based on the speaker identifier to obtain speaker features; a synthetic audio is obtained based on the textual feature, the style feature, and the speaker feature. According to the technical scheme, the multi-style speech synthesis requirements of multiple speakers can be met.

Description

technical field [0001] The present disclosure relates to the technical field of computers, in particular to the technical fields of speech synthesis and speech transcription, and in particular to a speech synthesis method, a model training method and a device. Background technique [0002] Speech synthesis (Text To Speech, TTS) technology can meet the needs of converting text into anthropomorphic voice, and open up the closed loop of human-computer interaction. This technology is widely applicable to business scenarios such as intelligent customer service, audio reading, news broadcast, human-computer interaction, etc. It improves the human-computer interaction experience and improves the construction efficiency of voice applications. However, current speech synthesis schemes cannot achieve multi-speaker and multi-style speech synthesis. Contents of the invention [0003] The disclosure provides a speech synthesis method, a model training method and a device. [0004] Ac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L13/02G10L25/24G10L25/30
CPCG10L15/02G10L15/063G10L13/02G10L25/24G10L25/30
Inventor 赵情恩
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD