Voice synthesis method and device, storage medium and electronic equipment

A speech synthesis and speech feature technology, applied in the computer field, that addresses problems such as error accumulation, slow speech synthesis, and reduced speech synthesis accuracy.

Active Publication Date: 2020-07-10

BEIJING BYTEDANCE NETWORK TECH CO LTD


Problems solved by technology

In the prior art, when multiple models collaborate to synthesize speech, synthesis is slow and errors tend to accumulate, which reduces the accuracy of speech synthesis.


Examples


[0082] Optionally, to broaden the applicability of the speech synthesis method above, background music may be added to the audio information after the audio information corresponding to the text to be synthesized is obtained in S13, so that the method fits a wider range of usage scenarios. Specifically, the method may further include the following steps.

[0083] The audio information is synthesized with the background music to obtain audio information that corresponds to both the text to be synthesized and the background music.
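The source does not say how the audio information and the background music are combined. As a minimal sketch, assuming both are mono waveforms at the same sample rate with samples in [-1.0, 1.0], the music can be looped to the length of the speech, attenuated, and added sample by sample. The function name `mix_with_background` and the `music_gain` parameter are hypothetical and do not come from the patent.

```python
from itertools import cycle, islice
from typing import List, Sequence


def mix_with_background(
    speech: Sequence[float],
    music: Sequence[float],
    music_gain: float = 0.25,
) -> List[float]:
    """Overlay attenuated background music on synthesized speech.

    Assumption of this sketch: both inputs are mono waveforms at the
    same sample rate, with samples in the range [-1.0, 1.0].
    """
    if not music:
        # Nothing to mix in; return the speech unchanged.
        return list(speech)
    # Repeat the music so it covers the whole utterance, then trim it
    # to exactly the number of speech samples.
    looped = islice(cycle(music), len(speech))
    mixed = (s + music_gain * m for s, m in zip(speech, looped))
    # Hard-clip so the summed signal stays inside the valid range.
    return [max(-1.0, min(1.0, x)) for x in mixed]


if __name__ == "__main__":
    speech = [0.0, 0.5, -0.5, 0.875]
    music = [1.0, -1.0]
    print(mix_with_background(speech, music, music_gain=0.25))
```

A fixed gain keeps the sketch short; a practical system would more likely lower the music level only while speech is present and use a softer limiter than hard clipping.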

[0084] In one implementation, the background music may be preset music, that is, either music set by the user or default music.

[0085] In another embodiment, the music indicated by a background music selection instruction triggered by the user may be determined as the background music, so that the user can flexibly and dynamically select the corresponding background mu...
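The two embodiments above (preset or default music, and music named by a selection instruction) can be sketched as one lookup. The priority order used here, selection instruction first, then the user's preset, then the default, is an assumption of this sketch; the source does not state how the embodiments interact. The names `choose_background_music` and `DEFAULT_MUSIC` are hypothetical.

```python
from typing import Optional

# Hypothetical identifier for the default track.
DEFAULT_MUSIC = "default_track"


def choose_background_music(
    selected: Optional[str] = None,
    preset: Optional[str] = None,
    default: str = DEFAULT_MUSIC,
) -> str:
    """Return the identifier of the track to use as background music.

    Assumed priority (not stated in the source): the track named by a
    user-triggered selection instruction, then the user's preset track,
    then the default track.
    """
    if selected:
        return selected
    if preset:
        return preset
    return default


if __name__ == "__main__":
    print(choose_background_music())
    print(choose_background_music(preset="user_preset"))
    print(choose_background_music(selected="picked_now", preset="user_preset"))
```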


Abstract

The invention relates to a voice synthesis method and device, a storage medium and electronic equipment. The method comprises: inputting a to-be-synthesized text into an information extraction model to obtain voice feature information corresponding to the to-be-synthesized text; inputting the voice feature information into a voice synthesis model to obtain acoustic features corresponding to the to-be-synthesized text, wherein the voice synthesis model comprises a duration sub-model and an acoustic sub-model and is obtained by jointly training the two sub-models; and obtaining audio information corresponding to the to-be-synthesized text according to the acoustic features. The acoustic features can therefore be obtained directly from the voice feature information through a single voice synthesis model, without cooperation among multiple models, which improves voice synthesis efficiency, effectively reduces error accumulation, and improves the accuracy of the voice synthesis method.
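The data flow in the abstract can be sketched as three stages: information extraction, one synthesis model that contains both sub-models, and conversion of acoustic features to audio. This is only an illustration under stated assumptions. All names are hypothetical, the toy stand-ins are not real models, and feeding the predicted durations into the acoustic sub-model is an assumption, since the abstract only says the two sub-models are jointly trained.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Frame = List[float]


@dataclass
class SpeechSynthesisModel:
    """One model holding a duration sub-model and an acoustic sub-model."""

    duration_submodel: Callable[[Sequence[str]], List[int]]
    acoustic_submodel: Callable[[Sequence[str], Sequence[int]], List[Frame]]

    def __call__(self, features: Sequence[str]) -> List[Frame]:
        # Assumption: durations are predicted first and then passed to
        # the acoustic sub-model inside the same model.
        durations = self.duration_submodel(features)
        return self.acoustic_submodel(features, durations)


def synthesize(
    text: str,
    extract_features: Callable[[str], List[str]],
    synthesis_model: Callable[[Sequence[str]], List[Frame]],
    features_to_audio: Callable[[List[Frame]], List[float]],
) -> List[float]:
    """Text -> voice feature information -> acoustic features -> audio."""
    features = extract_features(text)
    acoustic_features = synthesis_model(features)
    return features_to_audio(acoustic_features)


# Toy stand-ins so the sketch runs end to end; they are not real models.
def toy_extract(text: str) -> List[str]:
    # Treat each character as one unit of voice feature information.
    return list(text)


def toy_duration(features: Sequence[str]) -> List[int]:
    # Give every unit a fixed duration of two frames.
    return [2 for _ in features]


def toy_acoustic(features: Sequence[str], durations: Sequence[int]) -> List[Frame]:
    # Emit one single-value frame per unit, repeated for its duration.
    return [[float(ord(f))] for f, d in zip(features, durations) for _ in range(d)]


def toy_features_to_audio(frames: List[Frame]) -> List[float]:
    # Flatten the frames into a flat list of samples.
    return [value for frame in frames for value in frame]


model = SpeechSynthesisModel(toy_duration, toy_acoustic)
audio = synthesize("ab", toy_extract, model, toy_features_to_audio)
print(audio)
```

Wrapping both sub-models in a single callable mirrors the abstract's point that acoustic features come from one model rather than from several separately built models.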

Description

Technical field

[0001] The present disclosure relates to the field of computer technology, and in particular to a speech synthesis method and device, a storage medium, and electronic equipment.

Background

[0002] In the prior art, multiple models are usually constructed for speech synthesis, and text is converted into speech by these models working together. However, when multiple models collaborate in this way, speech synthesis is slow and errors tend to accumulate, which reduces the accuracy of speech synthesis.

Summary of the invention

[0003] This Summary introduces, in simplified form, concepts that are described in detail later in the Detailed Description. It is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to limit the scope of the claimed technical solution.

[0004]...


Application Information


IPC(8): G10L13/02, G10L13/04, G10L13/08, G10L13/10, G10L25/03, G10L25/18, G10L25/24, G10L25/30, G10L19/02, G10L19/18

CPC: G10L13/02, G10L13/08, G10L13/10, G10L19/0212, G10L19/18, G10L25/03, G10L25/18, G10L25/24, G10L25/30

Inventor: 殷翔

Owner: BEIJING BYTEDANCE NETWORK TECH CO LTD
