Voice synthesis method and device, storage medium and electronic equipment

A speech synthesis and speech feature technology, applied in the computer field, can solve problems such as error accumulation, slow speech synthesis efficiency, and affecting speech synthesis accuracy

Active Publication Date: 2020-07-10
BEIJING BYTEDANCE NETWORK TECH CO LTD
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the prior art, when multiple models are used for collaboration, the efficiency of speech syn

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice synthesis method and device, storage medium and electronic equipment
  • Voice synthesis method and device, storage medium and electronic equipment
  • Voice synthesis method and device, storage medium and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

preparation example Construction

[0082] Optionally, in order to further broaden the applicability of the speech synthesis method above, after the audio information corresponding to the text to be synthesized is obtained in S13 above, background music may also be added to the audio information to further fit various usage scenarios. Specifically, the above method may further include the following steps.

[0083] Synthesizing the audio information and background music to obtain audio information corresponding to the text to be synthesized and the background music.

[0084] In an implementation manner, the above-mentioned background music may be preset music, that is, any music set by the user, or default music.

[0085] In another embodiment, according to the background music selection instruction triggered by the user, the music indicated by the background music selection instruction can be determined as the background music, so that the user can diversify and dynamically select the corresponding background mu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a voice synthesis method and device, a storage medium and electronic equipment, and the method comprises the steps: inputting a to-be-synthesized text into an information extraction model, and obtaining voice feature information corresponding to the to-be-synthesized text; inputting the voice feature information into a voice synthesis model to obtain acoustic features corresponding to the to-be-synthesized text, the voice synthesis model comprising a duration sub-model and an acoustics sub-model, and the duration sub-model and the acoustics sub-model being subjected tojoint training to obtain the voice synthesis model; and obtaining audio information corresponding to the to-be-synthesized text according to the acoustic features. Therefore, the acoustic features can be directly obtained through the voice synthesis model according to the voice feature information corresponding to the to-be-synthesized text without cooperation of multiple models, so that the voice synthesis efficiency can be improved, the error accumulation can be effectively reduced, and the accuracy of the voice synthesis method is improved.

Description

technical field [0001] The present disclosure relates to the field of computer technology, and in particular, to a speech synthesis method, device, storage medium and electronic equipment. Background technique [0002] In the prior art, multiple models are usually constructed during speech synthesis, so that text is converted into speech cooperatively based on the multiple models. However, in the prior art, when multiple models are used for collaboration, the efficiency of speech synthesis is slow, and error accumulation is prone to occur, thereby affecting the accuracy of speech synthesis. Contents of the invention [0003] This Summary is provided to introduce a simplified form of concepts that are described in detail later in the Detailed Description. This summary of the invention is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to be used to limit the scope of the claimed technical solution. [0004]...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L13/02G10L13/04G10L13/08G10L13/10G10L25/03G10L25/18G10L25/24G10L25/30G10L19/02G10L19/18
CPCG10L13/02G10L13/08G10L13/10G10L19/0212G10L19/18G10L25/03G10L25/18G10L25/24G10L25/30
Inventor 殷翔
Owner BEIJING BYTEDANCE NETWORK TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products