Speech synthesis method and device and computer readable storage medium

A speech synthesis and storage medium technology, applied in speech synthesis, speech analysis, instruments, etc., that addresses the problems of flat, emotionless sound and incoherent splicing.

Pending Publication Date: 2021-03-16

出门问问(苏州)信息科技有限公司


Problems solved by technology

[0003] Speech generated by parametric speech synthesis has good sound quality, but it sounds flat, lacks emotion, and contains some background noise.

[0004] Speech generated by splicing (concatenative) speech synthesis has high sound quality, but a large number of recordings is required to cover different scenarios. In addition, the splicing algorithm often produces incoherent joins between segments.


Embodiment Construction

[0024] To make the purpose, features, and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those skilled in the art based on these embodiments without creative effort fall within the protection scope of the present invention.

[0025] Figure 1 is a schematic diagram of the implementation flow of a speech synthesis method according to an embodiment of the present invention.

[0026] Figure 2 is a schematic diagram of the process of using a duration model and an acoustic model in a speech synthesis method according to an embodiment of the present invention.

...


Abstract

The invention discloses a speech synthesis method and device and a computer-readable storage medium. The method comprises the following steps: obtaining text information and inputting it into an acoustic model based on an end-to-end neural network for encoding, thereby generating a first content vector that summarizes the text information; receiving speech duration information for each piece of sub-text information in the text information; adjusting, according to the received speech duration information, the speech duration of the sub-text information in the first content vector, thereby generating a second content vector; and generating speech information corresponding to the text information according to the second content vector. Because the duration of each piece of sub-text in the first content vector is controlled during synthesis, the generated speech becomes emotional, coherent, and smooth without any loss of sound quality.
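The abstract does not state how the duration adjustment is carried out. One common way to realize such a step is length regulation: each sub-text unit's encoded vector is repeated for as many frames as its duration specifies. The sketch below illustrates the data flow under that assumption only. The names `encode_text` and `adjust_durations`, the toy encoder, and the frame counts are all hypothetical and do not come from the patent.

```python
from typing import List, Sequence

Vector = List[float]


def encode_text(units: Sequence[str], dim: int = 4) -> List[Vector]:
    """Toy stand-in for the neural encoder (assumption): one vector per sub-text unit."""
    return [[float((sum(map(ord, unit)) + i) % 10) for i in range(dim)]
            for unit in units]


def adjust_durations(first: Sequence[Vector],
                     durations: Sequence[int]) -> List[Vector]:
    """Repeat each unit's vector durations[i] times (length regulation, assumed)."""
    if len(first) != len(durations):
        raise ValueError("need exactly one duration per sub-text unit")
    if any(d < 0 for d in durations):
        raise ValueError("durations must be non-negative frame counts")
    second: List[Vector] = []
    for vector, frames in zip(first, durations):
        # Copy the vector once per output frame so frames stay independent.
        second.extend(list(vector) for _ in range(frames))
    return second


units = ["ni", "hao"]                       # hypothetical sub-text units
first_content = encode_text(units)          # "first content vector"
second_content = adjust_durations(first_content, [2, 3])  # "second content vector"
print(len(first_content), len(second_content))  # prints "2 5"
```

In this sketch the second content vector has one entry per output frame, so lengthening or shortening a unit only changes how often its vector is repeated. A real system would pass the result to a decoder and vocoder to produce audio.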

Description

Technical field

[0001] The present invention relates to the field of speech synthesis, in particular to a speech synthesis method, device and computer-readable storage medium.

Background technique

[0002] Speech synthesis refers to the technology that the computer automatically generates corresponding speech based on the text. The traditional speech synthesis technology is divided into parametric speech synthesis and splicing speech synthesis.

[0003] The sound quality generated by parametric speech synthesis is good. The disadvantage is that the sound is flat, lacks emotion, and contains some background sounds.

[0004] The voice generated by splicing speech synthesis has high sound quality. The disadvantage is that a large number of recorded sounds are required to meet the needs of different scenarios. In addition, the splicing algorithm often has the problem of splicing incoherence.

Contents of the invention

[0005] Embodiments of the present invention provide a spe...


Application Information


IPC(8): G10L13/02, G10L13/08, G10L19/18, G10L25/30

CPC: G10L13/02, G10L13/08, G10L19/18, G10L25/30

Inventors: 江明奇, 陈云琳, 殷昊, 杨喜鹏, 张旭

Owner: 出门问问(苏州)信息科技有限公司
