Speech synthesis method and device and computer readable storage medium

A speech synthesis and storage medium technology, applied in speech synthesis, speech analysis, instruments, etc., that can solve the problems of dull, emotionless sound and incoherent splicing.

Pending Publication Date: 2021-03-16
出门问问(苏州)信息科技有限公司

AI Technical Summary

Problems solved by technology

[0003] Parametric speech synthesis produces good sound quality, but the sound is flat, lacks emotion, and contains some background noise.
[0004] Splicing (concatenative) speech synthesis produces high sound quality, but it requires a large number of recordings to cover different scenarios, and the splicing algorithm often produces incoherent joins.

Embodiment Construction

[0024] To make the purpose, features, and advantages of the present invention clearer and easier to understand, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those skilled in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.

[0025] Figure 1 is a schematic diagram of the implementation flow of a speech synthesis method according to an embodiment of the present invention;

[0026] Figure 2 is a schematic diagram of how a duration model and an acoustic model are used in a speech synthesis method according to an embodiment of the present invention.

...

Abstract

The invention discloses a speech synthesis method and device and a computer-readable storage medium. The method comprises: obtaining text information; inputting the text information into an acoustic model based on an end-to-end neural network for encoding, which generates a first content vector summarizing the text information; receiving speech duration information for each piece of sub-text information in the text information; adjusting, according to the received speech duration information, the speech duration of the sub-text information in the first content vector to generate a second content vector; and generating speech information corresponding to the text information from the second content vector. Because the text duration in the first content vector is controlled during speech synthesis, the generated sound becomes emotional, coherent, and smooth without any reduction in sound quality.
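
The steps in the abstract can be illustrated with a small Python sketch. It is not the patented implementation: the abstract does not say how the duration adjustment is performed, so the sketch assumes a simple frame-repetition scheme (in the style of a length regulator), treats each character as one piece of sub-text information, and replaces the neural encoder and decoder with toy placeholders. The names `encode_text`, `adjust_durations`, and `synthesize` are hypothetical.

```python
from typing import List

Vector = List[float]


def encode_text(text: str, dim: int = 4) -> List[Vector]:
    """Toy stand-in for the end-to-end acoustic model's encoder (assumption).

    Returns the "first content vector" as one small vector per character.
    """
    return [[float((ord(ch) * (k + 1)) % 7) for k in range(dim)] for ch in text]


def adjust_durations(first: List[Vector], durations: List[int]) -> List[Vector]:
    """Build the "second content vector" by repeating each sub-text vector.

    durations[i] is the number of output frames requested for unit i.
    Frame repetition is an assumed mechanism, not one stated in the source.
    """
    if len(first) != len(durations):
        raise ValueError("need exactly one duration per sub-text unit")
    second: List[Vector] = []
    for vec, frames in zip(first, durations):
        if frames < 0:
            raise ValueError("durations must be non-negative")
        second.extend(list(vec) for _ in range(frames))
    return second


def synthesize(second: List[Vector]) -> List[float]:
    """Placeholder decoder: emits one value per frame (the vector mean)."""
    return [sum(vec) / len(vec) for vec in second]


if __name__ == "__main__":
    first = encode_text("hi")                 # two sub-text units
    second = adjust_durations(first, [2, 3])  # hold "h" for 2 frames, "i" for 3
    speech = synthesize(second)
    print(len(first), len(second), len(speech))
```

The point of the sketch is only the data flow: the duration information changes the length of the encoded sequence before decoding, so the timing of each sub-text unit is fixed independently of the decoder.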

Description

Technical field

[0001] The present invention relates to the field of speech synthesis, in particular to a speech synthesis method, device, and computer-readable storage medium.

Background art

[0002] Speech synthesis is the technology by which a computer automatically generates speech corresponding to a given text. Traditional speech synthesis technology is divided into parametric speech synthesis and splicing (concatenative) speech synthesis.

[0003] Parametric speech synthesis produces good sound quality, but the sound is flat, lacks emotion, and contains some background noise.

[0004] Splicing speech synthesis produces high sound quality, but it requires a large number of recordings to cover different scenarios, and the splicing algorithm often produces incoherent joins.

Summary of the invention

[0005] Embodiments of the present invention provide a spe...

Application Information

IPC(8): G10L13/02, G10L13/08, G10L19/18, G10L25/30
CPC: G10L13/02, G10L13/08, G10L19/18, G10L25/30
Inventor: 江明奇, 陈云琳, 殷昊, 杨喜鹏, 张旭
Owner: 出门问问(苏州)信息科技有限公司