Speech synthesis method, related device, equipment and storage medium

A technology of speech synthesis and speech, which is applied in speech synthesis, speech analysis, instruments, etc. It can solve the problems of unsatisfactory synthesis speed, low sound quality, large amount of calculation of vocoder, etc., and achieve less running times, fast running speed, The effect of reducing the amount of calculation

Pending Publication Date: 2022-04-08
UNIV OF SCI & TECH OF CHINA +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In recent years, with the continuous development of technology, neural network vocoders with high naturalness and high sound quality have appeared one after another, but these vocoders often have a large amount of calculation and the synthesis speed is not ideal
Although the vocoder based on traditional signal processing has a fast synthesis speed, the sound quality is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method, related device, equipment and storage medium
  • Speech synthesis method, related device, equipment and storage medium
  • Speech synthesis method, related device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The solutions of the embodiments of the present application will be described in detail below in conjunction with the accompanying drawings.

[0022] In the following description, for purposes of illustration rather than limitation, specific details, such as specific system architectures, interfaces, and techniques, are set forth in order to provide a thorough understanding of the present application.

[0023] The terms "system" and "network" are often used interchangeably herein. The term "and / or" in this article is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and / or B can mean: A exists alone, A and B exist simultaneously, and there exists alone B these three situations. In addition, the character " / " in this article generally indicates that the contextual objects are an "or" relationship. In addition, "many" herein means two or more than two.

[0024] see figure 1 , figure 1 It ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech synthesis method, a related device, equipment and a storage medium, and the method comprises the steps: extracting a plurality of frame-level acoustic features based on a to-be-synthesized text; performing prediction based on each frame-level acoustic feature to obtain an acoustic parameter corresponding to each frame-level acoustic feature; based on the excitation parameter, the noise parameter and the acoustic parameter corresponding to the frame-level acoustic feature, performing fusion to obtain a frequency spectrum value corresponding to the frame-level acoustic feature; and based on the frequency spectrum value corresponding to each frame-level acoustic feature, obtaining a synthetic speech. According to the scheme, the speech synthesis efficiency and quality can be improved.

Description

technical field [0001] The present application relates to the technical field of speech synthesis, in particular to a speech synthesis method and related devices, equipment and storage media. Background technique [0002] Speech synthesis is a method of converting text into speech, mainly including front-end, acoustic model and vocoder. The vocoder is a method of converting speech features such as spectrum into speech, and is an important part of the speech synthesis system. In recent years, with the continuous development of technology, neural network vocoders with high naturalness and high sound quality have appeared one after another, but these vocoders often have a large amount of calculation and the synthesis speed is not ideal. Although the vocoder based on traditional signal processing has a fast synthesis speed, the sound quality is not high. In view of this, under the premise of ensuring the naturalness and sound quality of speech synthesis, how to realize an effi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L19/02G10L19/08G10L19/16G10L25/60
Inventor 钟良胡亚军伍宏传江源
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products