Speech synthesis system based on mixed hidden Markov model

A speech synthesis technology based on hidden Markov models, applied in speech synthesis, speech analysis, instruments, etc., that addresses problems such as time-domain over-smoothing and the inability to describe parameter detail within a long-lasting state

Status: Inactive. Publication Date: 2009-07-01
INST OF AUTOMATION CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

If a state lasts for a long time, the mean of the Gaussian function corresponding to that state cannot by itself describe the detailed changes of the speech parameters within the state, which causes severe over-smoothing in the time domain.
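The problem can be illustrated with a minimal C sketch. It assumes a deliberately simplified generator that emits only each state's Gaussian mean, one value per frame; the `State` type and both function names are hypothetical and are not taken from the patent. Within a long state the generated trajectory is constant, so its variance is zero and no parameter detail survives.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical state: one scalar Gaussian mean and a duration in frames. */
typedef struct {
    double mean;
    size_t duration;
} State;

/* Emit each state's mean once per frame; returns the number of frames written.
 * This is an illustrative simplification, not the patent's generator. */
size_t generate_from_means(const State *states, size_t n_states,
                           double *out, size_t out_cap)
{
    size_t t = 0;
    for (size_t s = 0; s < n_states; ++s) {
        for (size_t d = 0; d < states[s].duration; ++d) {
            assert(t < out_cap);      /* caller must size the buffer */
            out[t++] = states[s].mean;
        }
    }
    return t;
}

/* Population variance of x[begin..end). */
double segment_variance(const double *x, size_t begin, size_t end)
{
    assert(end > begin);
    size_t n = end - begin;
    double mean = 0.0, var = 0.0;
    for (size_t i = begin; i < end; ++i)
        mean += x[i];
    mean /= (double)n;
    for (size_t i = begin; i < end; ++i)
        var += (x[i] - mean) * (x[i] - mean);
    return var / (double)n;
}
```

For a state with mean 1.0 lasting five frames, `segment_variance` over those five frames is exactly zero, however much the natural speech varied there.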



Embodiment Construction

[0037] The present invention is further described below with reference to the accompanying drawings and examples. The steps and processes for realizing it are explained through a detailed description of each component of the system. The described examples are illustrative only and are not intended to limit the invention.

[0038] Figure 1 is a schematic diagram of the speech synthesis system based on the hybrid hidden Markov model of the present invention. The system is written in C; it can be compiled and run with Visual Studio on Windows, and with gcc on Linux. In the preferred embodiment shown in Figure 1, the system is divided into four parts: a spectrum information generation module 1, a fundamental frequency information generation mod...


Abstract

The invention relates to a speech synthesis system based on a hybrid hidden Markov model. A spectrum information generation module receives arbitrary text, selects the codebook vectors that represent the spectrum information, and outputs that spectrum information. A fundamental frequency information generation module receives the text, predicts the pitch variation of the sentence to be synthesized, and outputs a fundamental frequency curve. A parametric speech synthesizer module receives the spectrum information and the fundamental frequency information from these two modules and outputs the synthesized speech. An offline training module trains the various hidden Markov models. The discrete hidden Markov model gives the output probability of real spectrum vectors, which guarantees the accuracy of the spectrum information, and the spectrum produced by the codebook selection algorithm does not exhibit time-domain over-smoothing. The system improves the clarity of the output of a parametric speech synthesis system and greatly improves its fidelity, bringing it close to the quality of a splicing-based speech synthesis system.
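A discrete hidden Markov model works on codebook indices rather than on continuous vectors, so real spectrum vectors are normally mapped to a codebook first. The following C sketch shows one common way to do that mapping, a nearest-neighbor search under squared Euclidean distance. It is an assumption for illustration only: the function name, the row-major codebook layout, and the distance measure are hypothetical, and the patent's own codebook selection algorithm is not described in this excerpt.

```c
#include <assert.h>
#include <float.h>
#include <stddef.h>

/* Return the index of the codebook entry closest to x.
 * codebook holds n_codewords rows of dim values each (row-major).
 * Hypothetical helper; squared Euclidean distance is an assumption. */
size_t nearest_codeword(const double *codebook, size_t n_codewords,
                        size_t dim, const double *x)
{
    assert(n_codewords > 0 && dim > 0);
    size_t best = 0;
    double best_dist = DBL_MAX;
    for (size_t k = 0; k < n_codewords; ++k) {
        double dist = 0.0;
        for (size_t d = 0; d < dim; ++d) {
            double diff = x[d] - codebook[k * dim + d];
            dist += diff * diff;
        }
        if (dist < best_dist) {   /* ties keep the lowest index */
            best_dist = dist;
            best = k;
        }
    }
    return best;
}
```

Because every codebook entry is itself a real spectrum vector, a selected entry carries genuine spectral detail instead of an average over many frames.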

Description

Technical field

[0001] The present invention relates to a speech synthesis system, in particular to a speech synthesis system based on a hybrid hidden Markov model.

Background

[0002] A speech synthesis system, also known as a text-to-speech (TTS) system, converts any text string received or input by a computer into speech output. Traditional speech synthesis systems are based on unit splicing; their sound quality is good, but the sound library they require is relatively large, which creates a bottleneck for their use in embedded devices. A speech synthesis system based on the hidden Markov model is essentially a parametric synthesis system, with the advantages of high flexibility and a small storage footprint. However, because of its parametric nature, its sound quality is usually much worse than that of splicing-based synthesis systems, which is also the bottleneck of the cu...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/06, G10L13/08, G10L13/027
Inventors: Tao Jianhua (陶建华), Yu Jian (于剑), Zhang Meng (张蒙)
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI