Unit selection speech synthesis method based on an acoustic statistical model

A speech synthesis technology based on a statistical model, applicable to speech synthesis, speech analysis, instruments, and related fields, that addresses the unsatisfactory sound quality of restored speech.

Active Publication Date: 2008-05-14
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

Because of the limitations of the parametric synthesizer, the sound quality of parametric synthesis based on an acoustic statistical model is often not ideal.

Embodiment Construction

[0037] See the attached figure. The unit selection speech synthesis method based on an acoustic statistical model is implemented in the following steps:

[0038] (1). Extract the acoustic features of the training corpus

[0039] The acoustic features extracted here are the spectral and fundamental-frequency parameters of each frame. The spectral parameters are mel-cepstrum parameters, and the fundamental-frequency parameters are logarithmic F0 values. Dynamic parameters, which describe how the frame parameters change from frame to frame, are also included. Take the spectral feature $s_{n,i}$ of the i-th frame of phoneme n as an example:

[0040] $s_{n,i} = [c_{n,i}^{T}, \Delta$ ...
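The dynamic parameters mentioned in [0039] are commonly built by appending frame-to-frame differences to the static parameters. The sketch below is only an illustration under stated assumptions: the excerpt truncates the equation, so the central-difference delta, the second-difference delta-delta, the edge replication, and the function names `deltas`, `delta_deltas`, and `add_dynamic_features` are all hypothetical choices rather than the patent's definition.

```python
from typing import List

Frame = List[float]


def deltas(frames: List[Frame]) -> List[Frame]:
    """First-order dynamics: 0.5 * (x[t+1] - x[t-1]), edge frames replicated."""
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]
        nxt = frames[min(t + 1, n - 1)]
        out.append([0.5 * (b - a) for a, b in zip(prev, nxt)])
    return out


def delta_deltas(frames: List[Frame]) -> List[Frame]:
    """Second-order dynamics: x[t-1] - 2*x[t] + x[t+1], edge frames replicated."""
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]
        cur = frames[t]
        nxt = frames[min(t + 1, n - 1)]
        out.append([a - 2.0 * c + b for a, c, b in zip(prev, cur, nxt)])
    return out


def add_dynamic_features(static: List[Frame]) -> List[Frame]:
    """Concatenate static, delta, and delta-delta parameters for every frame."""
    d1 = deltas(static)
    d2 = delta_deltas(static)
    return [c + a + b for c, a, b in zip(static, d1, d2)]


if __name__ == "__main__":
    # Three frames of a one-dimensional "mel-cepstrum" track (toy values).
    print(add_dynamic_features([[0.0], [1.0], [4.0]]))
```

The same routine applies unchanged to a log-F0 track treated as a one-dimensional frame sequence.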

Abstract

The invention relates to a unit selection speech synthesis method based on an acoustic statistical model. The method comprises: extracting the acoustic features of the training corpus; training a statistical model for each type of acoustic feature, using the segmental and prosodic annotation of each sentence in the training corpus; and, at synthesis time, obtaining through text analysis of the input text the statistical model of each acoustic feature for each phoneme of the sentence to be synthesized. The optimal candidate units are searched for under a maximum-likelihood criterion between the acoustic parameters of the candidate unit sequence and the acoustic statistical models of the sentence to be synthesized. The Kullback-Leibler divergence (KLD) between acoustic statistical models is used for fast pre-selection of synthesis units. Finally, the synthesized speech of a sentence is obtained by smoothing and concatenating the waveforms of the optimal candidate unit of each phoneme. The invention improves on the synthesized speech quality of traditional concatenative synthesis, and makes system construction automatic and independent of the language.
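The two selection stages named in the abstract, KLD-based pre-selection followed by a maximum-likelihood choice, can be illustrated with a toy example. This is a minimal sketch under assumptions not stated in the source: each model is a single diagonal-covariance Gaussian, the KLD is symmetrized, likelihood is averaged per frame so that unit duration does not dominate, and the names `Gaussian`, `Candidate`, `preselect`, and `best_unit` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Gaussian:
    mean: Sequence[float]
    var: Sequence[float]  # diagonal covariance (assumption)


@dataclass
class Candidate:
    name: str
    model: Gaussian             # model associated with this unit in training (assumed)
    frames: List[List[float]]   # acoustic parameters of the unit, one list per frame


def kl(p: Gaussian, q: Gaussian) -> float:
    """KL(p || q) for diagonal-covariance Gaussians."""
    total = 0.0
    for mp, vp, mq, vq in zip(p.mean, p.var, q.mean, q.var):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def symmetric_kld(p: Gaussian, q: Gaussian) -> float:
    return kl(p, q) + kl(q, p)


def log_likelihood(frames: List[List[float]], model: Gaussian) -> float:
    """Sum of per-frame Gaussian log densities."""
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, model.mean, model.var):
            total += -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
    return total


def preselect(target: Gaussian, candidates: List[Candidate], keep: int) -> List[Candidate]:
    """Cheap stage: keep the candidates whose models are closest to the target model."""
    return sorted(candidates, key=lambda c: symmetric_kld(target, c.model))[:keep]


def best_unit(target: Gaussian, candidates: List[Candidate], keep: int = 2) -> Candidate:
    """Costlier stage: score only the shortlist by average frame log-likelihood."""
    shortlist = preselect(target, candidates, keep)
    return max(shortlist, key=lambda c: log_likelihood(c.frames, target) / len(c.frames))
```

The point of the split is that the KLD depends only on model parameters, so it can be evaluated (or even tabulated in advance) without touching the frames of every candidate; the frame-level likelihood is then computed for the shortlist only.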

Description

Technical field

[0001] The invention relates to a unit selection method for waveform-concatenation speech synthesis. In particular, it guides the selection of speech segment units by designing and training a set of acoustic statistical models.

Background technique

[0002] Speech synthesis is an important technology for natural and efficient human-computer interaction. The two most common speech synthesis methods today are synthesis based on unit selection and waveform concatenation, and parametric synthesis based on an acoustic statistical model.

[0003] In the traditional unit selection algorithm, the target cost and the concatenation cost are usually computed from the difference between the context attributes of the units, or from the distance between the acoustic parameters of a candidate unit and the predicted target. The result of this is that the design of the cost function often requires the participation of language-relat...
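For orientation, the traditional search described in [0003] is usually posed as a shortest-path problem over candidate units. The sketch below is a generic dynamic-programming illustration, not the method of this patent: the function name `select_units` and the two cost callbacks are hypothetical, and the costs are whatever hand-designed distances a system chooses.

```python
from typing import Callable, List, Sequence, TypeVar

U = TypeVar("U")


def select_units(
    candidates: Sequence[Sequence[U]],
    target_cost: Callable[[int, U], float],
    concat_cost: Callable[[U, U], float],
) -> List[U]:
    """Pick one unit per position, minimizing summed target and concatenation costs."""
    if not candidates or any(len(c) == 0 for c in candidates):
        raise ValueError("every position needs at least one candidate")

    # best[j]: minimal cumulative cost of any path ending in candidates[t][j]
    best = [target_cost(0, u) for u in candidates[0]]
    back: List[List[int]] = []

    for t in range(1, len(candidates)):
        new_best = []
        pointers = []
        for u in candidates[t]:
            costs = [best[k] + concat_cost(p, u) for k, p in enumerate(candidates[t - 1])]
            k_min = min(range(len(costs)), key=costs.__getitem__)
            new_best.append(costs[k_min] + target_cost(t, u))
            pointers.append(k_min)
        best = new_best
        back.append(pointers)

    # Trace back from the cheapest final unit.
    j = min(range(len(best)), key=best.__getitem__)
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return [candidates[t][j] for t, j in enumerate(path)]
```

A toy call might treat each unit as a single pitch value, with `target_cost = lambda t, u: abs(u - targets[t])` and `concat_cost = lambda a, b: abs(a - b)`; both are placeholders for the hand-tuned costs the background section refers to.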

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/06, G10L13/08
Inventor: 凌震华, 胡郁, 胡国平, 吴晓如, 刘庆峰, 王仁华
Owner: IFLYTEK CO LTD