Unit selection speech synthesis method based on an acoustic statistical model

A statistical-model-based speech synthesis technology, applicable to speech synthesis, speech analysis, instruments, and related fields, that addresses the unsatisfactory sound quality of restored speech.

Active Publication Date: 2008-05-14

IFLYTEK CO LTD

Cites: 0; Cited by: 53


Problems solved by technology

However, because of the limitations of the parametric synthesizer, the sound quality of the parametric synthesis method is often not ideal.


Embodiment Construction

[0037] See the attached figure. The unit selection speech synthesis method based on an acoustic statistical model is implemented through the following steps:

[0038] (1) Extract the acoustic features of the training corpus

[0039] The acoustic features extracted here are the spectral and fundamental-frequency parameters of each frame. The spectral parameters are mel-cepstrum coefficients, and the fundamental-frequency parameters are logarithmic F0 values. Dynamic parameters describing how the frame parameters change are also included. Take the spectral feature s_{n,i} of the i-th frame of phoneme n as an example:

[0040] s_{n,i} = [c_{n,i}^T, Δ ...
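As a rough illustration of step (1), the sketch below appends dynamic parameters to the static per-frame features. The function name `add_deltas`, the difference windows, and the toy F0 values are assumptions made for illustration; the patent's own definition of the dynamic terms is cut off in this excerpt and may differ.

```python
import math

def add_deltas(static):
    """Append first- and second-order dynamic parameters to each frame.

    static: list of frames, each a list of floats (e.g. mel-cepstrum or log F0).
    The difference windows used here are an assumed, commonly used choice.
    """
    n = len(static)
    out = []
    for i, cur in enumerate(static):
        prev = static[max(i - 1, 0)]      # repeat the first frame at the left edge
        nxt = static[min(i + 1, n - 1)]   # repeat the last frame at the right edge
        delta = [0.5 * (b - a) for a, b in zip(prev, nxt)]
        delta2 = [a - 2.0 * c + b for a, c, b in zip(prev, cur, nxt)]
        out.append(list(cur) + delta + delta2)
    return out

# Toy example: four voiced frames with made-up F0 values in Hz.
log_f0 = [[math.log(f)] for f in (100.0, 110.0, 121.0, 121.0)]
features = add_deltas(log_f0)
print(len(features), len(features[0]))  # 4 3
```

Each output frame holds the static value followed by its first- and second-order differences, so a one-dimensional log F0 track becomes a three-dimensional feature per frame.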


Abstract

The invention relates to a unit selection speech synthesis method based on an acoustic statistical model. The method extracts the acoustic features of a training corpus; trains a statistical model for each kind of acoustic feature using the annotation information of each training sentence, such as its segmental and prosodic labels; and, at synthesis time, obtains the statistical model of each acoustic feature for each phoneme of the sentence to be synthesized through text analysis of the input text. The optimal candidate units are then searched under a maximum likelihood criterion between the acoustic parameters of the candidate unit sequence and the acoustic statistical models of the sentence to be synthesized. The Kullback-Leibler divergence (KLD) between acoustic statistical models is used for fast pre-selection of candidate units. Finally, the synthesized speech of the sentence is obtained by smoothing and concatenating the waveforms of the optimal candidate unit for each phoneme. The invention improves the quality of synthesized speech over traditional concatenative synthesis, allows the system to be built automatically, and makes it independent of the language.
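The two-stage selection described in the abstract can be sketched as follows. This is a simplified illustration, not the patent's implementation: it assumes a single diagonal-covariance Gaussian per phoneme, scores each phoneme independently rather than searching over a unit sequence, and uses hypothetical names (`kld_diag_gauss`, `log_likelihood`, `select_unit`) and made-up toy values.

```python
import math

def kld_diag_gauss(mean_p, var_p, mean_q, var_q):
    """KL divergence KL(p || q) between two diagonal-covariance Gaussians."""
    return 0.5 * sum(
        math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
        for mp, vp, mq, vq in zip(mean_p, var_p, mean_q, var_q)
    )

def log_likelihood(frames, mean, var):
    """Total log likelihood of feature frames under a diagonal Gaussian."""
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, mean, var):
            total += -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
    return total

def select_unit(target, candidates, keep=2):
    """Pre-select `keep` candidates by KLD, then pick the most likely one."""
    # Stage 1: cheap model-to-model comparison prunes the candidate list.
    shortlist = sorted(
        candidates,
        key=lambda c: kld_diag_gauss(target["mean"], target["var"], c["mean"], c["var"]),
    )[:keep]
    # Stage 2: score the surviving units' frames under the target model.
    return max(
        shortlist,
        key=lambda c: log_likelihood(c["frames"], target["mean"], target["var"]),
    )

# Toy data: the target model and three candidate units (all values invented).
target = {"mean": [0.0, 1.0], "var": [1.0, 1.0]}
candidates = [
    {"id": "a", "mean": [0.1, 1.0], "var": [1.0, 1.0], "frames": [[0.5, 1.5], [0.4, 1.4]]},
    {"id": "b", "mean": [0.2, 0.9], "var": [1.0, 1.0], "frames": [[0.0, 1.0], [0.1, 1.1]]},
    {"id": "c", "mean": [5.0, 5.0], "var": [1.0, 1.0], "frames": [[0.0, 1.0], [0.0, 1.0]]},
]
best = select_unit(target, candidates)
print(best["id"])  # b
```

In this toy setup, candidate "c" is discarded by the KLD stage before its frames are ever scored, which shows how pre-selection trades a small risk of pruning a good unit for a much smaller likelihood search.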

Description

Technical field

[0001] The invention relates to a unit selection method for waveform-concatenation speech synthesis; in particular, it guides the selection of speech segment units by designing and training a set of acoustic statistical models.

Background

[0002] Speech synthesis is an important technology for natural and efficient human-computer interaction. The two most common speech synthesis methods today are synthesis based on unit selection and waveform concatenation, and parametric synthesis based on an acoustic statistical model.

[0003] In traditional unit selection algorithms, the target cost and connection cost are usually computed from the difference in context attributes between units, or from the distance between the acoustic parameters of a candidate unit and the predicted target. As a result, the design of the cost function often requires the participation of language-relat...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L13/02, G10L13/06, G10L13/08

Inventor: 凌震华, 胡郁, 胡国平, 吴晓如, 刘庆峰, 王仁华

Owner: IFLYTEK CO LTD
