Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter

A line spectrum frequency and speech enhancement technology, which is applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of synthetic speech clarity reduction, formant weakening, etc., and achieve the effect of easy and robust solution and stable coefficients

Active Publication Date: 2006-08-09
IFLYTEK CO LTD
View PDF0 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the process of modeling parameters, certain averaging processing is often introduced, which makes the spectral envelope corresponding to the spectral parameters output by the model prediction too smooth, and the formant is weakened, resulting in a decline in the intelligibility of synthesized speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter
  • Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter
  • Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Concrete implementation of the present invention is as follows:

[0035] 1. Spectrum parametric analysis of training speech data

[0036] 1) Find the linear prediction coefficient for the speech signal by frame: through the method of fixed frame shift plus window multiplication (Gaussian window, the window width is twice the length of the pitch period, and the frame shift is 5 milliseconds) to obtain the short-term signal of each frame of speech Waveform, and then obtain the linear prediction coefficients of each order corresponding to the frame signal. The calculation method can use the linear prediction coefficient calculation method based on the time-domain waveform autocorrelation coefficient; it can also use the adaptive weighted spectral interpolation method, first calculate the spectrum envelope corresponding to the frame of speech, and then use the all-pole model to fit Solve for linear predictor coefficients. When calculating, the parameter order can be set d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method includes following steps: when picking up parameters of frequency spectrum, the method considers difference between orders in line spectrum frequency as a part of picked up result; when modeling a model and training, carrying out independent modeling and training for line spectrum frequency and parameters of difference between orders; when making prediction, predicting line spectrum frequency and parameters of difference between orders respectively, and moreover carrying out adjustment for parameters of frequency spectrum by using difference between orders; finally, using adjusted parameters of frequency spectrum synthesizes output voice in order to reach purpose of raising tone quality of synthesized voice through enhancing and sharpening formant of synthesized voice.

Description

technical field [0001] The invention relates to a speech synthesis method, specifically adding consideration of the inter-order difference parameters in the speech spectrum parameterization and modeling process based on the line spectrum frequency, and realizing the synthesis of speech by rationally utilizing the line spectrum frequency inter-order difference parameters The purpose of formant enhancement is to improve the intelligibility of synthesized speech. Background technique [0002] Existing speech synthesis techniques mainly include speech synthesis methods based on waveform splicing and speech synthesis methods based on parameter synthesis. The former can achieve higher sound quality and naturalness of synthesized speech by using the speech library containing natural acoustic samples and selecting units during synthesis. However, due to the use of the voice library, there is often a relatively large consumption of storage capacity, and it is difficult to realize th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/08G10L13/02G10L13/00G10L21/02G10L21/0264
Inventor 凌震华王玉华王仁华
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products