Frequency spectrum modelling and voice reinforcing method based on line spectrum frequency and its interorder differential parameter

A line spectrum frequency and speech enhancement technology, applied in speech synthesis, speech analysis, instruments, etc., that addresses the reduced clarity of synthesized speech and the weakening of formants, and yields coefficients that are stable and can be solved for easily and robustly.

Active Publication Date: 2006-08-09

IFLYTEK CO LTD

0 Cites, 18 Cited by

AI Technical Summary

Problems solved by technology

When spectral parameters are modelled, some averaging is often introduced. This makes the spectral envelope corresponding to the model-predicted spectral parameters too smooth and weakens the formants, which reduces the intelligibility of the synthesized speech.

Embodiment Construction

[0034] The invention is implemented as follows:

[0035] 1. Spectral parameter analysis of the training speech data

[0036] 1) Compute the linear prediction coefficients of the speech signal frame by frame. A fixed frame shift combined with windowing (Gaussian window, window width twice the pitch period, frame shift of 5 milliseconds) yields the short-time waveform of each speech frame, from which the linear prediction coefficients of each order for that frame are obtained. The coefficients can be computed from the autocorrelation coefficients of the time-domain waveform. Alternatively, the adaptive weighted spectral interpolation method can be used: first compute the spectral envelope of the frame, then fit an all-pole model to it to solve for the linear prediction coefficients. When calculating, the parameter order can be set d...
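The autocorrelation route described in [0036] can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names, the `sigma_ratio` shape parameter of the Gaussian window, and the zero-padding at signal edges are all hypothetical choices, and the Levinson-Durbin recursion is used here as the standard way to solve the autocorrelation normal equations. The adaptive weighted spectral interpolation alternative is not shown.

```python
import math

def gaussian_window(width, sigma_ratio=0.4):
    # sigma_ratio is an assumed shape parameter; the source does not specify one.
    half = (width - 1) / 2.0
    sigma = max(sigma_ratio * half, 1e-9)
    return [math.exp(-0.5 * ((n - half) / sigma) ** 2) for n in range(width)]

def autocorrelation(frame, max_lag):
    # r[lag] = sum over n of frame[n] * frame[n - lag]
    return [sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    # Solves for a[1..order] in x[n] ~ sum_k a[k] * x[n - k].
    # Returns (coefficients, residual prediction error energy).
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or degenerate frame: leave remaining coefficients at zero
        acc = r[i] - sum(a[k] * r[i - k] for k in range(1, i))
        k_i = acc / err  # reflection coefficient for this order
        prev = a[:]
        a[i] = k_i
        for k in range(1, i):
            a[k] = prev[k] - k_i * prev[i - k]
        err *= 1.0 - k_i * k_i
    return a[1:], err

def lpc_frame(signal, center, pitch_period, order):
    # Window width is twice the pitch period (in samples), as in the text.
    width = 2 * pitch_period
    start = center - pitch_period
    win = gaussian_window(width)
    # Samples outside the signal are treated as zero (an assumption).
    frame = [(signal[start + n] if 0 <= start + n < len(signal) else 0.0) * win[n]
             for n in range(width)]
    return levinson_durbin(autocorrelation(frame, order), order)
```

To follow the fixed 5 ms frame shift, `center` would advance by `int(0.005 * sample_rate)` samples per frame, with `pitch_period` taken from whatever pitch estimate is available for that frame.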

Abstract

The method comprises the following steps. During spectral parameter extraction, the inter-order differences of the line spectrum frequencies are treated as part of the extracted result. During modelling and training, the line spectrum frequencies and the inter-order difference parameters are modelled and trained independently. During prediction, the line spectrum frequencies and the inter-order difference parameters are predicted separately, and the spectral parameters are then adjusted using the inter-order differences. Finally, the adjusted spectral parameters are used to synthesize the output speech, so that enhancing and sharpening the formants of the synthesized speech improves its sound quality.
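The extraction and adjustment steps in the abstract can be illustrated with a small sketch. The abstract does not state the adjustment rule, so everything below is an assumption made for illustration: the inter-order difference is taken to be the gap between adjacent line spectrum frequencies, and the hypothetical `adjust_lsf` simply blends each current gap toward the separately predicted gap (controlled by an assumed `weight`), rebuilds the frequencies, and re-centres them on the original mean. The underlying intuition is a general property of line spectrum frequencies: a narrower gap between adjacent frequencies generally corresponds to a sharper spectral peak.

```python
def interorder_differences(lsf):
    # Gap between each pair of adjacent line spectrum frequencies
    # (assumed definition of the inter-order difference parameter).
    return [hi - lo for lo, hi in zip(lsf, lsf[1:])]

def adjust_lsf(lsf, predicted_diff, weight=0.5, min_gap=1e-3):
    # Hypothetical adjustment rule, NOT taken from the patent text:
    # move each gap toward the independently predicted difference.
    if len(predicted_diff) != len(lsf) - 1:
        raise ValueError("need one predicted difference per adjacent pair")
    current = interorder_differences(lsf)
    # Blend current and predicted gaps; min_gap keeps the ordering strict.
    gaps = [max((1.0 - weight) * g + weight * t, min_gap)
            for g, t in zip(current, predicted_diff)]
    # Rebuild the frequencies by accumulating the adjusted gaps.
    rebuilt = [lsf[0]]
    for g in gaps:
        rebuilt.append(rebuilt[-1] + g)
    # Shift so the mean frequency matches the original prediction.
    shift = sum(lsf) / len(lsf) - sum(rebuilt) / len(rebuilt)
    return [f + shift for f in rebuilt]

predicted_lsf = [0.3, 0.5, 1.0, 1.2]   # illustrative values in radians
predicted_diff = [0.1, 0.6, 0.1]       # illustrative separately predicted gaps
sharpened = adjust_lsf(predicted_lsf, predicted_diff, weight=1.0)
```

With `weight=0.0` the predicted line spectrum frequencies are returned unchanged; larger weights trust the difference model more. The sketch does not clamp the result to the valid frequency range, which a real system would need to check.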

Description

Technical field

[0001] The invention relates to a speech synthesis method. Specifically, it takes inter-order difference parameters into account when the speech spectrum is parameterized and modelled on the basis of line spectrum frequencies, and makes appropriate use of these inter-order difference parameters to enhance the formants of synthesized speech and thereby improve its intelligibility.

Background

[0002] Existing speech synthesis techniques mainly comprise methods based on waveform concatenation and methods based on parametric synthesis. The former can achieve high sound quality and naturalness in the synthesized speech by using a speech library of natural acoustic samples and selecting units from it during synthesis. However, because of the speech library, storage consumption is often relatively large, and it is difficult to realize th...

Application Information

IPC(8): G10L13/08, G10L13/02, G10L13/00, G10L21/02, G10L21/0264

Inventor: 凌震华, 王玉华, 王仁华

Owner: IFLYTEK CO LTD
