Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter

A speech signal and dynamic feature technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as performance degradation, inability to reflect the dynamic characteristics of speech signals, and inability to fully mine dynamic information, so as to improve performance. , to avoid instability, good robustness

Inactive Publication Date: 2013-04-03
BOHAI UNIV
View PDF4 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, once these parameters are applied to noisy environments, their performance will drop sharply
[0003] Moreover, the characteristic parameters mentioned above all reflect the static characteristics of the speech. The dynamic characteristics of the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter
  • Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter
  • Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Below in conjunction with accompanying drawing and embodiment, the technical solution of the present invention is described in detail:

[0046] Such as figure 1 As shown, the method includes speech signal preprocessing, MUSIC spectrum estimation, modulation spectrum filtering, modulation spectrum energy sum calculation, logarithmic energy calculation (Log) and discrete cosine transform (DCT). The specific process is as follows:

[0047] 1. Speech signal preprocessing

[0048] The voice signal is input through the microphone for sampling. The sampling frequency can be 8kHz, 11.025kHz, 16kHz, 22.050kHz, and the quantization precision can be 8bit or 16bit. In this example, the processing unit performs sampling and quantization with the sampling frequency of 11.025kHz and the quantization precision of 16bit. Obtain the corresponding voice data, and then use a first-order digital pre-emphasis filter to achieve pre-emphasis. The coefficient value range of the pre-emphasis fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A voice signal dynamic feature extraction method based on MUSIC (multiple signal classification) and modulation spectrum filter mainly comprises voice signal preprocessing, MUSIC spectrum estimating, modulation spectrum filter, spectrum energy modulating and calculating, log energy calculating and discrete cosine transformation (DCT). The modulation spectrum filter can not only fully reflect dynamic features among voices but has low sensitivity of voice environment owing to the modulation spectrum filter has a time-frequency clustering, so that according to different reflections of interference signals and voice signals on the modulation information, the modulation spectrum filter uses spectrum estimating technology based on multiple signal classification, modulates the spectrum filter of obtained MUSIC spectrum and extracts cepstrum coefficients of the MUSIC spectrum as feature parameters. Compared with the prior art, the voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter has good robustness and not only significantly improves the recognition rate of a voice recognition system but has played a good foreshadowing role for follow-up studies of voice signals.

Description

technical field [0001] The invention relates to a method for extracting dynamic feature parameters of Chinese speech signals, in particular to a method for extracting dynamic feature parameters of speech signals based on MUSIC and modulation spectrum filtering. Background technique [0002] The most basic and most important development link of speech recognition is the extraction of speech signal characteristic parameters. As early as the 1940s, R.K. Potter and others proposed the concept of "Visible Speech", pointing out that spectrograms have a strong ability to describe speech signals, and tried to use spectral information for speech recognition, which formed the earliest phonetic features. In the 1950s, people found that in order to recognize the speech signal, some parameters that can reflect the speech characteristics must be extracted from the speech waveform, which can not only reduce the number of templates, the amount of calculation and storage, but also filter ou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02
Inventor 韩志艳伦淑娴王健郭艳东王东郭兆正王丽君
Owner BOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products