Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Improved spectrum subtraction method based on human ear masking effect and Bayesian estimation

A technology of Bayesian estimation and masking effect, applied in speech analysis, instruments, etc., can solve problems such as slow response speed, reduced speech enhancement effect, and reduced signal-to-noise ratio

Inactive Publication Date: 2018-11-02
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Combining the auditory masking effect of the human ear is an important idea to eliminate the musical noise of spectral subtraction. Someone improved the spectral subtraction formula to: Y w (ω) is the spectral signal of noisy speech, In order to enhance the spectral signal of speech, most of the noise estimation algorithms used in existing technical solutions are not accurate enough, such as voice activity detection (VAD) or minimum value statistics, the reliability of the former will decrease with the decrease of signal-to-noise ratio, The latter responds slowly, which will affect the accuracy of noise estimation and reduce the effect of speech enhancement.
Moreover, there is also a misunderstanding in the current technical solutions. Too much emphasis on the elimination of music noise affects the intelligibility of the voice signal, destroys the voice signal, and even reduces the signal-to-noise ratio.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Improved spectrum subtraction method based on human ear masking effect and Bayesian estimation
  • Improved spectrum subtraction method based on human ear masking effect and Bayesian estimation
  • Improved spectrum subtraction method based on human ear masking effect and Bayesian estimation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0205] The present invention has been contrasted with other two kinds of algorithms, specifically as follows:

[0206] Method 1: traditional spectral subtraction,

[0207] See Berouti, M., Schwartz, M., and Makhoul, J. (1979). Enhancement of speech corrupted by acoustic noise. Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 208-211.

[0208] Method 2: Spectral subtraction based on human ear masking effect, using voice activity detection (VAD) to estimate noise, spectral subtraction is unimproved filter spectral subtraction, see Cai Hantian, Yuan Botao. A speech enhancement algorithm based on auditory masking model[J ]. Journal of Communications, 2002(8):93-98.

[0209] Method three: the method of the present invention

[0210] These three methods are used to enhance the noisy speech with signal-to-noise ratios of -5dB, 0dB, and 5dB, and the noise type is white noise. The PESQ value is used to measure the intelligibility of speech.

[0211] PESQ (Perceptual evalua...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an improved spectrum subtraction method based on a human ear masking effect and Bayesian estimation. The improved spectrum subtraction method comprises the steps of: (1) adopting an improved minimum control value recursive averaging algorithm to obtain noise power spectrum estimation of an original noisy speech; (2) combining the obtained noise power spectrum estimation forperforming preliminary spectrum subtraction on a noisy speech signal; (3) performing Bayesian estimation based on weighted likelihood ratio distortion measurement on the signal after preliminary spectrum subtraction, and calculating the optimal estimated amplitude spectrum of the signal; (4) calculating a subtraction parameter of secondary spectrum subtraction by utilizing the human ear masking effect; (5) performing IMCRA noise estimation again before secondary spectrum subtraction, and carrying out secondary spectrum subtraction to obtain a final enhanced speech signal; (6) and performing inverse Fourier transform on the enhanced speech signal to obtain a final enhanced speech. The improved spectrum subtraction method better guarantees the intelligibility of the speech while improving the noise elimination capability of the algorithm, thereby improving the overall effect of speech enhancement.

Description

technical field [0001] The invention relates to an improved spectral subtraction method based on human ear masking effect and Bayesian estimation, belonging to the technical field of speech signal processing. Background technique [0002] Speech is an important way of information exchange between people, but people are always disturbed by various noises in the process of using voice to communicate and communicate. Noisy speech will not only increase human hearing fatigue and reduce the quality of speech communication, but also degrade the performance of the speech processing system based on feature parameter extraction. Therefore, in order to reduce the impact of background noise on speech quality, it is necessary to perform speech enhancement to suppress background noise. [0003] Spectral subtraction is a traditional enhancement algorithm. Its basic idea is to calculate the short-term amplitude spectrum of the noisy speech signal and the short-term amplitude spectrum of t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02G10L21/0216
CPCG10L21/0216G10L21/0364
Inventor 邓立新吴卫鹏
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products