Improved spectrum subtraction method based on human ear masking effect and Bayesian estimation

A technique combining Bayesian estimation and the masking effect, applied in speech analysis, instruments, etc., that addresses problems such as slow response speed, reduced speech enhancement effect, and reduced signal-to-noise ratio.

Status: Inactive; Publication Date: 2018-11-02

NANJING UNIV OF POSTS & TELECOMM

2 Cites, 16 Cited by

Problems solved by technology

[0006] Incorporating the auditory masking effect of the human ear is an important approach to eliminating the musical noise of spectral subtraction. One existing improvement modifies the spectral subtraction formula, in which Y_w(ω) is the spectrum of the noisy speech and a second term is the spectrum of the enhanced speech. However, most of the noise estimation algorithms used in existing technical solutions, such as voice activity detection (VAD) or minimum statistics, are not accurate enough: the reliability of the former falls as the signal-to-noise ratio decreases, and the latter responds slowly. Both affect the accuracy of noise estimation and reduce the effect of speech enhancement.

Moreover, current technical solutions share a misconception: placing too much emphasis on eliminating musical noise harms the intelligibility of the speech signal, distorts the signal, and can even reduce the signal-to-noise ratio.

Embodiment

[0205] The present invention was compared with two other algorithms, as follows:

[0206] Method 1: traditional spectral subtraction.

[0207] See Berouti, M., Schwartz, M., and Makhoul, J. (1979). Enhancement of speech corrupted by acoustic noise. Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 208-211.

[0208] Method 2: spectral subtraction based on the human ear masking effect, with the noise estimated by voice activity detection (VAD); the spectral subtraction itself is the unimproved filter-form spectral subtraction. See Cai Hantian, Yuan Botao. A speech enhancement algorithm based on auditory masking model [J]. Journal of Communications, 2002(8): 93-98.

[0209] Method 3: the method of the present invention.
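As a rough illustration of the Method 1 baseline, the sketch below applies the over-subtraction rule with a spectral floor, as described by Berouti et al., to the power spectrum of a single frame. The function name `oversubtract`, the parameter names `alpha` and `beta`, and the numeric values are assumptions chosen for illustration; the source does not state which settings were used in the comparison.

```python
def oversubtract(noisy_power, noise_power, alpha=4.0, beta=0.01):
    """Power spectral subtraction with over-subtraction and a spectral floor.

    noisy_power: per-bin power |Y(w)|^2 of one noisy frame
    noise_power: per-bin noise power estimate |D(w)|^2
    alpha: over-subtraction factor (assumed value, typically >= 1)
    beta: spectral floor factor (assumed value, 0 < beta << 1)
    """
    if len(noisy_power) != len(noise_power):
        raise ValueError("spectra must have the same number of bins")
    enhanced = []
    for y, d in zip(noisy_power, noise_power):
        diff = y - alpha * d   # subtract a scaled noise estimate
        floor = beta * d       # never drop below a fraction of the noise
        enhanced.append(diff if diff > floor else floor)
    return enhanced


# Toy spectra (illustrative numbers, not measured data).
noisy = [10.0, 4.0, 1.0, 0.5]
noise = [1.0, 1.0, 1.0, 1.0]
print(oversubtract(noisy, noise, alpha=2.0, beta=0.1))  # [8.0, 2.0, 0.1, 0.1]
```

The floor keeps low-energy bins from collapsing to zero, which is what limits the isolated spectral peaks heard as musical noise.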

[0210] The three methods were used to enhance noisy speech at signal-to-noise ratios of -5 dB, 0 dB, and 5 dB, with white noise as the noise type. The PESQ value is used to measure the intelligibility of the speech.
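A test condition like the one above can be reproduced by scaling white noise to a target signal-to-noise ratio before adding it to the clean signal. The following is a minimal sketch using only the Python standard library; the helper names `add_white_noise` and `measured_snr_db`, the fixed seed, and the sine tone standing in for a speech recording are all assumptions, not details from the source.

```python
import math
import random


def add_white_noise(clean, snr_db, seed=0):
    """Return clean + white Gaussian noise scaled to the requested SNR (dB)."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in clean]
    p_signal = sum(x * x for x in clean) / len(clean)
    p_noise = sum(n * n for n in noise) / len(noise)
    # Choose gain so that 10*log10(p_signal / (gain^2 * p_noise)) == snr_db.
    gain = math.sqrt(p_signal / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [x + gain * n for x, n in zip(clean, noise)]


def measured_snr_db(clean, noisy):
    """SNR in dB, treating (noisy - clean) as the noise component."""
    p_signal = sum(x * x for x in clean)
    p_noise = sum((y - x) ** 2 for x, y in zip(clean, noisy))
    return 10.0 * math.log10(p_signal / p_noise)


# Hypothetical 440 Hz test tone at 8 kHz standing in for a speech recording.
clean = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
for snr in (-5, 0, 5):
    noisy = add_white_noise(clean, snr)
    print(snr, round(measured_snr_db(clean, noisy), 2))
```

The gain is computed from the measured powers of this particular noise realization, so the mixture hits the target SNR up to floating-point error rather than only on average.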

[0211] PESQ (Perceptual evalua...

Abstract

The invention discloses an improved spectral subtraction method based on the human ear masking effect and Bayesian estimation. The method comprises the steps of: (1) applying the improved minima controlled recursive averaging (IMCRA) algorithm to obtain a noise power spectrum estimate of the original noisy speech; (2) using this noise power spectrum estimate to perform preliminary spectral subtraction on the noisy speech signal; (3) performing Bayesian estimation based on the weighted likelihood ratio distortion measure on the signal after preliminary spectral subtraction, and calculating the optimal estimated amplitude spectrum of the signal; (4) calculating the subtraction parameter of the secondary spectral subtraction using the human ear masking effect; (5) performing IMCRA noise estimation again before the secondary spectral subtraction, and carrying out the secondary spectral subtraction to obtain a final enhanced speech signal; and (6) performing an inverse Fourier transform on the enhanced speech signal to obtain the final enhanced speech. The method better preserves the intelligibility of the speech while improving the noise suppression capability of the algorithm, thereby improving the overall effect of speech enhancement.
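The abstract does not give the formula used in step (4). One common way to tie a subtraction parameter to the masking effect is to interpolate linearly between a gentle and an aggressive setting according to the masking threshold in each frequency bin: where the threshold is high, residual noise is already masked and little subtraction is needed. The sketch below illustrates that idea only; it is not the patent's formula, and the function name `subtraction_factor`, its parameters, and the default values are assumptions.

```python
def subtraction_factor(threshold, t_min, t_max, alpha_min=1.0, alpha_max=6.0):
    """Map a masking threshold to an over-subtraction factor.

    High threshold (noise well masked)  -> alpha_min, gentle subtraction.
    Low threshold (noise clearly audible) -> alpha_max, aggressive subtraction.
    All default values are illustrative assumptions.
    """
    if t_max <= t_min:
        # Degenerate frame with no spread in thresholds: fall back to the
        # aggressive setting (an arbitrary choice for this sketch).
        return alpha_max
    ratio = (threshold - t_min) / (t_max - t_min)
    ratio = min(1.0, max(0.0, ratio))  # clamp to [0, 1]
    return alpha_max - ratio * (alpha_max - alpha_min)


# Thresholds at the minimum, midpoint, and maximum of a frame.
print([subtraction_factor(t, 0.0, 10.0) for t in (0.0, 5.0, 10.0)])
# [6.0, 3.5, 1.0]
```

A per-bin factor of this kind could then take the place of a fixed over-subtraction factor in the secondary spectral subtraction.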

Description

Technical field

[0001] The invention relates to an improved spectral subtraction method based on the human ear masking effect and Bayesian estimation, and belongs to the technical field of speech signal processing.

Background technique

[0002] Speech is an important means of information exchange between people, but spoken communication is always disturbed by various kinds of noise. Noisy speech not only increases listening fatigue and reduces the quality of speech communication, but also degrades the performance of speech processing systems based on feature parameter extraction. Therefore, to reduce the impact of background noise on speech quality, speech enhancement is needed to suppress the background noise.

[0003] Spectral subtraction is a traditional enhancement algorithm. Its basic idea is to calculate the short-term amplitude spectrum of the noisy speech signal and the short-term amplitude spectrum of t...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L21/02, G10L21/0216

CPC: G10L21/0216, G10L21/0364

Inventors: Deng Lixin (邓立新), Wu Weipeng (吴卫鹏)

Owner: NANJING UNIV OF POSTS & TELECOMM
