Multi-channel speech enhancement method

A speech enhancement and multi-channel technology, applied in speech analysis, instruments, etc., can solve the problems of relying on noise estimation, the difficulty of accurately estimating the dimension of signal subspace, and the difficulty of determining the dimension of signal subspace, so as to achieve accurate estimation and overcome adverse effects , Overcoming the effect of noise energy fluctuations

Active Publication Date: 2013-03-13
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF2 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in practical applications, due to the influence of factors such as room echo and non-stationary noise, the dimension of the signal subspace is often difficult to estimate accurately, so the prior knowledge of the low-rank model is difficult to accurately use
[0007] The fundamental defect of the traditional method of signal subspace dimension estimation is that it relies too much on noise estimation, while ignoring the variation law of the signal subspace dimension itself
When the SSA-based speech enhancement algorithm works in a real environment, due to the interference of noise, SVD or EVD will generate a large number of singular values ​​or eigenvalues ​​close to zero, and their value changes are continuous, which is also the signal subspace dimension. sure increased difficulty

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-channel speech enhancement method
  • Multi-channel speech enhancement method
  • Multi-channel speech enhancement method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0024] The present invention provides a method for estimating the dimension of a signal subspace based on the F norm. The specific steps of the method are as follows:

[0025] Step S1: Collect multiple noisy speech signals y(t) through a microphone array composed of N microphones, specifically including:

[0026] Step S11: a microphone array for collecting signals, which is composed of N equally spaced microphones. Suppose the noisy speech signal y collected by the nth microphone at time t n (t) for x n (t) and v n The sum of (k), namely

[0027] the y n (t)=g n *s(t)+v n (t)=x n (t)+v n (t) (1)

[0028] Among them, g n is the impulse response of the sound source s(t) to the nth microphone, x n(t) is the pure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-channel speech enhancement method in which a norm F is used for presenting the dimension of a signal subspace. The method comprises the following steps of: step 1, acquiring a multi-channel speech signal y(t) with a noise via a microphone array which is composed of N microphones, and calculating the noisy speech cross-correlation matrix Ryy of the multi-channel speech signal y(t), wherein t represents a discrete time point; step 2, estimating an additive noise cross-correlation matrix by virtue of a noise estimation algorithm; step 3, estimating a pure speech cross-correlation matrix by virtue of the noisy speech cross-correlation matrix Ryy and the additive noise cross-correlation matrix; step 4, estimating the dimension of the signal subspace by virtue of the pure speech cross-correlation matrix; step 5, performing generalized eigenvalue decomposition, and obtaining a time-domain constraint linear signal estimator by combining the dimension of the signal subspace and a Lagrangian multiplier mu; and step 6, filtering the multi-channel speech signal y(t) by virtue of the time-domain constraint linear signal estimator to obtain the enhanced speech.

Description

technical field [0001] The invention relates to the field of speech signal processing of speech enhancement and noise elimination, in particular to a multi-channel speech enhancement method. Background technique [0002] Signal Subspace Approach (SSA) is mainly based on matrix orthogonal decomposition theory, using Eigenvalue Decomposition (Eigenvalue Decomposition, EVD or Singular Value Decomposition, SVD) to decompose the vector space of noisy speech signal into Two parts that are orthogonal to each other: signal subspace (signal + noise) and noise subspace (noise only). Among them, EVD uses the autocorrelation matrix of the signal to realize the transformation from the time domain to the feature domain, which is regarded as dependent on the signal , which is often called the Karhunen-Loeve transform (KLT). Essentially, SSA-based speech enhancement is to zero the noise subspace while removing noise components in the signal subspace. [0003] SSA was first introduced into ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02
Inventor 刘文举李超
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products