Multi-channel speech enhancement method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech enhancement and multi-channel technology, applied in speech analysis, instruments, etc., can solve the problems of relying on noise estimation, the difficulty of accurately estimating the dimension of signal subspace, and the difficulty of determining the dimension of signal subspace, so as to achieve accurate estimation and overcome adverse effects , Overcoming the effect of noise energy fluctuations

Active Publication Date: 2013-03-13

INST OF AUTOMATION CHINESE ACAD OF SCI

View PDF2 Cites 15 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in practical applications, due to the influence of factors such as room echo and non-stationary noise, the dimension of the signal subspace is often difficult to estimate accurately, so the prior knowledge of the low-rank model is difficult to accurately use

[0007] The fundamental defect of the traditional method of signal subspace dimension estimation is that it relies too much on noise estimation, while ignoring the variation law of the signal subspace dimension itself

When the SSA-based speech enhancement algorithm works in a real environment, due to the interference of noise, SVD or EVD will generate a large number of singular values or eigenvalues close to zero, and their value changes are continuous, which is also the signal subspace dimension. sure increased difficulty

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0024] The present invention provides a method for estimating the dimension of a signal subspace based on the F norm. The specific steps of the method are as follows:

[0025] Step S1: Collect multiple noisy speech signals y(t) through a microphone array composed of N microphones, specifically including:

[0026] Step S11: a microphone array for collecting signals, which is composed of N equally spaced microphones. Suppose the noisy speech signal y collected by the nth microphone at time t n (t) for x n (t) and v n The sum of (k), namely

[0027] the y n (t)=g n *s(t)+v n (t)=x n (t)+v n (t) (1)

[0028] Among them, g n is the impulse response of the sound source s(t) to the nth microphone, x n(t) is the pure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-channel speech enhancement method in which a norm F is used for presenting the dimension of a signal subspace. The method comprises the following steps of: step 1, acquiring a multi-channel speech signal y(t) with a noise via a microphone array which is composed of N microphones, and calculating the noisy speech cross-correlation matrix Ryy of the multi-channel speech signal y(t), wherein t represents a discrete time point; step 2, estimating an additive noise cross-correlation matrix by virtue of a noise estimation algorithm; step 3, estimating a pure speech cross-correlation matrix by virtue of the noisy speech cross-correlation matrix Ryy and the additive noise cross-correlation matrix; step 4, estimating the dimension of the signal subspace by virtue of the pure speech cross-correlation matrix; step 5, performing generalized eigenvalue decomposition, and obtaining a time-domain constraint linear signal estimator by combining the dimension of the signal subspace and a Lagrangian multiplier mu; and step 6, filtering the multi-channel speech signal y(t) by virtue of the time-domain constraint linear signal estimator to obtain the enhanced speech.

Description

technical field [0001] The invention relates to the field of speech signal processing of speech enhancement and noise elimination, in particular to a multi-channel speech enhancement method. Background technique [0002] Signal Subspace Approach (SSA) is mainly based on matrix orthogonal decomposition theory, using Eigenvalue Decomposition (Eigenvalue Decomposition, EVD or Singular Value Decomposition, SVD) to decompose the vector space of noisy speech signal into Two parts that are orthogonal to each other: signal subspace (signal + noise) and noise subspace (noise only). Among them, EVD uses the autocorrelation matrix of the signal to realize the transformation from the time domain to the feature domain, which is regarded as dependent on the signal , which is often called the Karhunen-Loeve transform (KLT). Essentially, SSA-based speech enhancement is to zero the noise subspace while removing noise components in the signal subspace. [0003] SSA was first introduced into ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L21/02

Inventor 刘文举李超

Owner INST OF AUTOMATION CHINESE ACAD OF SCI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-channel speech enhancement method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology