Multi-microphone Speech Enhancement Method Based on Tensor Decomposition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech enhancement and tensor decomposition technology, applied in signal processing, frequency response correction, electrical components, etc., can solve problems such as multi-noise, performance loss of beamforming technology, residual, etc. Effect

Active Publication Date: 2019-11-22

UNIV OF SCI & TECH OF CHINA

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, vector-based representation methods cannot make full use of the space, time, and frequency information carried in multi-channel data, so there is room for improvement

[0004] In addition, in many cases the noise received by the microphone array is not completely directional interference, which makes the beamforming technology based on the principle of spatial filtering extremely vulnerable to performance loss

For background noise with no obvious directionality or even no directionality, the spatial filtering effect of the beamforming algorithm is not good, which will bring more noise residues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0042] A tensor decomposition-based multi-microphone speech enhancement method provided by an embodiment of the present invention, such as figure 1 As shown, it mainly includes the following steps:

[0043] Step 11. Select what kind of orthogonal basis matrix (3DDCT orthogonal basis, supervised orthogonal basis, unsupervised orthogonal basis).

[0044] Step 12: Project the tensor on the selected basis matrix; select an optimal threshold to trunca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a multi-microphone voice enhancement method based on tensor decomposition, which comprises the following steps: a 3D tensor is used for expressing multi-channel voice signals observed by a plurality of microphones, and the tensor is projected to a series of orthogonal basis; a minimization statistical risk criterion is adopted to collect a noise section in real time; noisecovariance is tracked, and an optimal threshold is calculated according to the size of the tensor block. According to the method, the received multi-channel data are represented as a three-order tensor, so that original spatial information and time information are reserved, background noise and weak directional noise can be removed more obviously, and voice distortion can be reduced as much as possible.

Description

technical field [0001] The invention relates to the field of voice noise reduction, in particular to a multi-microphone voice enhancement method based on tensor decomposition. Background technique [0002] In the field of speech enhancement, although the classic single-channel algorithm can remove more background noise, it is easy to cause speech distortion, and even bring "music" noise, resulting in speech quality damage. By using a microphone array and using a beamforming algorithm, better suppression of directional interference can be achieved. [0003] Traditional speech enhancement algorithms based on microphone arrays can be divided into time-domain noise reduction algorithms and frequency-domain noise reduction algorithms. The time-domain algorithm usually stitches the speech frames output by each microphone, and performs optimal linear filtering on this extended frame. The frequency domain algorithm performs Fourier transform on each microphone frame, extracts the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): H04R3/04

CPCH04R3/04H04R2430/00

Inventor 叶中付童仁杰

Owner UNIV OF SCI & TECH OF CHINA

Multi-microphone Speech Enhancement Method Based on Tensor Decomposition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology