Multi-microphone Speech Enhancement Method Based on Tensor Decomposition

A speech enhancement technology based on tensor decomposition, applied in signal processing, frequency response correction, electrical components, etc. It addresses problems such as heavy noise, the performance loss of beamforming, and residual noise.

Active Publication Date: 2019-11-22
UNIV OF SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

However, vector-based representations cannot make full use of the spatial, temporal, and frequency information carried in multi-channel data, so there is room for improvement.
[0004] In addition, in many cases the noise received by the microphone array is not purely directional interference, so beamforming, which relies on spatial filtering, is highly prone to performance loss.
For background noise with weak or no directionality, the spatial filtering of a beamforming algorithm performs poorly and leaves more residual noise.


Examples

Embodiment Construction

[0041] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0042] A tensor decomposition-based multi-microphone speech enhancement method provided by an embodiment of the present invention, as shown in Figure 1, mainly includes the following steps:

[0043] Step 11: Select the type of orthogonal basis matrix (3D DCT orthogonal basis, supervised orthogonal basis, or unsupervised orthogonal basis).

[0044] Step 12: Project the tensor on the selected basis matrix; select an optimal threshold to trunca...
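Steps 11 and 12 can be illustrated with a minimal sketch, assuming the 3D DCT option from Step 11 and a simple hard threshold on the projected coefficients. The function names (`dct_basis`, `project`, `truncate_and_reconstruct`) and the `threshold` argument are hypothetical; the patent text visible here does not give the optimal threshold formula, so the caller supplies a placeholder value.

```python
import numpy as np

def dct_basis(n):
    """Orthonormal DCT-II matrix of size n x n (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    basis = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    basis[0, :] /= np.sqrt(2.0)  # rescale the DC row so all rows have unit norm
    return basis

def project(tensor, bases):
    """Apply one orthogonal matrix along each mode of a 3-way tensor."""
    out = tensor
    for mode, basis in enumerate(bases):
        # Contract the basis columns with the tensor's current mode,
        # then move the resulting axis back to its original position.
        out = np.moveaxis(np.tensordot(basis, out, axes=(1, mode)), 0, mode)
    return out

def truncate_and_reconstruct(tensor, bases, threshold):
    """Project, zero small coefficients, and map back to the signal domain.

    Assumption: hard thresholding with a caller-supplied threshold; this is
    an illustration, not the optimal threshold described in the patent.
    """
    coeffs = project(tensor, bases)
    kept = np.where(np.abs(coeffs) >= threshold, coeffs, 0.0)
    # The bases are orthogonal, so the inverse transform uses the transposes.
    return project(kept, [b.T for b in bases])

# Example: a random 4 x 6 x 8 block standing in for a noisy tensor block.
rng = np.random.default_rng(0)
block = rng.standard_normal((4, 6, 8))
bases = [dct_basis(n) for n in block.shape]
denoised = truncate_and_reconstruct(block, bases, threshold=1.0)
```

Because each basis matrix is orthogonal, the projection preserves the energy of the block, and a threshold of zero returns the input unchanged. A supervised or unsupervised basis from Step 11 could be passed in place of the DCT matrices, provided it is orthogonal.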

Abstract

The invention discloses a multi-microphone speech enhancement method based on tensor decomposition, which comprises the following steps: a 3D tensor is used to represent the multi-channel speech signals observed by a plurality of microphones, and the tensor is projected onto a series of orthogonal bases; a statistical risk minimization criterion is adopted to collect noise segments in real time; the noise covariance is tracked, and an optimal threshold is calculated according to the size of the tensor block. In this method, the received multi-channel data are represented as a third-order tensor, so that the original spatial and temporal information is preserved, background noise and weakly directional noise can be removed more effectively, and speech distortion is kept as low as possible.
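The tensor representation mentioned in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the mode ordering (microphone, frame, sample within frame), the function name `frames_to_tensor`, and the `frame_len` and `hop` parameters are all hypothetical, since the text available here does not specify how the three modes are laid out.

```python
import numpy as np

def frames_to_tensor(signals, frame_len, hop):
    """Stack framed multi-channel audio into a (mics, frames, frame_len) tensor.

    Assumption: `signals` is a 2-D array with one row per microphone, and the
    three tensor modes are microphone, frame index, and sample within frame.
    """
    signals = np.asarray(signals, dtype=float)
    num_mics, num_samples = signals.shape
    if num_samples < frame_len:
        raise ValueError("signal is shorter than one frame")
    num_frames = 1 + (num_samples - frame_len) // hop
    starts = hop * np.arange(num_frames)
    # index[f, t] is the sample position of element t in frame f
    index = starts[:, None] + np.arange(frame_len)[None, :]
    return signals[:, index]

# Example: 3 microphones, 1 second at 16 kHz, 32 ms frames with 50% overlap.
rng = np.random.default_rng(0)
recording = rng.standard_normal((3, 16000))
tensor = frames_to_tensor(recording, frame_len=512, hop=256)
```

Unlike concatenating the microphone frames into one long vector, this layout keeps the channel axis and the time axes separate, which is the property the abstract credits for retaining spatial and temporal information.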

Description

Technical field

[0001] The invention relates to the field of speech noise reduction, in particular to a multi-microphone speech enhancement method based on tensor decomposition.

Background

[0002] In the field of speech enhancement, although classic single-channel algorithms can remove much of the background noise, they easily cause speech distortion and may even introduce "musical" noise, degrading speech quality. By using a microphone array together with a beamforming algorithm, better suppression of directional interference can be achieved.

[0003] Traditional speech enhancement algorithms based on microphone arrays can be divided into time-domain and frequency-domain noise reduction algorithms. A time-domain algorithm usually concatenates the speech frames output by each microphone and performs optimal linear filtering on this extended frame. A frequency-domain algorithm performs a Fourier transform on each microphone frame, extracts the ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04R3/04
CPC: H04R3/04, H04R2430/00
Inventors: 叶中付, 童仁杰
Owner: UNIV OF SCI & TECH OF CHINA