A binaural speech enhancement method based on deep learning

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech enhancement and binaural technology, applied in speech analysis, stereo systems, instruments, etc., can solve the problems of poor non-stationary noise suppression and no special processing of target speech space information, so as to improve robustness and suppress Effects of noise interference and binaural speech enhancement

Active Publication Date: 2021-03-23

INST OF ACOUSTICS CHINESE ACAD OF SCI

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In most traditional speech enhancements with two-channel output, most of them only consider removing interference, and there is no special processing for the spatial information of the target speech, and the suppression effect on non-stationary noise is not good.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0031] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0032] figure 1 It is a flowchart of a binaural speech enhancement method based on deep learning. Such as figure 1 shown, including:

[0033] Step S101: Framing, windowing, and Fourier transform are respectively performed on the left channel noisy speech signal and the right channel noisy speech signal to obtain the left channel noisy spee...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a binaural speech enhancement method based on deep learning, comprising the following steps: respectively processing a left / right channel noisy speech signal containing a target speech signal to be enhanced so as to obtain a left / right frequency domain signal, and combining amplitudes thereof to obtain single-channel complex features; using the frequency domain signal of the left / right channel and corresponding target frequency domain signal theoretical value to calculate corresponding target speech ideal complex masking respectively; combining to form a target speech single-channel complex masking theoretical value, combining the single-channel complex features to train a complex feedforward neural network so as to obtain a binaural speech enhancement model; usingthe target speech single-channel complex masking estimated value outputted by the model to respectively process the left / right channel noisy speech signal so as to obtain a left / right channel frequency domain signal; and finally obtaining a corresponding target speech time-domain signal. By the method, noise interference can be suppressed and spatial information of a target sound source can be maintained. By making full use of the generalization ability of deep neural networks, the enhancement of binaural speech is achieved.

Description

technical field [0001] The invention relates to the technical field of speech enhancement, in particular to a binaural speech enhancement method based on deep learning. Background technique [0002] At present, speech enhancement technology is mainly to remove background noise and directional noise interference in speech signals, improve speech quality and intelligibility, and achieve better performance in speech recognition and human ear understanding. In the enhancement technology with single-channel speech as the output, the background noise can be suppressed by using the different characteristics of speech and noise in the time-frequency domain of the single-channel input, and the spatial information of the target speech and interference signals in the multi-channel input can be better. Remove directional noise. In binaural hearing, the human ear can use the spatial information difference between the target and the interference signal in the dual-channel speech to impro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L21/0232G10L25/30

CPCG10L21/0232G10L25/30H04S2420/01

Inventor李军锋孙兴伟夏日升颜永红

OwnerINST OF ACOUSTICS CHINESE ACAD OF SCI

A binaural speech enhancement method based on deep learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology