Unlock instant, AI-driven research and patent intelligence for your innovation.

A Double-Noise Speech Enhancement Method Using Multiple Modules to Suppress Different Kinds of Noise

A speech enhancement, multi-module technology, applied in speech analysis, instruments and other directions, can solve the problems of speech enhancement algorithm performance deterioration, the algorithm is not easy to show generalization and other problems, to achieve the effect of improving performance

Active Publication Date: 2020-07-31
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For dual-noise scenes, general algorithms are not easy to show good generalization
In a low SNR environment, the performance of the speech enhancement algorithm will deteriorate significantly due to the dominant noise energy in the audio.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Double-Noise Speech Enhancement Method Using Multiple Modules to Suppress Different Kinds of Noise
  • A Double-Noise Speech Enhancement Method Using Multiple Modules to Suppress Different Kinds of Noise
  • A Double-Noise Speech Enhancement Method Using Multiple Modules to Suppress Different Kinds of Noise

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0036] see Figure 1-3 , the present invention provides a technical solution: a dual-noise speech enhancement method for multi-module suppression of different types of noise, comprising the following steps:

[0037] S1: Multiple types of noise are modeled in stages. For the input noisy speech, one or more noise features are extracted and filtered by the noise suppression module at each stage; among them, the loss function of each noise suppression module are not the same;

[0038] S2: The magnitude spectrum of the suppressed part of the noise and the magnitude spectrum of the original noisy speech are spliced ​​and input into the final neural network;

[0039] S3: Use the neural network to learn the mapping from the noisy amplitude spectrum to the pure amplitude spectrum, extract the features, and obtain the pure amplitude spectrum;

[0040] S4: The fitting target of the loss function of the intermediate noise suppression module is noisy speech, and the fitting target of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a dual-noise speech enhancement method for multi-module suppression of different types of noise, which includes the following steps: S1: performing stage-by-stage modeling on various types of noise, and for the input noisy speech, through noise suppression at each stage The module extracts and filters one or more noise features; among them, the loss function of each noise suppression module is different; S2: the amplitude spectrum of the suppressed part of the noise in the process and the amplitude spectrum of the original noisy speech are spliced ​​and input into the final neural network network; the present invention proposes a multi-module staged dual-noise speech enhancement method that suppresses different types of noise. performance, and then integrate the enhanced results into the next stage, it uses the neural network to learn the mapping from the noisy magnitude spectrum to the purer magnitude spectrum at each stage, and refines the features to obtain a purer magnitude spectrum.

Description

technical field [0001] The invention belongs to the technical field of speech enhancement, in particular to a dual-noise speech enhancement method in which multiple modules suppress different types of noise. Background technique [0002] Speech enhancement algorithms are an important speech processing technology that power speech recognition systems, hearing aids, and military bugging devices. At present, the accuracy rate of speech recognition algorithms has reached a relatively high level, even surpassing skilled dictation transcriptionists in some public data sets. However, due to the existence of noise or reverberation interference, the speech recognition algorithm can only achieve the desired effect after speech enhancement. The current speech enhancement algorithms only perform well on noisy speech containing a single noise with a high signal-to-noise ratio. In real scenes such as meeting environment, battlefield environment and street environment, there will be many...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/0208G10L21/0316G10L25/30
CPCG10L21/0208G10L21/0316G10L25/30
Inventor 蓝天叶文政惠国强刘峤李森钱宇欣吕忆蓝彭川李萌
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA