Speech de-noising method and speech de-noising device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A voice denoising, non-speech technology, applied in voice analysis, instruments, etc., can solve problems such as poor results

Active Publication Date: 2017-03-08

苏州谦问万答吧教育科技有限公司

View PDF4 Cites 31 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the noise in the speech part may be different from the noise in the non-speech part, especially if it is affected by the residual echo (there is multiplicative noise), and the overall speech signal Doesn't do well with denoising

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0023] figure 1 The flow chart of the voice denoising method provided in Embodiment 1 of the present invention, this embodiment can be used for voice denoising, the method can be executed by a voice denoising device, and the device can be implemented by software and / or hardware, the The device can be integrated into any intelligent terminal that provides voice denoising function. In specific implementation, the intelligent terminal can include: mobile terminals such as tablets, mobile phones, and e-readers. The above terminals are only examples, not exhaustive, including but not exhaustive. Limited to the smart terminals mentioned above.

[0024] see figure 1 , the method for speech denoising, comprising:

[0025] S110. Perform speech detection on the noisy speech signal to distinguish speech frames from non-speech frames.

[0026] The speech signal received by the smart terminal is a non-stationary time-varying noisy speech signal formed after being disturbed by the enviro...

Embodiment 2

[0044] figure 2 The flow chart of the speech denoising method provided by Embodiment 2 of the present invention. On the basis of the above embodiments, this embodiment proposes an effective speech feature combination when performing speech detection on noisy speech signals, which can more accurately distinguish Separate noisy speech frames and non-speech frames.

[0045] see figure 2 , the method for speech denoising, comprising:

[0046] S210. Extract speech features of the noisy speech signal.

[0047] The extracted noisy speech features include Mel cepstral coefficient MFCC, linear predictive coding residual and spectral centroid Centroid. Human beings have different perceptual abilities to different frequencies of speech: below 1kHz, it has a linear relationship with the frequency, and above 1kHz, it has a logarithmic relationship with the frequency. The higher the frequency, the worse the perception. In applications, only low-frequency MFCCs are often used, while m...

Embodiment 3

[0077] image 3 It is a flow chart of the speech denoising method provided by Embodiment 3 of the present invention. On the basis of the foregoing embodiments, this embodiment performs stationary noise suppression, non-speech noise suppression, and non-stationary noise suppression on noisy speech signals.

[0078] see image 3 , the method for speech denoising, comprising:

[0079] S310. Perform speech detection on the noisy speech signal to distinguish speech frames from non-speech frames.

[0080] S320. Perform noise estimation on the speech frame and the non-speech frame respectively, to obtain a fusion estimation value of the noise power spectrum.

[0081] S330. Perform stationary noise suppression, non-speech noise suppression, and non-stationary noise suppression on the noisy speech signal according to the fused estimated value of the noise power spectrum.

[0082] Exemplarily, after the fused estimated value of the noise power spectrum is obtained, denoising processi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Embodiments of the invention disclose a speech de-noising method and a speech de-noising device. The method comprises the following steps: detecting the speech of a speech signal with noise to distinguish between speech frames and non-speech frames; estimating the noise of the speech frames and the noise of the non-speech frames to get a noise power spectrum fused estimated value, wherein the noise power spectrum fusion estimated value is the fused value of the noise power spectrum estimated value of the speech frames and the noise power spectrum estimated value of the non-speech frames; and de-noising the speech signal with noise according to the noise power spectrum fusion estimated value. According to the technical scheme provided by the embodiments of the invention, the noise of the speech frames and the noise of the non-speech frames are estimated, and the speech signal with noise is de-noised based on the noise estimation results of the speech frames and the non-speech frames. Thus, the de-noising effect of the existing speech de-noising scheme is improved effectively, and the quality of speech is improved.

Description

technical field [0001] Embodiments of the present invention relate to speech signal processing technologies, and in particular, to a speech denoising method and device. Background technique [0002] In the process of real-time voice communication, various noise interference problems will be encountered, especially for mobile devices such as mobile phones, the problem of voice noise is particularly prominent. In addition, in the case of playing sound through a speaker, due to the echo problem, the sound quality of speech in this case is easily affected by external environmental noise and nonlinear residual echo compared with remote recording. [0003] In order to improve the quality of voice communication, it is necessary to denoise the voice to improve the clarity of the voice. Traditional speech denoising algorithms usually assume that the noise is additive and stable, and use Voice Activity Detection (Voice Activity Detection, VAD) technology to distinguish noisy speech i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/0216G10L25/78

Inventor 吴威麒张凯磊

Owner 苏州谦问万答吧教育科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speech de-noising method and speech de-noising device

What is Al technical title? Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document. A voice denoising, non-speech technology, applied in voice analysis, instruments, etc., can solve problems such as poor results

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A voice denoising, non-speech technology, applied in voice analysis, instruments, etc., can solve problems such as poor results

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology