Speech enhancement method and device

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A speech enhancement, pure speech technology, applied in speech analysis, baseband system components, instruments, etc., can solve problems such as a priori signal-to-noise ratio error, pure speech signal error, etc., to achieve the effect of reducing errors

Inactive Publication Date: 2012-04-04

HUAWEI TECH CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] In the estimation technology based on the minimum mean square error, it is necessary to calculate the prior signal-to-noise ratio through the Decision-Directed Approach method to obtain a pure speech signal. However, the inventor found in the research that in the existing estimation technology based on the minimum mean square error, There are at least the following problems in the calculation of the prior SNR: the calculation of the prior SNR of the current data frame depends on the information of the previous frame of the current data frame, however, there is a gap between the previous frame of the current frame and the current frame This difference will lead to the same error in the prior signal-to-noise ratio, and eventually lead to a large error between the pure speech signal obtained by the speech enhancement technology and the real pure speech signal

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0030] see figure 1 , which is a flowchart of an embodiment of a method for speech enhancement of the present invention, the method includes the following steps:

[0031] Step 101: Transform the noisy speech signal to obtain a noisy speech signal in the frequency domain;

[0032] Step 102: Set the spectral variance of the previous frame of the frequency domain noisy speech signal and the weight of the square of the spectral amplitude of the previous frame by using the correlation correction parameter to obtain the spectral variance of the current frame in the pure speech signal in the frequency domain, where the said correlation correction parameter indicates a correlation between said current frame and said previous frame;

[0033] Wherein, the weight of the previous frame spectral variance and the square of the previous frame spectral amplitude is set according to the correlation correction parameter, and the spectral variance of the current frame in the pure speech signal ...

Embodiment 2

[0047]In this embodiment, the minimum mean square error estimation method for speech enhancement using the priori signal-to-noise ratio that introduces weights will be described in detail. Please refer to figure 2 As shown, it is a block diagram of the principle of speech enhancement by the minimum mean square error estimation method in the present invention, combined with figure 2 , see image 3 , which is a flow chart of a specific embodiment of a speech enhancement method of the present invention, specifically comprising the following steps:

[0048] Step 301: Obtain a speech signal with noise;

[0049] Wherein, the band noise speech signal that is set to obtain is y (n), comprises pure speech signal x (n) and noise signal d (n);

[0050] Step 302: performing Fourier transform on the acquired noisy speech signal to obtain a noisy speech signal in the frequency domain;

[0051] Wherein, it is set that the noisy speech signal y(n) is Y(k) after Fourier transform, includi...

Embodiment 3

[0075] Corresponding to the above speech enhancement method, an embodiment of the present invention also provides a speech enhancement device. see Figure 7 , which is a structural diagram of an embodiment of a speech enhancement device in the present invention, the device includes: a frequency domain transformation unit 701 , a spectral variance correction unit 702 , a priori signal-to-noise ratio acquisition unit 703 and a speech enhancement unit 704 . The internal structure and connection relationship of the device will be further introduced below in conjunction with the working principle of the device.

[0076] A frequency-domain transformation unit 701, configured to perform frequency-domain transformation processing on the noisy time-domain speech signal to obtain a noisy frequency-domain speech signal;

[0077] The spectral variance correction unit 702 is used to set the weight of the previous frame spectral variance and the square of the previous frame spectral amplit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention discloses a speech enhancement method and a speech enhancement device. The method comprises the following steps of: converting a speech signal with noise to obtain a frequency-domain speech signal with the noise; setting weight values of the spectral variance and spectrum amplitude of a previous frame in the frequency-domain speech signal with the noise by using a correlation degree correction parameter to obtain the spectral variance of a current frame in a pure frequency-domain speech signal, wherein the correlation degree correction parameter indicates the degree of correlation between the current frame and the previous frame; obtaining the prior signal to noise ratio of the current frame in the pure frequency-domain speech signal according to the spectral variance of the current frame in the pure frequency-domain speech signal and the spectral variance of the previous frame in the frequency-domain speech signal with the noise; and obtaining an enhanced pure frequency-domain speech signal by a least-mean-square error estimation method according to the prior signal to noise ratio of the current frame in the pure frequency-domain speech signal. Through the embodiment of the invention, errors introduced by the calculation of the prior signal to noise ratio in a speech enhancement process can be reduced.

Description

technical field [0001] The invention relates to the technical field of voice communication, in particular to a voice enhancement method and device. Background technique [0002] Realistic voice communication may occur in a noisy environment. For example, mobile phone communication in a factory will be affected by the roar of machinery; voice communication in a train cab will be disturbed by motor running and rail crashing. Speech enhancement is to extract the original speech as pure as possible from the noisy speech signal, thereby improving the speech quality, clarity and intelligibility of the speech. [0003] In voice communication technology, voice enhancement technology has been widely used. There are two main purposes of speech enhancement: one is to improve speech quality and eliminate background noise so that the listener can accept it without fatigue; the other is to improve the intelligibility of speech. Among them, due to the different noise characteristics, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L21/02H04L25/02G10L21/0216G10L21/0232

Inventor杨毅张清

OwnerHUAWEI TECH CO LTD

Speech enhancement method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology