Speech enhancement method and device thereof, equipment and medium

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A voice enhancement and voice technology, applied in the field of signal processing, can solve problems such as long calculation time, high calculation cost, and unsatisfactory voice enhancement effect

Pending Publication Date: 2021-05-07

EVERSEC BEIJING TECH

View PDF0 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In speech enhancement applications, other deep networks can also be used, for example, convolutional neural network (Convolutional Neural Network, CNN), deep neural network (Deep Neural Networks, DNN) and recurrent neural network (Recurrent Neural Network, RNN), etc. , but CNN and DNN can only process the frequency domain signal corresponding to the speech signal frame by frame, resulting in unsatisfactory speech enhancement effects, and because the speech signal itself has the characteristics of a large amount of data, the RNN and GAN methods are limited by recursion Calculation, unable to perform parallel calculations, resulting in long calculation time and high calculation costs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0028] Figure 1a It is a flow chart of a speech enhancement method provided by Embodiment 1 of the present invention. The embodiment of the present invention is applicable to the situation where the speech noise suppression model based on the attention mechanism is used to perform speech enhancement processing on noisy speech signals. The method can be implemented by this The voice enhancement device provided by the embodiment of the invention can be implemented by means of software and / or hardware, and can generally be integrated into computer equipment, such as vehicle-mounted terminal equipment.

[0029] Such as Figure 1a As shown, the speech enhancement method provided in this embodiment specifically includes:

[0030] S110. Acquire a target noisy speech signal, and perform a short-time Fourier transform on the target noisy speech signal to obtain a target frequency domain signal corresponding to the target noisy speech signal.

[0031] The target noisy speech signal ref...

Embodiment 2

[0075] Figure 2a It is a flowchart of a speech enhancement method provided by Embodiment 2 of the present invention. This embodiment is embodied on the basis of the above embodiments, wherein, before acquiring the target noisy speech signal, it may also include:

[0076] Short-time Fourier transform is performed on the speech noise sample signal and the speech sample signal to obtain a first frequency domain signal corresponding to the speech noise sample signal and a second frequency domain signal corresponding to the speech sample signal; wherein, the speech The noisy sample signal is generated by superimposing the noise signal on the basis of the speech sample signal;

[0077] When the speech noise suppression model is trained, the feature of the current signal frame of the first frequency domain signal is input in the encoder to obtain the encoding feature corresponding to the current signal frame of the first frequency domain signal;

[0078] Inputting the encoding fea...

Embodiment 3

[0124] image 3 It is a schematic structural diagram of a speech enhancement device provided in Embodiment 3 of the present invention. The embodiment of the present invention is applicable to the situation where the speech noise suppression model based on the attention mechanism is used to perform speech enhancement processing on noisy speech signals. The device can use It can be implemented in the form of software and / or hardware, and can generally be integrated in computer equipment.

[0125] Such as image 3 As shown, the data query device specifically includes: a target frequency domain signal generation module 310 , an encoding feature generation module 320 , a decoding feature generation module 330 and a target enhanced speech signal generation module 340 . in,

[0126] The noisy speech signal processing module 310 is configured to obtain a target noisy speech signal, perform a short-time Fourier transform on the target noisy speech signal, and obtain a target frequenc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech enhancement method and a device thereof, equipment and a medium. The method comprises the following steps: acquiring a target noisy voice signal and performing short-time Fourier transform on the target noisy voice signal to obtain a target frequency domain signal corresponding to the target noisy voice signal; inputting the target feature of the current signal frame of the target frequency domain signal into an encoder in a voice noise suppression model obtained by pre-training to obtain an encoding feature corresponding to the current signal frame of the target frequency domain signal; inputting the coding feature and a decoding feature corresponding to a previous signal frame of a current signal frame of a target frequency domain signal output by a decoder in a voice noise suppression model into the decoder to obtain a decoding feature corresponding to the current signal frame of the target frequency domain signal; and performing signal reconstruction on the decoding features corresponding to each signal frame of the target frequency domain signal to obtain a target enhanced voice signal corresponding to the target noisy voice signal. According to the technical scheme, the speech enhancement effect can be improved, and calculation time and calculation cost are reduced.

Description

technical field [0001] Embodiments of the present invention relate to the technical field of signal processing, and in particular, to a speech enhancement method, device, equipment, and medium. Background technique [0002] The task of speech enhancement is to maximize the perceived quality of the speech signal and suppress the interference of background noise. Speech enhancement technology is generally based on the frequency domain signal of the speech signal or the signal characteristics of the speech signal. In the traditional method, the methods used for speech enhancement mainly include: spectral subtraction, Wiener filtering method, least quadratic method based on statistical features Most of these algorithms deal with limited conditions of noise types and rely on first-order statistical properties. To circumvent the limitations in these algorithms, deep networks have been increasingly used in noise suppression problems. [0003] At present, the methods of deep netwo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/0208G10L21/0232G10L21/0264G10L25/30

CPCG10L21/0208G10L21/0232G10L21/0264G10L25/30

Inventor梁彧傅强马多佳田野杨满智蔡琳王杰金红陈晓光

OwnerEVERSEC BEIJING TECH

Speech enhancement method and device thereof, equipment and medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements:Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology