Speech enhancement method and system fusing signal-to-noise ratio and intelligibility dual targets

A speech enhancement technology that fuses signal-to-noise ratio and intelligibility targets, applied in speech analysis, instruments, etc. It addresses the problem that existing enhancement results are not optimal in both suppressing noise and improving speech intelligibility, with the effect of improving intelligibility while suppressing residual noise.

Pending Publication Date: 2021-02-02

INST OF ACOUSTICS CHINESE ACAD OF SCI +1


Problems solved by technology

Existing speech enhancement methods face several optimization objectives, and a single training criterion cannot cover the errors measured from all of them. Such methods usually reach only a compromise between suppressing residual noise and improving auditory quality and intelligibility, so the enhancement result is not optimal in terms of either noise suppression or speech intelligibility.


Embodiment Construction

[0057] The present invention is further described below with reference to the accompanying drawings.

[0058] The present invention proposes a speech enhancement method that integrates the dual objectives of SNR and intelligibility. Two pre-established neural network models each enhance the original time-frequency domain features, yielding the time-frequency domain speech components that are optimal in the SNR sense and in the intelligibility sense; these form the first effective feature and the second effective feature, respectively. The first effective feature and the second effective feature are normalized column by column, and a weight matrix is obtained through point-to-point (element-wise) multiplication. According to a preset weight threshold, the positions in the weight matrix whose values are higher than the threshold are selected, and the values at the corresponding positions in the second effective feature matrix are extracted to replace the values at the corresponding positions in the fi...
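The fusion step described in [0058] can be sketched in NumPy as follows. This is only an illustrative sketch, not the patent's implementation: the function name `fuse_features`, the layout (frequency bins as rows, frames as columns), and in particular the choice of max-absolute-value scaling for the column-by-column normalization are assumptions, since the excerpt does not say which normalization is used.

```python
import numpy as np

def fuse_features(first, second, weight_threshold=0.5, eps=1e-12):
    """Fuse two enhanced time-frequency feature matrices (hypothetical helper).

    first  : output of the SNR-oriented model, shape (bins, frames)
    second : output of the intelligibility-oriented model, same shape
    """
    first = np.asarray(first, dtype=float)
    second = np.asarray(second, dtype=float)
    if first.shape != second.shape:
        raise ValueError("feature matrices must have the same shape")

    # Column-by-column normalization. ASSUMPTION: each column is divided by
    # its maximum absolute value; the source does not specify the scheme.
    first_n = first / (np.abs(first).max(axis=0, keepdims=True) + eps)
    second_n = second / (np.abs(second).max(axis=0, keepdims=True) + eps)

    # Point-to-point (element-wise) multiplication gives the weight matrix.
    weights = first_n * second_n

    # Where the weight exceeds the threshold, take the value from the second
    # feature; everywhere else keep the value of the first feature.
    fused = np.where(weights > weight_threshold, second, first)
    return fused, weights

first = np.array([[1.0, 4.0], [2.0, 1.0]])
second = np.array([[3.0, 8.0], [1.0, 1.0]])
fused, weights = fuse_features(first, second, weight_threshold=0.4)
```

With non-negative magnitude features, this normalization keeps every weight in the range 0 to 1, which makes a single preset threshold easy to interpret; a different normalization would change the useful threshold range.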


Abstract

The invention belongs to the technical field of speech enhancement signal processing, and in particular relates to a speech enhancement method integrating the dual targets of signal-to-noise ratio and intelligibility. The method comprises the following steps: converting an original speech signal into an original time-frequency domain feature; inputting the original time-frequency domain feature into a pre-established first neural network model to obtain a first effective feature in the signal-to-noise ratio sense; inputting the original time-frequency domain feature into a pre-established second neural network model to obtain a second effective feature in the intelligibility sense; processing the first effective feature and the second effective feature to obtain a weight matrix; using the weight matrix and a preset correlation weight threshold to select, column by column, the elements of the second effective feature that are highly correlated with the first effective feature; extracting the values of those elements and using them to replace the values at the corresponding positions in the first effective feature; taking the replaced first effective feature as the time-frequency domain feature after speech enhancement; and converting that feature into an enhanced speech signal.
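The first and last steps of the abstract (signal to time-frequency feature, and back) can be illustrated with a short NumPy sketch. Everything specific here is an assumption made for illustration: the abstract does not name the transform, so a short-time Fourier transform with a periodic Hann window is used; the frame length, hop size, reuse of the noisy phase, and the names `stft`, `istft`, and `enhance` are all hypothetical; and the two models and the fusion rule are passed in as plain callables rather than real neural networks.

```python
import numpy as np

def stft(signal, frame_len=256, hop=128):
    """Short-time Fourier transform; returns shape (frame_len // 2 + 1, frames)."""
    window = np.hanning(frame_len + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop:i * hop + frame_len] * window for i in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames, axis=0)

def istft(spec, frame_len=256, hop=128):
    """Inverse STFT by weighted overlap-add with the same window."""
    window = np.hanning(frame_len + 1)[:-1]
    frames = np.fft.irfft(spec, n=frame_len, axis=0)
    n_frames = frames.shape[1]
    out = np.zeros(frame_len + hop * (n_frames - 1))
    norm = np.zeros_like(out)
    for i in range(n_frames):
        start = i * hop
        out[start:start + frame_len] += frames[:, i] * window
        norm[start:start + frame_len] += window ** 2
    # Avoid dividing by zero at the edges, where the window vanishes.
    return out / np.maximum(norm, 1e-12)

def enhance(signal, snr_model, intel_model, fuse):
    """Pipeline sketch: the models and the fusion rule are placeholder callables."""
    spec = stft(signal)
    magnitude, phase = np.abs(spec), np.angle(spec)
    first = snr_model(magnitude)     # first effective feature (SNR sense)
    second = intel_model(magnitude)  # second effective feature (intelligibility sense)
    fused = fuse(first, second)
    # ASSUMPTION: the phase of the noisy input is reused for reconstruction.
    return istft(fused * np.exp(1j * phase))

# Usage with identity placeholders in place of trained networks.
rng = np.random.default_rng(0)
noisy = rng.standard_normal(2048)
restored = enhance(noisy, lambda m: m, lambda m: m, lambda a, b: a)
```

With identity placeholders the pipeline should reproduce the interior of the input signal, which is a convenient sanity check before substituting trained models.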

Description

Technical field

[0001] The invention belongs to the technical field of speech enhancement signal processing, and in particular relates to a speech enhancement method and system that integrates the dual objectives of signal-to-noise ratio and intelligibility.

Background

[0002] When a speech signal is corrupted by noise, its quality and intelligibility decrease, which degrades the user experience of speech recognition and speech perception processing based on that signal. At present, commonly used speech enhancement methods rely on estimating a mask for the speech signal and then separating the spectral components of the speech signal from the noise that covers them. Such a method is usually based on the minimum mean square error criterion: it estimates a mask, classifies the noisy speech signal components in the time-frequency domain, distinguishes the components covered by noise, and retains the components with strong speech sig...
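As a rough illustration of the mask-based approach described in the background, the sketch below builds a ratio mask from clean and noise magnitudes and scores an estimate with a mean square error. The excerpt does not say which mask the prior-art methods use, so the power-ratio (Wiener-style) form and the names `ratio_mask` and `mse` are assumptions chosen for illustration only.

```python
import numpy as np

def ratio_mask(clean_mag, noise_mag, eps=1e-12):
    """Power-ratio mask in [0, 1]; ASSUMED form, not taken from the patent."""
    clean_pow = np.asarray(clean_mag, dtype=float) ** 2
    noise_pow = np.asarray(noise_mag, dtype=float) ** 2
    # Close to 1 where speech dominates a time-frequency bin, close to 0
    # where noise dominates it.
    return clean_pow / (clean_pow + noise_pow + eps)

def mse(estimate, target):
    """Mean square error, the kind of criterion a mask estimator is trained on."""
    diff = np.asarray(estimate, dtype=float) - np.asarray(target, dtype=float)
    return float(np.mean(diff ** 2))

clean = np.array([[3.0, 0.0], [1.0, 2.0]])
noise = np.array([[4.0, 1.0], [1.0, 0.0]])
target_mask = ratio_mask(clean, noise)

# A mask estimator would be trained to make this error small.
error_of_all_zero_guess = mse(np.zeros_like(target_mask), target_mask)
```

Multiplying the noisy magnitude spectrum by such a mask keeps speech-dominated bins and attenuates noise-dominated ones, which is the separation step the background paragraph refers to.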


Application Information


IPC(8): G10L21/0224, G10L21/0232, G10L25/30, G10L25/45, G10L25/60

CPC: G10L21/0224, G10L21/0232, G10L25/30, G10L25/45, G10L25/60

Inventor: 张鹏远, 战鸽, 颜永红

Owner: INST OF ACOUSTICS CHINESE ACAD OF SCI
