Beamforming method and system based on time-frequency masking value estimation

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of time-frequency masking and beamforming, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of not fully utilizing phase information, long iteration time, affecting performance, etc., to solve the mismatch of training and test data, and good application Foreground, Accuracy Improvement Effect

Active Publication Date: 2021-10-29

PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU +1

View PDF3 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

In the existing speech enhancement processing, the time-frequency masking value estimation based on the neural network has the problem of training-test data mismatch, which affects the performance, and the time-frequency masking value estimation based on the spatial domain clustering has the problem of long iteration time. The value masking value uses the amplitude information of the feature, but does not make full use of the phase information in the feature. The estimation accuracy of the probability of speech and noise needs to be improved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026] In order to make the purpose, technical solution and advantages of the present invention more clear and understandable, the present invention will be further described in detail below in conjunction with the accompanying drawings and technical solutions.

[0027] Embodiment of the present invention, see figure 1 As shown, a beamforming method based on time-frequency masking value estimation is provided for speech enhancement in speech recognition applications, including the following content:

[0028] S101. Obtain a multi-channel speech sequence, perform Fourier transform on the speech sequence and extract amplitude spectrum features and spatial features;

[0029] S102. Obtain a multi-channel speech spectrum feature sequence by logarithmically transforming the amplitude spectrum feature; send the multi-channel speech spectrum feature sequence into a pre-trained and optimized neural network model, and obtain a complex-valued time-frequency masking value through the neura...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention belongs to the technical field of speech enhancement, and in particular relates to a beamforming method and system based on time-frequency masking value estimation. Features The multi-channel speech spectrum feature sequence is obtained through logarithmic transformation, which is sent to the pre-trained and optimized neural network model to obtain the complex-valued time-frequency masking value; the complex-valued time-frequency masking value is converted into the probability of speech existence, and the time-frequency masking is obtained by using the probability model value; calculate the covariance matrix of the speech signal from the time-frequency masking value and the multi-channel speech eigensequence, and perform eigenvalue decomposition on the covariance matrix to obtain the beamforming filter coefficients; combine the beamforming filter coefficients, use the beamforming filter to The speech sequence is processed by speech feature filtering to obtain an enhanced speech signal. The invention integrates the neural network and the spatial clustering to estimate the time-frequency masking value, and improves the performance of beam forming and voice recognition.

Description

technical field [0001] The invention belongs to the technical field of speech enhancement, and in particular relates to a beamforming method and system based on time-frequency masking value estimation. Background technique [0002] Speech coding and speech recognition research is often carried out under laboratory conditions, that is, in environments with high signal-to-noise ratios or in noise-free environments. Therefore, when speech processing moves from the laboratory to practical application, many methods cannot be used due to the existence of actual environmental noise and interference, and the performance drops rapidly. Therefore, it is a practical problem that must be solved to study the processing of improving the auditory effect or improving the signal-to-noise ratio of the noise-reduced speech. The essence of speech enhancement is speech noise reduction. In other words, in daily life, the speech collected by the microphone is usually "polluted" speech with differ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L21/0216G10L25/30G10L15/20

CPCG10L15/20G10L21/0216G10L25/30G10L2021/02166

Inventor 屈丹郭晓波杨绪魁邱泽宇李真郝朝龙魏雪娟

Owner PLA STRATEGIC SUPPORT FORCE INFORMATION ENG UNIV PLA SSF IEU

Beamforming method and system based on time-frequency masking value estimation

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology