Speech enhancement method, device and electronic equipment based on spatial features

A speech enhancement and spatial feature technology, applied in speech analysis, instruments, etc., can solve the problems of insufficient precision and low quantization efficiency, and achieve the effect of avoiding speech distortion and reducing noise

Active Publication Date: 2022-04-29
北京清微智能信息技术有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the problems of insufficient precision and low quantization efficiency of existing quantization methods, embodiments of the present invention provide a quantization method, device and electronic equipment for neural networks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method, device and electronic equipment based on spatial features
  • Speech enhancement method, device and electronic equipment based on spatial features
  • Speech enhancement method, device and electronic equipment based on spatial features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0078] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0079] see figure 1 , a method for speech enhancement based on spatial features provided by an embodiment of the present invention, the method includes:

[0080] S100. Perform Fourier transform on the dual-channel noisy speech to obtain a dual-channel complex spectrum represented by the dual-channel noisy speech in the frequency domain.

[0081] S110. Obtain, based on beamforming, a first single-channel complex spectrum of the two-channel complex spectrum in the target speech angle direction and a second single-channel complex spectrum of the two-channel complex spectrum in a direction different from the target speech angle by a predetermined angle.

[0082] In implementation, the beamforming formula is shown in the following equation (1):

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech enhancement method, device and electronic equipment based on spatial features. The method includes: performing Fourier transform on a dual-channel noisy speech to obtain a dual-channel complex spectrum; obtaining the first dual-channel complex spectrum based on beamforming A single-channel complex spectrum and a second single-channel complex spectrum; calculating the logarithmic power spectrum of the first single-channel complex spectrum; calculating a direction energy ratio based on the energy of the first single-channel complex spectrum and the energy of the second single-channel complex spectrum, and Take the logarithm to obtain the energy ratio in the logarithmic direction; use the logarithmic power spectrum and the energy ratio in the logarithmic direction as feature inputs to the pre-trained speech enhancement neural network to obtain the masking value; add the masking value to the first single-channel complex spectrum, and The first single-channel complex spectrum after the masking process is inversely Fourier-transformed to obtain the enhanced speech. The solution provided by the embodiment of the present invention can effectively reduce noise and better avoid voice distortion.

Description

technical field [0001] The invention relates to the technical field of speech enhancement, in particular to a speech enhancement method, device and electronic equipment based on spatial features. Background technique [0002] Speech enhancement has always played an important role in the field of speech signal processing. The traditional speech enhancement method mainly estimates the spectral information of the noise, and then subtracts the noise from the original speech spectrum. However, sudden noise and random noise will make the spectrum The estimation of information becomes difficult. At the same time, the traditional method also needs to make independent assumptions on the signal and Gaussian assumptions on the feature distribution in advance. These assumptions are equivalent to setting boundaries for speech enhancement, resulting in limited noise reduction effects. [0003] Based on this, the neural network based on deep learning is widely used in the field of speech e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L21/02G10L21/0232G10L25/18G10L25/21G10L25/30
CPCG10L21/0232G10L25/30G10L25/21G10L25/18
Inventor 苏家雨王博欧阳鹏
Owner 北京清微智能信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products