Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method

A vector sensor and target voice technology, applied in voice analysis, instruments, etc., can solve the problems of restricting the application of small mobile devices, difficulty in enhancing target voice, large microphone array, etc., to achieve easy real-time operation, suppression of interference voice, suppression of background noise effect

Active Publication Date: 2014-10-15
SHENZHEN HIAN SPEECH SCI & TECH CO LTD
View PDF5 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional single-channel speech enhancement method is usually simple to implement and has obvious effects on incoherent noise, but it is difficult to enhance the target speech in noisy human voice environments (multiple speakers exist); the speech enhancement technology based on microphone arrays uses the signal Space-time spectrum information has a strong ability to suppress spatial interference and noise, and can obtain better performance than single-channel speech enhancement, but the speech enhancement performance increases with the increase in the number of microphones, so the volume of the microphone array is large, which limits Applications of this type of technology on small mobile devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method
  • Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method
  • Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0023] For example, the AVS received signal (1) is sampled at a sampling rate of 16kHz, and windowed and framed. The short time window of the frame is Hanning window, the window length K=1024 sampling points, the Fourier transform points are also K, and the frame shift 50%, get the time-spectrum data of two channels

[0024] X u ( k , l ) = u s S ( k , l ) + Σ i = 1 I u i N i ( k , ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method. According to the method, the arrival angle of the target voice is known, a method of combining a fixed beam former and a post-positioned Wiener filter is adopted for realizing target voice enhancement, and calculation of the weight value of the post-positioned Wiener filter involves self-power spectrum estimation of the target voice. Time frequency sparse characteristics of a voice signal are used, the time frequency point correlation arrival angle for receiving audio signals is estimated through calculating the ISDR (Inter-sensor data ratio) of component signals outputted by two gradient sensors in the AVS, time frequency mask is designed through calculating errors between the time frequency point correlation arrival angle and a target arrival angle, and thus self-power spectrum estimation of the target voice is acquired. According to the method of the invention, any noise prior knowledge does not needed, the target voice can be effectively enhanced in a complicated environment where multiple speakers exist, and interference voice can background noise can be suppressed. In addition, the operation complexity is low, the adopted microphone array size is small (about 1cm<3>), and application on a portable device is excessively facilitated.

Description

technical field [0001] The invention relates to a single acoustic vector sensor target voice enhancement method based on a time-frequency mask, and belongs to the technical field of voice signal processing. Background technique [0002] Speech enhancement is one of the core technologies in the field of speech processing. In the actual complex environment, when the microphone picks up the voice signal, it will inevitably be interfered by the noise of the surrounding environment, the noise of the transmission medium, the internal electrical noise of the communication equipment, the reverberation of the room, and the voice of other speakers, so the quality of the voice picked up is affected. influences. In order to reduce the impact of noise on speech and obtain high-quality speech, the requirements for speech enhancement technology are put forward. The traditional single-channel speech enhancement method is usually simple to implement and has obvious effects on incoherent no...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/02G10L21/0224
Inventor 邹月娴王鹏石伟
Owner SHENZHEN HIAN SPEECH SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products