Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method

An acoustic vector sensor and target voice enhancement technology, applied in voice analysis, instruments, etc. It addresses the difficulty of enhancing a target voice and the large microphone arrays that restrict use on small mobile devices, and it achieves easy real-time operation, suppression of interfering voices, and suppression of background noise.

Active Publication Date: 2014-10-15

SHENZHEN HIAN SPEECH SCI & TECH CO LTD

Cites: 5; Cited by: 29

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Traditional single-channel speech enhancement methods are usually simple to implement and work well against incoherent noise, but they struggle to enhance the target speech in environments with competing voices (multiple speakers). Microphone-array speech enhancement exploits the space-time spectral information of the signal, strongly suppresses spatial interference and noise, and can outperform single-channel enhancement. However, its performance improves with the number of microphones, so the array is physically large, which limits the use of this type of technology on small mobile devices.

Embodiment Construction

[0022] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0023] For example, the AVS received signal (1) is sampled at 16 kHz, then windowed and framed. The short-time window is a Hanning window of length K = 1024 samples, the Fourier transform also uses K points, and the frame shift is 50%. This yields the time-frequency spectrum data of the two channels:
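The framing step in [0023] can be sketched as follows. This is a minimal illustration using NumPy, not the patent's implementation; the function name `stft_frames` and its parameters are hypothetical, and the input is assumed to be a one-dimensional array at least K samples long. It would be applied once to each of the two gradient-sensor channels.

```python
import numpy as np

def stft_frames(x, K=1024, hop=None):
    """Short-time spectrum of x: Hann window of length K, K-point FFT,
    50% frame shift by default. Returns an array of shape (K//2 + 1, frames),
    indexed as [frequency bin k, frame l]."""
    hop = K // 2 if hop is None else hop      # 50% frame shift
    window = np.hanning(K)                    # Hanning window, length K
    n_frames = 1 + (len(x) - K) // hop        # only complete frames are kept
    frames = np.stack([x[l * hop : l * hop + K] * window
                       for l in range(n_frames)])
    # One-sided spectrum, since the input signal is real-valued.
    return np.fft.rfft(frames, n=K, axis=1).T

# Usage: a 1 kHz tone sampled at 16 kHz for one second.
fs = 16000
t = np.arange(fs) / fs
X = stft_frames(np.sin(2 * np.pi * 1000 * t))
```

At 16 kHz with K = 1024, each frequency bin spans 15.625 Hz, so the 1 kHz tone in the usage example falls exactly on bin 64.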

[0024] $X_u(k,l) = u_s S(k,l) + \sum_{i=1}^{I} u_i N_i(k,$ ...

Abstract

The invention relates to a time-frequency mask-based target voice enhancement method for a single acoustic vector sensor (AVS). The method assumes that the arrival angle of the target voice is known and enhances the target voice by combining a fixed beamformer with a Wiener post-filter; computing the weights of the Wiener post-filter requires an estimate of the auto-power spectrum of the target voice. The method exploits the time-frequency sparsity of voice signals: the arrival angle associated with each time-frequency point of the received audio signal is estimated by calculating the ISDR (inter-sensor data ratio) of the component signals output by the two gradient sensors in the AVS, a time-frequency mask is designed from the error between this per-point arrival angle and the target arrival angle, and the auto-power spectrum estimate of the target voice is obtained from the mask. The method needs no prior knowledge of the noise, effectively enhances the target voice in complicated environments where multiple speakers exist, and suppresses both interfering voices and background noise. In addition, the computational complexity is low and the microphone array is small (about 1 cm³), which greatly facilitates application on portable devices.
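The processing chain described in the abstract can be sketched as below. This is an illustrative sketch under stated assumptions, not the patent's formulas, which are not visible in this excerpt. It assumes that a source at angle θ appears in the two gradient channels with gains cos θ and sin θ, that the ISDR is the ratio X_v / X_u, that the mask is binary with a hypothetical angular tolerance `tol`, and that power spectra are smoothed recursively with a hypothetical factor `alpha`. The function names `tf_mask` and `enhance` are likewise hypothetical.

```python
import numpy as np

def tf_mask(X_u, X_v, theta_s, tol=np.deg2rad(15)):
    """Binary time-frequency mask: 1 where the per-point arrival angle is
    within tol of the target angle theta_s (angles compared modulo pi)."""
    # Assumed ISDR: Re(X_v / X_u), written without division so that
    # points where X_u is zero are handled safely.
    num = np.real(X_v * np.conj(X_u))
    den = np.abs(X_u) ** 2
    theta = np.arctan2(num, den)              # estimate in [-pi/2, pi/2]
    # Angular error wrapped into [0, pi/2].
    err = np.abs((theta - theta_s + np.pi / 2) % np.pi - np.pi / 2)
    return (err < tol).astype(float)

def enhance(X_u, X_v, theta_s, alpha=0.8, eps=1e-12):
    """Fixed beamformer steered to theta_s followed by a Wiener post-filter
    whose target power spectrum comes from the time-frequency mask."""
    # Assumed fixed beamformer: steer the two gradient channels to theta_s.
    Y = np.cos(theta_s) * X_u + np.sin(theta_s) * X_v
    p_y = np.abs(Y) ** 2                      # instantaneous output power
    p_s = tf_mask(X_u, X_v, theta_s) * p_y    # masked target power
    phi_y = np.zeros_like(p_y)
    phi_s = np.zeros_like(p_y)
    for l in range(p_y.shape[1]):             # recursive smoothing over frames
        prev_y = phi_y[:, l - 1] if l else p_y[:, 0]
        prev_s = phi_s[:, l - 1] if l else p_s[:, 0]
        phi_y[:, l] = alpha * prev_y + (1 - alpha) * p_y[:, l]
        phi_s[:, l] = alpha * prev_s + (1 - alpha) * p_s[:, l]
    W = phi_s / (phi_y + eps)                 # Wiener post-filter weight
    return W * Y
```

Both functions take spectra shaped (frequency bins, frames). The sketch relies on the time-frequency sparsity mentioned in the abstract: when a single source dominates a time-frequency point, the channel ratio depends only on that source's arrival angle, so no noise statistics are needed to build the mask.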

Description

Technical field

[0001] The invention relates to a time-frequency mask-based target voice enhancement method for a single acoustic vector sensor, and belongs to the technical field of voice signal processing.

Background

[0002] Speech enhancement is one of the core technologies in the field of speech processing. In real, complex environments, a microphone picking up a voice signal is inevitably disturbed by ambient noise, noise from the transmission medium, internal electrical noise of the communication equipment, room reverberation, and the voices of other speakers, so the quality of the picked-up voice is degraded. Speech enhancement technology is needed to reduce the impact of noise on speech and obtain high-quality speech. The traditional single-channel speech enhancement method is usually simple to implement and has obvious effects on incoherent no...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L21/02, G10L21/0224

Inventor: 邹月娴, 王鹏, 石伟

Owner: SHENZHEN HIAN SPEECH SCI & TECH CO LTD
