Multichannel voice detection in adverse environments

A voice and detector technology, applied in the field of voice activity detection system, can solve complex VAD problems and other problems

Inactive Publication Date: 2005-10-05
SIEMENS AG
View PDF0 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The diversity and changing nature of speech and background noise complicates the VAD problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multichannel voice detection in adverse environments
  • Multichannel voice detection in adverse environments
  • Multichannel voice detection in adverse environments

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Preferred embodiments of the present invention will be described hereinafter with reference to the accompanying drawings. In order not to obscure the invention in unnecessary detail, in the following description, well-known functions or constructions are not described in detail.

[0017] A multi-channel VAD (Voice Activity Detection) system and method for determining the presence or absence of speech in a signal is provided. Spatial localization is key to supporting the invention, which can be used equally for speech and non-speech signals of interest. To illustrate the invention, assume the following situation: a target source (such as a speaking person) is located in a noisy environment, and two or more microphones record an audio mix. For example, if Figure 1A with Figure 1B As shown, two signals are measured in a car by two microphones (one microphone 102 is fixed in the car and the second microphone 104 may be fixed in the car or located in a mobile phone 106)....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A multichannel source activity detection system, e.g., a voice activity detection (VAD) system, and method that exploits spatial localization of a target audio source is provided. The method includes the steps of receiving a mixed sound signal by at least two microphones (102, 104); Fast Fourier transforming each received mixed sound signal into the frequency domain (110); filtering the transformed signals to output a signal corresponding to a spatial signature of a source (120); summing an absolute value squared of the filtered signal over a predetermined range of frequencies (122); and comparing the sum to a threshold to determine if a voice is present (124). Additionally, the filtering step includes multiplying the transformed signals by an inverse of a noise spectral power matrix (132), a vector of channel transfer function ratios (130), and a source signal spectral power (128).

Description

technical field [0001] The present invention generally relates to digital signal processing systems, and more particularly, the present invention relates to voice activity detection systems and methods in hostile environments, such as noisy environments. Background technique [0002] In the practice of digital processing, voice (and more generally sound source) activity detection (VAD) is a fundamental problem, and VAD often has a greater impact on the overall performance of the system than any other component. Speech coding under noisy conditions, multimedia communication (speech and data), speech enhancement, and speech recognition are very important applications where a good VAD method or system can substantially enhance the performance of the respective systems. The task of the VAD method is mainly to extract features of the acoustic signal that highlight the difference between speech and noise and classify them to make the final VAD decision. The variety and changing n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L11/02G10L21/02
CPCG10L25/78G10L2021/02165
Inventor R·V·巴兰J·罗斯卡C·博格安特
Owner SIEMENS AG
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products