Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice Activity Detection Based on Multiple Voice Activity Detectors

A voice activity, detector technology, applied in instrumentation, speech analysis, speech recognition, etc., can solve problems such as difficulty in distinguishing speech from noise or other sounds, low signal-to-noise ratio, etc.

Active Publication Date: 2016-06-01
QUALCOMM INC
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

One difficulty in detecting speech in noisy environments is the sometimes very low signal-to-noise ratio (SNR) encountered
In these situations, it is often difficult to distinguish speech from noise or other sounds using known VAD techniques

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice Activity Detection Based on Multiple Voice Activity Detectors
  • Voice Activity Detection Based on Multiple Voice Activity Detectors
  • Voice Activity Detection Based on Multiple Voice Activity Detectors

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The following detailed description, which references and incorporates drawings, describes and illustrates one or more specific embodiments. These embodiments are provided not to be limiting, but merely to demonstrate and teach, these embodiments have been shown and described in sufficient detail to enable those skilled in the art to practice what is claimed. Therefore, for the sake of brevity, the description may omit certain information known to those skilled in the art.

[0020] The word "exemplary" is used throughout this disclosure to mean "serving as an example, instance, or illustration". Anything described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other methods or features.

[0021] In conventional speech processing systems, Voice Activity Detection (VAD) is typically estimated from an audio input signal, eg a microphone signal (eg of a mobile phone). VAD is an important function in many speech processing devices...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A voice activity detection VAD system includes a first voice activity detector, a second voice activity detector and control logic. The first voice activity detector is included in a device and generates a first VAD signal. The second voice activity detector is external to the device and generates a second VAD signal. The control logic combines the first and second VAD signals into a VAD output signal. Voice activity can be detected based on the VAD output signal. The second VAD signal may be represented as a flag included in packets containing digitized audio. The packet may be transmitted from the externally located VAD to the device via a wireless link.

Description

technical field [0001] This disclosure relates generally to speech processing, and more specifically, to voice activity detection. Background technique [0002] Voice activity detection (VAD) is a technique used in speech processing in which the presence or absence of human speech (voice) is detected in portions of an audio signal (which may also contain music, noise or other sounds). The main uses of VAD are in speech decoding and speech recognition. VAD can facilitate speech processing, and can also be used to deactivate some processes during non-speech segments: it can avoid unnecessary coding / transmission of silence, saving computation and network bandwidth. [0003] VAD is an important enabling technology for a variety of voice-based applications. Traditionally, VAD information is estimated locally from the input audio signal, usually in a single device, such as a communication handset. [0004] A VAD in a voice communication system should be able to detect speech in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/78
CPCG10L25/78G10L15/00
Inventor 太元·李
Owner QUALCOMM INC