Voice activity detection method and equipment based on time domain and frequency domain

A voice activity detection, frequency domain technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as low signal-to-noise ratio, background noise is human voice, etc.

Active Publication Date: 2015-03-25
BEIJING UNISOUND INFORMATION TECH +1
View PDF11 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in actual use, there are often situations where the signal-to-noise ratio is low and the background noise is also human voice. At this time, it is difficult to obtain sufficiently accurate results using traditional methods.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice activity detection method and equipment based on time domain and frequency domain
  • Voice activity detection method and equipment based on time domain and frequency domain
  • Voice activity detection method and equipment based on time domain and frequency domain

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0026] figure 1 A flow chart of a voice activity detection method based on time domain and frequency domain according to an embodiment of the present invention is shown. Such as figure 1 As shown, the method can include: step S101, adding white noise to the input speech signal; step S102, carrying out frame processing to the speech signal after adding white noise; step S103, determining the short-term energy value of each frame; step S104, determine the harmonic product spectrum value of each frame; and step S105, for each frame, determine whether the frame is a speech frame according to the short-term energy value of the frame and the harmonic product spectru...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice activity detection method and equipment based on a time domain and a frequency domain. The method comprises the steps that noise energy of voice signals is estimated; white noise is added into the input voice signals; the voice signals after the white noise is added are processed in a framing mode; a short-time energy value of each frame is determined; a harmonic wave product spectrum value of each frame is determined; for each frame, whether the frame is a voice frame is determined according to the short-time energy value and the harmonic wave product spectrum value of the frame, and voice segments contained in the voice signals are obtained. According to the voice activity detection method and equipment, a time domain analysis method and a frequency domain analysis method are combined, the adaptability is high for the actual condition, performance is good under the circumstances that the background noise is human voice and the signal to noise ratio is low, the method is easy and ingenious to realize, and the equipment can be embedded into various voice systems easily.

Description

technical field [0001] The present invention relates to the field of voice activity detection, in particular to a voice activity detection method and device based on time domain and frequency domain. Background technique [0002] Voice Activity Detection (Voice Activity Detection, VAD) is a speech processing technology for detecting the presence of a speech signal. Speech activity detection technology is mainly used for speech recognition, speech coding, etc. It can distinguish silence and speech fragments, paving the way for further processing of speech signals. The voice activity detection module is also an integral part of many voice communication systems, such as audio conferencing, voice recognition, echo cancellation, IP telephony, etc. For the speech recognition system, the accuracy of the speech activity detection module will greatly affect the subsequent work of feature extraction, model building and judgment. Therefore, it is particularly important to provide effi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/04
Inventor 关海欣
Owner BEIJING UNISOUND INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products