Voice activation detection (VAD), and method and apparatus for the VAD
A feature parameter, current frame technology, applied in the field of activation sound detection, can solve the problems of good performance, error detection, low VAD efficiency, etc., and achieve the effect of good performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 2
[0136] Embodiment 2 of the activation tone detection (VAD) method of the present invention performs polyphase filtering on the input audio signal in sub-frames to obtain the filter bank sub-band signal, and further performs time-frequency conversion on the filter bank sub-band signal, and calculates Spectrum amplitude, signal feature extraction is performed on each filter bank sub-band signal and spectrum amplitude respectively, and each characteristic parameter value is obtained. According to the value of the characteristic parameter, the background noise mark and the tonality mark of the current frame are obtained. According to the current frame energy parameter value and background noise energy calculation, the SNR parameter of the current frame is obtained, and according to the calculated SNR parameter of the current frame, the VAD (Voice Activity Detection, Voice Activity Detection) judgment result and each feature of the previous frame Parameter to determine whether the ...
Embodiment 1
[0213] In Embodiment 1 and Embodiment 2, the process of obtaining the VAD judgment result is calculated according to the tonality flag, the signal-to-noise ratio parameter, the spectral center of gravity characteristic parameter, and the frame energy parameter, such as image 3 Shown include the following steps:
[0214] Step 301: Calculate the long-term signal-to-noise ratio lt_snr through the ratio of the average long-term activation tone signal energy and the average long-term background noise energy calculated in the previous frame;
[0215] Average long-duration activation tone signal energy E fg and the average long-term background noise energy E bg See step 307 for the calculation and definition of . The long-term signal-to-noise ratio lt_snr calculation equation is as follows:
[0216] In this formula, the long-term signal-to-noise ratio lt_snr is expressed in logarithm.
[0217] Step 302: Calculate the average value of the full-band SNR SNR2 of several recent fr...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com