Voice activity detection method combined with voice enhancement
A voice activity detection and voice enhancement technology, which is applied in voice analysis, neural learning methods, biological neural network models, etc., can solve the problems of limited VAD performance improvement, and achieve the effects of improving robustness, improving work efficiency, and high performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment
[0063] In this embodiment, two groups of sub-experiments are set up. Groups 1 and 2 are intended to illustrate the improvement effect of the algorithm on VAD tasks and SE tasks.
[0064] Group I: The present invention expresses the proposed method as a joint model using mSI-SDR loss (Multi-mSS). For comparison with Multi-mSS, a joint model (Multi-SS) using SI-SDR loss and a model with only VAD features, denoted as single-VAD model, are trained. Multi-SS has exactly the same network structure as Multi-mSS. The target setting of its SE decoder is SI-SDR. The single-VAD model removes the SE decoder and uses only the VAD loss function vad as the optimization target. The receiver operating characteristic (ROC) curve, the area under the ROC curve (AUC) and the equal error rate (EER) were used as the evaluation indicators of VAD. The signal every 10ms is used as the value for calculating AUC and EER.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


