Noise robustness endpoint detection method based on likelihood ratio test

A noise-robust and endpoint-detection technology, applied in speech analysis and instrumentation, can solve the problems of detection performance degradation, truncation effect, loss of speech start or end sound, etc.

Inactive Publication Date: 2014-04-16
上海交通大学无锡研究院
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are still some problems and deficiencies in the existing speech endpoint detection technology, especially in the actual channel environment, because the spectral characteristics of the unvoiced and fricative components of the speech signal are very similar to the noise, and most of the existing endpoint detection algorithms The distinction between speech and noise is based on the syllable characteristics of the speech itself, so in the process of detecting endpoints, the start or end of the speech may be lost, resulting in a truncation effect
At the same time, most algorithms cannot fully retain all speech information, and when the signal-to-noise ratio decreases, the detection performance will also decrease significantly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Noise robustness endpoint detection method based on likelihood ratio test
  • Noise robustness endpoint detection method based on likelihood ratio test
  • Noise robustness endpoint detection method based on likelihood ratio test

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In the real environment, the noisy speech signal that our human ears can hear can be obtained by superimposing the clean speech signal and the interference signal, and the strength of the noise signal will obviously affect the performance of speech endpoint detection. At the same time, a variety of scientific research results have proved that the speech endpoint detection performance under strong SNR is significantly better than that under low SNR. Therefore, in this claim, the Wiener filter is first used to perform speech enhancement on the noisy speech signal, which can not only reduce the influence of the clean speech of the noise signal, but also have better stationary characteristics of the filtered noise signal.

[0055] Below in conjunction with accompanying drawing, technical scheme of the present invention is described in further detail:

[0056] The noisy speech x(n) is obtained by superimposing the clean speech s(n) and the interference noise d(n):

[0057] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a noise robustness endpoint detection method based on a likelihood ratio test. The improvement is achieved from the three aspects of signal to noise ratio estimation, threshold value robustness setting and trailing distortion elimination respectively, so that the suggested algorithm has a better detection property under a low signal to noise ratio environment, in particular under a non-stationary noise environment compared with the prior art. The method and a multi-observation likelihood ratio test algorithm based on harmonic wave features have similar voice boundary detection accuracy, however, the method can have better voice detection precision than the multi-observation likelihood ratio test algorithm based on the harmonic wave features, and therefore it can be proved that the method is more excellent in performance than a traditional method. Meanwhile, the method has the similar performance under the 15dB and 25dB signal to noise ratio, and it shows that the method has good robustness to noise. The noise robustness endpoint detection method can be used as an important and effective method for front end preprocessing of a voice recognition system or a voiceprint recognition system in an actual environment, and thus good application value can be achieved.

Description

technical field [0001] The invention discloses a noise robust endpoint detection method based on likelihood ratio test, which relates to the fields of speech processing and signal processing. Background technique [0002] Voice endpoint detection (VAD) is a very critical part of speech processing related technologies. It can be used not only for speech / non-speech detection in speech enhancement, but also for feature extraction and speech signal reverberation. The existing speech signal endpoint detection algorithms are mainly divided into three categories: endpoint detection methods based on time domain, endpoint detection methods based on frequency domain and endpoint detection methods based on model statistics. [0003] In practical applications, high-precision speech endpoint detection plays an extremely important role in subsequent speech enhancement, endpoint detection, speech recognition or voiceprint recognition. However, there are still some problems and deficiencie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0208G10L25/78
Inventor 包旭雷李为姚国勤朱杰董斌杭乐
Owner 上海交通大学无锡研究院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products