Phonetic end point detection method and device therefor

An endpoint detection and voice technology, applied in voice analysis, instruments, etc., can solve the problems of voice being susceptible to noise pollution, performance degradation, etc., and achieve the effect of improving accuracy and precision, and improving communication signal-to-noise ratio

Inactive Publication Date: 2009-12-09
CHINA AGRI UNIV +1
View PDF0 Cites 72 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The embodiment of the present invention provides a speech endpoint detection method and device, which are used to solve the problem in the prior art that speech recognition is easily polluted by noise and cause performance degradation in a low signal-to-noise ratio and complex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phonetic end point detection method and device therefor
  • Phonetic end point detection method and device therefor
  • Phonetic end point detection method and device therefor

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0024] In a speech recognition system, the spectral distribution of speech is usually more structured than the spectral distribution of noise, and this difference is usually described by spectral entropy. And according to this characteristic of spectral entropy, by measuring the spectral entropy of the speech time series, using the characteristic that the spectral entropy value of the random noise segment of the non-speech segment is greater than the spectral entropy value of the speech segment, the voice endpoint can be detected. Simply put, the spectral entropy voice endpoint detection method is to detect the flatness of the spectrum to achieve the purpose of voice endpoint detection. For the non-speech segment, its energy distribution in each frequency is relatively stable, which is reflected in the amount of information, and it is considered that the average amount of information contained in it, that is, the spectral entropy is large; for the speech segment, its energy is con...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a phonetic end point detection method and a device therefor. The phonetic end point detection method comprises: receiving phonetic data with noise, dividing the phonetic data with noise into plural overlapped phonetic frames, carrying out rapid Fourier variation calculation to obtain the frequency spectrum of each phonetic frame; dividing the frequency spectrum of each phonetic frame into plural sub-bands uniformly, and generating sub-band power spectral entropy density according to sub-band energy; weighing the sub-band power spectral entropy density to obtain sub-band weight power spectral entropy of each phonetic frame; judging the current phonetic frame as a noise section or phonetic section according to preset phonetic end point judgment threshold and sub-band weight power spectral entropy. The invention adopts sub-band power spectral entropy as phonetic eigenvalue of VAD judgment, and adaptively selects sub-band number and weight factor according to actual application environment, thus improving accuracy and preciseness of phonetic detection and obviously enhancing signal-to-noise ratio of communication.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice endpoint detection method and device. Background technique [0002] Realizing clear voice communication in a high-noise environment is an urgent problem to be solved by many scientists and engineers. Communication in a high-noise environment, the background noise interferes a lot with the voice signal, which can lead to unclear calls and low intelligibility in the communication system. Therefore, how to maintain a high-quality, high-definition communication system in a complex background noise environment is particularly important. [0003] In the speech system, background noise is often input together with the speech signal, so how to accurately judge the presence or absence of the speech signal in the input signal and determine its starting and ending positions becomes a problem of suppressing and removing speech noise. The key point of the voice endpoint de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L11/02G10L19/022
Inventor 刘珩程小桐刘荣袁伟军李俊俊李娟蔡乃小于宁
Owner CHINA AGRI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products