Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Spectrum-entropy improvement based speech endpoint detection method in low signal-to-noise ratio environment

A low signal-to-noise ratio, endpoint detection technology, applied in speech analysis, instruments, etc., can solve problems such as detection performance degradation

Inactive Publication Date: 2017-05-10
CHONGQING UNIV OF POSTS & TELECOMM
View PDF2 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Traditional endpoint detection algorithms, such as using zero-crossing rate, short-term energy and autocorrelation parameters, can obtain better detection results in high SNR environments, but their detection performance drops sharply in low SNR environments.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spectrum-entropy improvement based speech endpoint detection method in low signal-to-noise ratio environment
  • Spectrum-entropy improvement based speech endpoint detection method in low signal-to-noise ratio environment
  • Spectrum-entropy improvement based speech endpoint detection method in low signal-to-noise ratio environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some of the embodiments of the invention.

[0026] The technical scheme that the present invention solves the above-mentioned technical problem is,

[0027] Let the time-domain waveform of the speech signal containing noise be x(n).

[0028] Step 1: Adopt a first-order high-pass filter, that is, a pre-emphasis filter, and its transfer function is H(z)=1-az -1 , the value of a is 0.95 obtained from multiple experiments. Then, the speech signal obtained after windowing and framing processing of the noisy speech signal is s w (n)=x(n)×w(n).

[0029] Step 2: After preprocessing, fast Fourier transform is performed on each frame of speech signal, and the speech signal is transformed from time domain data to frequency domain to obtain speech linear sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a spectrum-entropy improvement based speech endpoint detection method in a low signal-to-noise ratio environment. In order to solve the problem that a speech endpoint detection system is not high in accuracy rate in the low signal-to-noise ratio environment in recognition of a current speaker, the endpoint detection method capable of improving accuracy rate of speech endpoint detection in the low signal-to-noise ratio environment is provided, wherein the detection method comprises the following steps: (1) pre-processing a speech signal in accordance with characteristics of the signal; (2) in accordance with division of each frame of frequency band of the speech signal, calculating spectrum-entropy and energy of various sub-bands, so that an energy-entropy ratio SEH of the various sub-bands is finally obtained; and (3) setting a proper threshold value, and in combination with median filtering, obtaining starting and ending positions of speech. The invention aims at removing influence of environmental noise by conducting the median filtering, so that the speech signal is more stable, and the accuracy rate of endpoint detection in the low signal-to-noise ratio environment is improved.

Description

technical field [0001] The invention relates to the field of speech signal processing, and proposes a speech endpoint detection method based on spectral entropy improvement. Background technique [0002] Speech endpoint detection is an important step in speaker recognition, and effective endpoint detection is the first problem to be solved in speaker recognition processing. Traditional endpoint detection algorithms, such as using zero-crossing rate, short-term energy and autocorrelation parameters, can obtain better detection results in high SNR environments, but their detection performance drops sharply in low SNR environments. Contents of the invention [0003] The present invention aims to solve the above problems of the prior art. An improved endpoint detection method based on spectral entropy with higher accuracy is proposed. Technical scheme of the present invention is as follows: [0004] A speech endpoint detection method based on spectral entropy improvement un...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/87G10L25/84G10L25/21G10L25/18
CPCG10L25/87G10L25/18G10L25/21G10L25/84
Inventor 张毅王可佳颜博
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products