Voice detection method under noise condition

A speech detection technology for noise conditions, applied in speech analysis, instruments, etc. It addresses three problems of earlier methods: speech characteristics are not considered, performance is difficult to guarantee, and the amount of calculation is large.

Inactive Publication Date: 2011-02-09
HARBIN ENG UNIV
6 Cites, 48 Cited by

AI Technical Summary

Problems solved by technology

For example, "A Threshold Adaptive Speech Detection System", disclosed in the patent document with application number 200310103263.7, can update the threshold automatically, but it needs fuzzy clustering and Bayesian information processing, so a large amount of calculation is required. Because it does not consider the characteristics of the voice itself, its performance is difficult to guarantee under complex background noise and low signal-to-noise ratio.
The "Speech Detection System for Noisy Environment", disclosed in the patent document with application number 99104095.3, divides the voice into sub-bands, but only into two parts, a high frequency band and a low frequency band, and does not consider the band characteristics of the voice itself.


Examples


Embodiment Construction

[0021] The present invention is described in detail below by way of example, with reference to the accompanying drawings:

[0022] The concrete steps of the speech detection method are:

[0023] 1) Frame the input signal and transform it into the frequency domain;

[0024] 2) Divide the frequency domain into a plurality of subbands of equal bandwidth, and calculate the subband power spectrum of each frame signal;

[0025] 3) If the method is still in the stage of initial noise estimation and initial speech detection threshold estimation, carry out the initial noise estimation and initial detection threshold processing and go to step 1); otherwise go to step 4);

[0026] 4) Subtract the noise energy in each subband to obtain the denoised subband power spectrum;

[0027] 5) Calculate the mean-square deviation of the subband power spectrum of each frame signal;

[0028] 6) Compare the mean-square deviation of the subband power spectrum of each frame signal with the adaptive detection threshold;

[0029] 7...
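The visible steps 1) to 6) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the Hann window, the frame length, hop size, number of subbands, the number of initial noise-only frames, and the threshold rule (`k` times the largest feature seen during initialization) are all hypothetical choices, and the threshold is held fixed after initialization because the source text is cut off before it describes how the threshold adapts. The function names (`frame_signal`, `power_spectrum`, `subband_power`, `spread`, `detect`) are likewise invented for this sketch. The "mean-square deviation" is read here as the mean squared deviation of the subband powers about their mean within one frame.

```python
import cmath
import math


def frame_signal(x, frame_len, hop):
    """Step 1 (framing): split x into overlapping frames of frame_len samples."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]


def power_spectrum(frame):
    """Step 1 (transform): Hann-windowed naive DFT power for bins 0 .. n/2 - 1."""
    n = len(frame)
    # The Hann window is an assumption; the source does not name a window.
    win = [s * (0.5 - 0.5 * math.cos(2 * math.pi * t / n)) for t, s in enumerate(frame)]
    return [abs(sum(w * cmath.exp(-2j * math.pi * b * t / n)
                    for t, w in enumerate(win))) ** 2 / n
            for b in range(n // 2)]


def subband_power(spec, n_bands):
    """Step 2: total power in each of n_bands equal-width subbands."""
    width = len(spec) // n_bands
    return [sum(spec[i * width:(i + 1) * width]) for i in range(n_bands)]


def spread(values):
    """Step 5: mean squared deviation of the values about their mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def detect(x, frame_len=64, hop=32, n_bands=8, init_frames=5, k=3.0):
    """Return one speech/non-speech flag per frame (initial frames are False)."""
    bands = [subband_power(power_spectrum(f), n_bands)
             for f in frame_signal(x, frame_len, hop)]
    init = bands[:init_frames]
    # Step 3: initial noise estimate = mean subband power over the initial frames.
    noise = [sum(col) / len(init) for col in zip(*init)]

    def feature(frame_bands):
        # Step 4: subtract the noise power, flooring at zero; step 5: spread.
        return spread([max(p - n, 0.0) for p, n in zip(frame_bands, noise)])

    # Step 3: initial threshold (hypothetical rule: k times the largest initial feature).
    threshold = k * max((feature(b) for b in init), default=0.0)
    # Step 6: compare each later frame's feature with the (here fixed) threshold.
    return [False] * len(init) + [feature(b) > threshold for b in bands[init_frames:]]
```

The intuition behind the feature is that stationary broadband noise, once its estimated power is subtracted, leaves roughly similar residual power in every subband, whereas speech concentrates energy in a few subbands and so produces a large spread. The naive DFT keeps the sketch dependency-free; a real implementation would use an FFT.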

Abstract

The invention provides a voice detection method under noise conditions, and belongs to the technical fields of digital signal processing, computer artificial intelligence and pattern recognition. The method comprises the following steps: converting the input signal to the frequency domain and dividing the spectrum into subbands; calculating the power of each subband to form a subband power spectrum; calculating the mean-square deviation of the subband power spectrum of each frame and comparing it, as the detection feature, with an adaptive voice detection threshold to determine whether the current frame contains a voice signal; and, according to the detection result, applying an endpoint determination strategy to determine the start position and end position of a voice segment.
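The abstract does not say which endpoint determination strategy is used. As one common possibility, and purely as an assumption, the sketch below turns per-frame decisions into segments with a simple confirmation-and-hangover rule: a segment starts once `min_speech` consecutive frames are flagged as voice, and ends once `hangover` consecutive frames are flagged as non-voice. The function name `find_segments` and both parameters are hypothetical.

```python
def find_segments(flags, min_speech=3, hangover=5):
    """Map per-frame voice flags to (start, end) frame indices, end exclusive."""
    segments = []
    start = None       # first frame of the current candidate segment
    run = 0            # consecutive voice frames seen before confirmation
    silence = 0        # consecutive non-voice frames inside a confirmed segment
    in_speech = False
    for i, is_voice in enumerate(flags):
        if not in_speech:
            if is_voice:
                if run == 0:
                    start = i
                run += 1
                if run >= min_speech:
                    # Enough consecutive voice frames: confirm the segment.
                    in_speech = True
                    silence = 0
            else:
                run = 0
        elif is_voice:
            silence = 0
        else:
            silence += 1
            if silence >= hangover:
                # The segment ends just after the last voice frame.
                segments.append((start, i - silence + 1))
                in_speech = False
                run = 0
    if in_speech:
        # The input ended while a segment was still open.
        segments.append((start, len(flags) - silence))
    return segments
```

With this rule, short isolated detections are ignored and brief pauses inside a word do not split the segment; both thresholds would need tuning to the frame rate in use.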

Description

Technical field

[0001] The invention relates to digital signal processing, computer artificial intelligence and pattern recognition technology, in particular to a method for using a computer to detect voice in a signal.

Background technique

[0002] The accuracy of speech detection largely determines the performance of the entire speech processing system. A great deal of research has been done on speech detection and many methods have been proposed, for example algorithms based on short-term energy, short-time spectrum energy and short-term zero-crossing rate. These characteristic parameters are sensitive to background noise and do not describe the characteristics of speech well. Short-term energy and short-term zero-crossing rate, for instance, are not enough to distinguish speech from background noise when the signal-to-noise ratio is low. Speech detection algorithms based on linear prediction coefficients, cepstral coefficients, and pitc...
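For reference, the two classical features named in the background can be computed per frame as shown below. This is a generic textbook-style sketch, not code from the patent; the function names and the normalizations (mean energy per sample, crossings per adjacent sample pair) are assumptions.

```python
def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (zero counts as positive)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)
```

Both values depend directly on the noise level and noise type: added noise raises the energy of every frame and can raise the zero-crossing rate of silent frames, which is why the background describes them as unreliable at low signal-to-noise ratio.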

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L11/02, G10L21/0208
Inventor: 刘冠群, 张汝波, 李雪耀, 徐东, 杨歌, 史长亭, 刘佰龙, 张子迎, 尹清波, 林俊宇
Owner: HARBIN ENG UNIV