A Novel Dual Microphone Speech Detection and Enhancement Method

A voice detection and dual-microphone technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as difficulty, error, and voice distortion

Active Publication Date: 2021-03-26
成都启英泰伦科技有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] There are currently three methods for automatic speech detection, which are the short-term energy in the time domain, the zero-crossing rate, and the mean square of frequency-band energy in the frequency domain. Then it is compared with an empirical threshold. Practical applications show that these three methods have two main disadvantages: 1. With a fixed threshold, it is impossible to achieve good performance when the noise environment changes, and in practical applications the noise is usually Changeable, it is difficult to find a suitable fixed threshold to adapt to most noise scenarios; 2. The method of comparing the short-term energy or zero-crossing rate alone has unstable performance and low accuracy when the noise energy is strong. At the same time, if the accuracy of voice detection is low, statistical information such as noise power spectrum will be inaccurate, or voice information will be included by mistake, resulting in voice distortion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Novel Dual Microphone Speech Detection and Enhancement Method
  • A Novel Dual Microphone Speech Detection and Enhancement Method
  • A Novel Dual Microphone Speech Detection and Enhancement Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0081] The present invention will be described in further detail below in conjunction with the examples and specific implementation methods, but this should not be interpreted as the scope of the above-mentioned subject of the present invention being limited to the following examples, and all technologies realized based on the content of the present invention belong to the present invention scope.

[0082] Such as figure 1 Shown, a kind of novel two-microphone speech detection and enhancement method, it comprises the following steps:

[0083] Step 1, loading the current frame data, the current frame data is voice data in the time domain;

[0084] Step 2: Convert the speech data in the time domain into speech data in the frequency domain by fast Fourier transform (FFT), corresponding to the nth time frame, and the speech data in the time domain is abbreviated as [y m ((n-1)L w +1), y m ((n-1)L w +2),…,y m (nL w )], m=1,2, wherein m represents the label of two microphones...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of speech information processing technology and microphone array signal processing and especially relates to the fields of voice activity detection, voice detectionand speech recognition and interaction and the like. The method can effectively detect time frame of a voice activity and can also carry out dynamic regulation on threshold according to noise transform through two dynamic threshold update policies, by taking variability of noise environment into full consideration and based on three groups of auditory features capable of reflecting the ratio of noise energy in the total energy. The detection result can be corrected again through a detection result buffer mode, thereby preventing the defect of detection miss between continuous active voice frames; a noise power spectral density matrix is subjected to adaptive update according to the speech detection result; and furthermore, speech enhancement can be carried out through a Weiner filter, so that the noise can be suppressed under the minimum mean square error criterion.

Description

technical field [0001] The invention relates to the field of speech recognition and detection, in particular to a method for double-microphone speech detection and enhancement based on a dynamic threshold updating strategy. Background technique [0002] Affected by machine learning technologies such as deep neural networks, the accuracy of speech recognition has been greatly improved, and speech recognition has begun to be widely used in various fields. At present, speech recognition technology is mostly used in various electronic devices such as mobile phones, air conditioners, and TVs. Compared with traditional remote controls, the human-computer interaction technology of speech recognition is more convenient, and it is a new generation of information query and information recommendation without interactive interface. The key to human-computer interaction technology. [0003] At present, in the absence of strong noise interference and near-speaking, the accuracy of speech...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/04G10L21/02G10L21/04
CPCG10L15/04G10L21/02G10L21/04G10L2021/02165
Inventor 何云鹏高君效张来许兵
Owner 成都启英泰伦科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products