Speech enhancement method for speech recognition in noise environment

A speech enhancement and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc.

Active Publication Date: 2018-11-16
GUILIN UNIV OF ELECTRONIC TECH
View PDF7 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem that the speech recognition rate is high in a quiet environment and the recognition rate drops sharply in a noisy environment, and proposes a speech enhancement method applied to speech

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method for speech recognition in noise environment
  • Speech enhancement method for speech recognition in noise environment
  • Speech enhancement method for speech recognition in noise environment

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0069] Examples:

[0070] Such as figure 1 As shown, a speech enhancement method applied to speech recognition in a noisy environment is to construct an MVDR beamformer based on time-frequency masking, and post an improved Wiener filter to perform speech enhancement processing on the target sound source direction. Including the following steps:

[0071] 1) The four-element microphone array model is used to receive the voice signal, and the time domain of the noisy voice signal received by the microphone array is expressed as: y m (t)=s m (t)+n m (t), m = 1, 2, ... M, where M represents the number of microphones, s m (t) represents a pure voice signal, n m (t) represents the interference noise signal;

[0072] 2) Perform short-time Fourier transform on the noisy speech signal received in step 1), and obtain the time-frequency domain signal in the form of Y m (f,t)=S m (f,t)+N m (f,t), where Y m (f,t), S m (f,t), N m (f, t) respectively represent the signal at time t, frequency f, targ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech enhancement method for speech recognition in a noise environment, and the method comprises the steps: combining the improved MVDR beam forming which is based on the time-frequency masking and employs the speech time-frequency domain sparsity principle with the improved Wiener filtering, acquiring a speech signal of a microphone array, and constructing an MVDR beamformer based on time-frequency masking, making full use of the spatial information of the speech signal, enhancing the speech signal in a target direction, suppressing the interference of noise in other directions, and then removing the residual noise and improving the speech intelligibility through a modified Wiener filter. The method is applied to a voice recognition front end, and can effectively remove noise and improve the voice intelligibility, thereby improving the recognition rate of a speech recognition system, solving a problem of how to reduce the speech distortion in the noise environment and improve the speech recognition rate of the noise environment. The method can be applied to a household robot and intelligent voice equipment.

Description

technical field [0001] The invention relates to the technical field of speech recognition in a noisy environment, in particular to a speech enhancement method applied to speech recognition in a noisy environment. Background technique [0002] With the development of computer and Internet technology, speech recognition technology has made remarkable progress, and it has gradually moved from the research of scientific research institutions to the market, and is widely used in various fields such as industry, communication, home service, and medical treatment. Speech recognition is mainly to enable machines to understand the content of human language, to perform corresponding operations, and to achieve the purpose of human-computer interaction. [0003] In recent years, speech recognition technology has developed rapidly. Single-channel speech recognition technology has achieved a high recognition rate in an ideal environment. How to improve the speech recognition rate in the a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/02G10L21/0216G10L15/26
CPCG10L15/26G10L21/02G10L21/0216G10L2021/02166
Inventor 曾庆宁刘伟波罗瀛唐滔李玉婷
Owner GUILIN UNIV OF ELECTRONIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products