Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech enhancement method for speech recognition in noise environment

A speech enhancement and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc.

Active Publication Date: 2018-11-16
GUILIN UNIV OF ELECTRONIC TECH
View PDF7 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem that the speech recognition rate is high in a quiet environment and the recognition rate drops sharply in a noisy environment, and proposes a speech enhancement method applied to speech recognition in a noisy environment. The noise component in the noisy voice signal can improve the voice recognition rate of the voice recognition system, and has a good application prospect for household voice interaction robots or mobile smart devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method for speech recognition in noise environment
  • Speech enhancement method for speech recognition in noise environment
  • Speech enhancement method for speech recognition in noise environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0070] like figure 1 As shown, a speech enhancement method applied to speech recognition in a noisy environment is to construct a MVDR beamformer based on time-frequency masking, and post an improved Wiener filter to perform speech enhancement processing on the direction of the target sound source. Including the following steps:

[0071] 1) Use the four-element microphone array model to receive the speech signal, and the time domain representation of the noisy speech signal received by the microphone array is: y m (t)=s m (t)+n m (t), m=1,2,...M, where M represents the number of microphones, s m (t) represents a pure speech signal, n m (t) represents the interference noise signal;

[0072] 2) Short-time Fourier transform is carried out to the noisy speech signal received in step 1), and the representation form of the time-frequency domain signal obtained is Y m (f,t)=S m (f,t)+N m (f,t), where Y m (f,t), S m (f,t),N m (f, t) respectively represent the signal at time...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech enhancement method for speech recognition in a noise environment, and the method comprises the steps: combining the improved MVDR beam forming which is based on the time-frequency masking and employs the speech time-frequency domain sparsity principle with the improved Wiener filtering, acquiring a speech signal of a microphone array, and constructing an MVDR beamformer based on time-frequency masking, making full use of the spatial information of the speech signal, enhancing the speech signal in a target direction, suppressing the interference of noise in other directions, and then removing the residual noise and improving the speech intelligibility through a modified Wiener filter. The method is applied to a voice recognition front end, and can effectively remove noise and improve the voice intelligibility, thereby improving the recognition rate of a speech recognition system, solving a problem of how to reduce the speech distortion in the noise environment and improve the speech recognition rate of the noise environment. The method can be applied to a household robot and intelligent voice equipment.

Description

technical field [0001] The invention relates to the technical field of speech recognition in a noisy environment, in particular to a speech enhancement method applied to speech recognition in a noisy environment. Background technique [0002] With the development of computer and Internet technology, speech recognition technology has made remarkable progress, and it has gradually moved from the research of scientific research institutions to the market, and is widely used in various fields such as industry, communication, home service, and medical treatment. Speech recognition is mainly to enable machines to understand the content of human language, to perform corresponding operations, and to achieve the purpose of human-computer interaction. [0003] In recent years, speech recognition technology has developed rapidly. Single-channel speech recognition technology has achieved a high recognition rate in an ideal environment. How to improve the speech recognition rate in the a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/02G10L21/0216G10L15/26
CPCG10L15/26G10L21/02G10L21/0216G10L2021/02166
Inventor 曾庆宁刘伟波罗瀛唐滔李玉婷
Owner GUILIN UNIV OF ELECTRONIC TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products