Speech enhancement method for speech recognition in noise environment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech enhancement and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc.

Active Publication Date: 2018-11-16

GUILIN UNIV OF ELECTRONIC TECH

View PDF7 Cites 43 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem that the speech recognition rate is high in a quiet environment and the recognition rate drops sharply in a noisy environment, and proposes a speech enhancement method applied to speech recognition in a noisy environment. The noise component in the noisy voice signal can improve the voice recognition rate of the voice recognition system, and has a good application prospect for household voice interaction robots or mobile smart devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0070] like figure 1 As shown, a speech enhancement method applied to speech recognition in a noisy environment is to construct a MVDR beamformer based on time-frequency masking, and post an improved Wiener filter to perform speech enhancement processing on the direction of the target sound source. Including the following steps:

[0071] 1) Use the four-element microphone array model to receive the speech signal, and the time domain representation of the noisy speech signal received by the microphone array is: y m (t)=s m (t)+n m (t), m=1,2,...M, where M represents the number of microphones, s m (t) represents a pure speech signal, n m (t) represents the interference noise signal;

[0072] 2) Short-time Fourier transform is carried out to the noisy speech signal received in step 1), and the representation form of the time-frequency domain signal obtained is Y m (f,t)=S m (f,t)+N m (f,t), where Y m (f,t), S m (f,t),N m (f, t) respectively represent the signal at time...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech enhancement method for speech recognition in a noise environment, and the method comprises the steps: combining the improved MVDR beam forming which is based on the time-frequency masking and employs the speech time-frequency domain sparsity principle with the improved Wiener filtering, acquiring a speech signal of a microphone array, and constructing an MVDR beamformer based on time-frequency masking, making full use of the spatial information of the speech signal, enhancing the speech signal in a target direction, suppressing the interference of noise in other directions, and then removing the residual noise and improving the speech intelligibility through a modified Wiener filter. The method is applied to a voice recognition front end, and can effectively remove noise and improve the voice intelligibility, thereby improving the recognition rate of a speech recognition system, solving a problem of how to reduce the speech distortion in the noise environment and improve the speech recognition rate of the noise environment. The method can be applied to a household robot and intelligent voice equipment.

Description

technical field [0001] The invention relates to the technical field of speech recognition in a noisy environment, in particular to a speech enhancement method applied to speech recognition in a noisy environment. Background technique [0002] With the development of computer and Internet technology, speech recognition technology has made remarkable progress, and it has gradually moved from the research of scientific research institutions to the market, and is widely used in various fields such as industry, communication, home service, and medical treatment. Speech recognition is mainly to enable machines to understand the content of human language, to perform corresponding operations, and to achieve the purpose of human-computer interaction. [0003] In recent years, speech recognition technology has developed rapidly. Single-channel speech recognition technology has achieved a high recognition rate in an ideal environment. How to improve the speech recognition rate in the a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L21/02G10L21/0216G10L15/26

CPCG10L15/26G10L21/02G10L21/0216G10L2021/02166

Inventor 曾庆宁刘伟波罗瀛唐滔李玉婷

Owner GUILIN UNIV OF ELECTRONIC TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speech enhancement method for speech recognition in noise environment

What is Al technical title? Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document. A speech enhancement and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc.

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech enhancement and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc.

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology