A microphone array speech enhancement system and method for audio and video information fusion

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A microphone array and speech enhancement technology, applied in speech analysis, instrumentation, computing, etc., can solve the problems of low speech quality, inconvenient monitoring, and error judgment in the estimation of incoming wave direction, so as to improve anti-noise performance, reduce The effect of the influence of ambient noise

Active Publication Date: 2020-02-18

SOUTH CHINA UNIV OF TECH

View PDF8 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the existing microphone array speech enhancement technologies are all based on air-conduction speech sensors, which have the following deficiencies in practical applications: (1) when the environmental noise is strong, the output speech quality is not high; (2) when the environment is used When there are multiple sound sources in the environment, the estimation of the direction of arrival of the microphone array is prone to misjudgment; (3) When there are multiple sound sources in the use environment, the traditional direction of arrival estimation usually selects the sound source signal with the strongest sound. Enhanced, it is inconvenient for users to specify a certain sound source for monitoring

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0048] The specific implementation steps of the present invention will be further described below in conjunction with the accompanying drawings and embodiments, but the embodiments of the present invention are not limited thereto.

[0049] The system structure of the embodiment of the present invention is as figure 1 As shown, it is composed of a video acquisition module, a microphone array receiving module, an audio and video direction of arrival joint estimation module, a microphone array speech enhancement module, and an audio and video joint speech enhancement module, wherein the video acquisition module and the audio and video arrival direction joint estimation module , audio and video joint voice enhancement module, used to collect the video signal of the speaker in the application scene; the microphone array receiving module is connected to the joint estimation module of the direction of arrival of audio and video, and the microphone array voice enhancement module, used ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a microphone array voice enhancement system and method for audio and video information fusion. The system includes a video acquisition module, a microphone array receiving module, an audio and video incoming direction joint estimation module, a microphone array speech enhancement module and an audio and video joint speech enhancement module, and the video acquisition module is used to collect the video signal of the speaker in the application scene; the microphone The array receiving module is used to receive the speaker's audio signal; the audio and video direction of arrival joint estimation module uses audio and video information to jointly estimate the direction of arrival of the speaker's audio; the microphone array voice enhancement module uses the array voice signal received by the microphone array receiving module To enhance the voice signal; the audio-video joint voice enhancement module uses voice and video signals to jointly enhance the voice twice. The invention can significantly improve the performance of the microphone array voice enhancement system, and can be widely used in video conferences, car phones, mobile video call terminals and other occasions.

Description

technical field [0001] The invention relates to the field of speech signal processing, in particular to a microphone array speech enhancement system for fusion of audio and video information. Background technique [0002] In the actual use environment, the communication equipment is susceptible to interference such as background noise and reverberation, which affects the quality and intelligibility of the voice signal. Therefore, in many communication applications, effective voice enhancement processing is required to suppress noise and improve voice. Clarity, intelligibility and comfort. [0003] At present, the commonly used speech enhancement methods mainly include two categories. One is the speech enhancement method based on a single microphone, including spectral subtraction, Wiener filter, MMSE, Kalman filter, wavelet transform, etc. This type of method uses a single microphone to receive speech signals. The noise is suppressed by filtering and processing in the time ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L21/0216G06K9/00

CPCG10L21/0216G10L2021/02166G06V40/162G06V40/165

Inventor 张军陈鑫源宁更新冯义志季飞余华陈芳炯

Owner SOUTH CHINA UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A microphone array speech enhancement system and method for audio and video information fusion

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology