Voice sound source positioning method using microphone array

A microphone array and sound source localization technology, which is applied in positioning, speech analysis, instruments, etc., can solve problems such as noise interference and reverberation, and achieve the effect of reducing interference and strong feature learning ability

Active Publication Date: 2020-02-25
NANJING UNIV
View PDF11 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In a large number of practical application scenarios, there is not only reverberation, but also noise interference. Most current methods cannot maintain high accuracy and robustness in such complex environments.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice sound source positioning method using microphone array
  • Voice sound source positioning method using microphone array
  • Voice sound source positioning method using microphone array

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various aspects of the present invention All modifications of the valence form fall within the scope defined by the appended claims of the present application.

[0025] This embodiment is carried out in simulation, providing a method based on the UNET structure and utilizing a microphone array for localizing speech sound sources, which is suitable for environments with high interference and high reverberation, and can be applied to arrays of different shapes, including the following steps:

[0026] 1. Generate training samples, obtain time-frequency domain signals, and obtain power envelopes.

[0027] Arrange speech or interf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice sound source positioning method using a microphone array. The voice sound source positioning method comprises the following steps: (1) generating a training sample to obtain a time-frequency domain signal, and acquiring a power envelope; (2) judging whether each time-frequency point of the time-frequency domain signal is a voice direct sound signal or not; (3) training a neural network of a UNET structure by using the sample generated in the step (1); (4) predicting a time-frequency point corresponding to the voice direct sound of a to-be-detected noisy signal by using a trained neural network of the UNET structure; and (5) when time-frequency point is judged to be the time-frequency point of the voice direct sound, applying a positioning method to obtain apositioning result. According to the voice sound source positioning method disclosed by the invention, the influence of interference and reverberation can be effectively removed in a high reverberation and high interference environment, and a result with high accuracy and robustness is obtained.

Description

technical field [0001] The invention relates to a voice sound source positioning method based on a UNET structure and using a microphone array in a high-interference and high-reverberation environment, and belongs to the technical field of voice signal processing. Background technique [0002] The purpose of Speech Source Localization (SSL) is to estimate the angle (Direction-of-Arrival, DOA) at which the speech signal arrives at the microphone array. Sound source localization, or DOA estimation, of speech signals using a microphone array is a very important and hot topic in acoustic signal processing. It plays a very important role in capturing sound in many application scenarios, such as human-computer voice interaction of smart devices, lens tracking and intelligent monitoring. However, the difficulty lies in that the speech signal is a broadband non-stationary random process, and there are also background noise, reverberation and other interference sources. [0003] Cl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/30G10L25/51G01S5/18
CPCG10L25/30G10L25/51G01S5/18
Inventor 王浩卢晶
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products