An end-to-end sound source localization method and system based on multi-task learning
A multi-task learning, sound source localization technology, applied in the field of end-to-end sound source localization methods and systems, can solve problems such as poor localization performance and insufficient robustness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0059] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The preferred embodiments of the present invention will be described in more detail below with reference to the accompanying drawings of the present invention. figure 1 Shown is the basic block diagram of the end-to-end sound source localization algorithm based on multi-task learning proposed by the present invention. The specific implementation steps of the method of the present invention include calculation delay, input time domain signal, compensation delay, CNN extraction feature, DNN recovery signal, calculate channel-to-channel coherence, and estimate target sound source locations. The specific implementation process of each step is as follows:
[0060] 1. Calculation delay
[0061] In the scanning method, since the position to be scanned and the distribution of the microphone array are known, the time delay can also be obtained by calculating the known scanning position and microphone position, which is also know...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


