Sound source positioning method and system based on deep neural network
A deep neural network and sound source localization technology, applied in the field of sound source localization method and system based on deep neural network, can solve the problem of immature sound source localization method, etc., and achieve good scalability and good algorithm robustness. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] The first embodiment provides a sound source localization method based on a deep neural network, including a training phase of the deep neural network and a testing phase of the deep neural network, such as Figure 1-2 shown, including steps:
[0054] S11. Acquire the voice signal received by the microphone, and generate a voice data set from the acquired voice signal; wherein, the voice data set includes a training data set and a test data set;
[0055] S12. Perform first preprocessing on the speech signal in the generated speech data set;
[0056] S13. Calculate the phase-weighted generalized cross-correlation function of the sound source signal corresponding to the preprocessed speech signal;
[0057] S14. Obtain the time delay information corresponding to the peak of the phase-weighted generalized cross-correlation function, and use the obtained time delay information as the TDOA observation value of the sound source signal arriving at the microphone; and obtain th...
Embodiment 2
[0111] This embodiment provides a sound source localization system based on a deep neural network, including:
[0112] The first acquiring module is used to acquire the voice signal received by the microphone, and generate a voice data set from the acquired voice signal; wherein, the voice data set includes a training data set and a test data set;
[0113] A first preprocessing module, configured to perform first preprocessing on the speech signals in the generated speech data set;
[0114] A calculation module, configured to calculate a phase-weighted generalized cross-correlation function of the sound source signal corresponding to the preprocessed speech signal;
[0115] The second obtaining module is used to obtain the time delay information corresponding to the peak of the phase-weighted generalized cross-correlation function, and use the obtained time delay information as the TDOA observation value of the sound source signal arriving at the microphone; and obtain the tim...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


