Sound source positioning method and system, electronic equipment and storage medium

A technology of sound source localization and algorithm, which is applied in the field of signal processing, can solve problems such as large amount of computation, decreased practicability, and reduced algorithm practicability, so as to achieve the effect of improving performance and increasing proportion

Active Publication Date: 2020-09-04
RDA MICROELECTRONICS TECH SHANGHAI CO LTD
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the performance of the algorithm will drop sharply in low signal-to-noise ratio and reverberant environments, and its large amount of calculation will reduce the practicability
[0005] At present, the improvement of the SRP-PHAT algorithm is concentrated in two aspects: one is to improve the efficiency of the algorithm by optimizing the search strategy, but the results are not good in the case of noise, reverberation, and interference; the other is to calculate the time and frequency of each time-frequency point The "energy" envelope state of the direct wave determines the direct wave. Although the performance in high reverberation and interference environments is improved, it increases the amount of computation of the original SRP-PHAT algorithm and reduces the practicability of the algorithm.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound source positioning method and system, electronic equipment and storage medium
  • Sound source positioning method and system, electronic equipment and storage medium
  • Sound source positioning method and system, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0052] This embodiment provides a sound source localization method, such as figure 1 shown, including:

[0053] Step S101. Divide the speech signal received by the reference microphone into at least one frequency sub-band; wherein the reference microphone is one of the microphones in the microphone array.

[0054] In an optional implementation manner, the microphone closest to the sound source in the microphone array is selected as the reference microphone. In this embodiment, since the reference microphone is closest to the sound source, the quality of the speech signal received by it is better than that of other microphones in the microphone array. Therefore, speech detection and sound source detection are performed based on the speech signal with better quality. The positioning result is more accurate.

[0055] In a specific implementation, the speech signal received by the reference microphone is subjected to windowing and frame division processing, and each frame of the...

Embodiment 2

[0089] This embodiment provides a sound source localization system 400, such as Figure 4 As shown, it includes a frequency band division module 401 , a speech detection module 402 and a sound source localization module 403 .

[0090] The frequency band division module is used to divide the speech signal received by the reference microphone into at least one sub-frequency band; wherein the reference microphone is one of the microphones in the microphone array.

[0091] The voice detection module is used to calculate the voice presence probability for each sub-band according to the likelihood ratio of voice relative to noise obtained in the VAD detection process in the case of detecting whether there is voice in the voice signal through VAD.

[0092] The sound source localization module is used to calculate the maximum controllable response power of the microphone array by using the SRP-PHAT algorithm weighted based on the frequency-divided voice presence probability, and deter...

Embodiment 3

[0105] Figure 5 A schematic structural diagram of an electronic device provided in this embodiment. The electronic device includes a memory, a processor, and a computer program stored on the memory and operable on the processor, and the processor implements the sound source localization method of Embodiment 1 when executing the program. Figure 5 The electronic device 3 shown is only an example, and should not impose any limitation on the functions and application scope of the embodiments of the present invention.

[0106] The electronic device 3 may be in the form of a general computing device, eg it may be a server device. Components of the electronic device 3 may include but not limited to: the at least one processor 4 mentioned above, the at least one memory 5 mentioned above, and the bus 6 connecting different system components (including the memory 5 and the processor 4 ).

[0107] The bus 6 includes a data bus, an address bus and a control bus.

[0108] The memory 5...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sound source positioning method and system, electronic equipment and a storage medium. The sound source positioning method comprises the steps of dividing a voice signal received by a reference microphone into at least one sub-band, wherein the reference microphone is one microphone in a microphone array; if voice exists in the voice signal through VAD detection, for eachsub-band, calculating the voice existence probability according to the likelihood ratio of voice relative noise obtained in the VAD detection process; and calculating the maximum controllable response power of the microphone array by using an SRP-PHAT algorithm based on sub-band voice existence probability weighting, and determining the direction of the sound source according to the angle corresponding to the maximum controllable response power. According to the invention, the voice existence probability of each frequency band is calculated by utilizing the intermediate result of the VAD detection voice and is weighted into the original SRP-PHAT algorithm, so that the sound source positioning performance under the low noise ratio is improved under the condition of not increasing the additional calculation amount.

Description

technical field [0001] The invention relates to the field of signal processing, in particular to a sound source localization method and system, electronic equipment and a storage medium. Background technique [0002] At present, speech recognition performs better in scenarios with high signal-to-noise ratio, no reverberation, and relatively close sound source distance. However, as the voice interaction scene expands to applications such as smart speakers, cars, and robots, the single-microphone system cannot meet practical applications, and the microphone array can pick up sound sources in different directions to achieve space-time signal processing to improve the performance of far-field voice interaction. been widely studied. [0003] Sound source localization is the first step in the signal processing of the microphone array and is implemented during the voice wake-up phase. Its accuracy will directly affect the effect of subsequent signal processing, usually combined w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G01S5/20G06F17/14
CPCG01S5/20G06F17/141Y02D30/70
Inventor 夏洁方思敏罗丽云
Owner RDA MICROELECTRONICS TECH SHANGHAI CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products