Binaural sound source positioning method based on binaural matching filter

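A binaural matched filter and sound source positioning technology, applied in the information technology field. It addresses the problems that existing methods do not consider the difference and reliability of the binaural time difference across sub-bands, do not take into account the relationship between the binaural time difference and the binaural energy difference, and have difficulty extracting the time delay.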

Active Publication Date: 2014-07-02
PEKING UNIV SHENZHEN GRADUATE SCHOOL

AI Technical Summary

Problems solved by technology

In geometric positioning, the coordinates of the sound source are calculated directly from the relationship between the binaural features and the position in the environment. This approach yields an accurate solution in theory, but it is susceptible to interference from environmental noise and reverberation.
[0017] Existing methods generally adopt ideas similar to pattern recognition and do not consider the relationship between the binaural time difference and the binaural energy difference. Most compute the two in independent modules, for example using generalized cross-correlation (with various weighting functions) for the binaural time difference and the logarithmic energy ratio for the binaural energy difference. Weighted generalized cross-correlation was mostly proposed to overcome the difficulty of extracting the time delay in different environments; it does not consider the difference and reliability of the binaural time difference on each sub-band.
Therefore, the traditional methods require a more complex computing system, and the global feature matching model also faces a bottleneck of exponentially growing computational complexity. A feature is needed that better reflects the mutual influence between the binaural time difference and the binaural energy difference when expressing sound source location information.
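The two independent modules that the conventional approach relies on can be sketched as follows. This is a minimal illustration, not the patent's method: it assumes PHAT weighting as one possible weighting function for the generalized cross-correlation, and the function names, the `max_itd` search limit, and the small regularization constants are hypothetical choices made for the example.

```python
import numpy as np

def gcc_phat_itd(left, right, fs, max_itd=1e-3):
    """Estimate the binaural time difference in seconds with GCC-PHAT.

    A positive result means the right channel lags the left channel.
    max_itd (assumed search limit) bounds the lags that are examined.
    """
    n = len(left) + len(right)              # zero-pad for linear correlation
    spec_l = np.fft.rfft(left, n=n)
    spec_r = np.fft.rfft(right, n=n)
    cross = spec_r * np.conj(spec_l)
    cross /= np.abs(cross) + 1e-12          # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=n)
    max_lag = max(1, int(round(max_itd * fs)))
    # Reorder so that index 0 corresponds to lag -max_lag.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    lag = int(np.argmax(cc)) - max_lag
    return lag / fs

def log_energy_ild(left, right):
    """Binaural energy difference in dB as a logarithmic energy ratio (left/right)."""
    energy_l = np.sum(np.square(left)) + 1e-12
    energy_r = np.sum(np.square(right)) + 1e-12
    return 10.0 * np.log10(energy_l / energy_r)
```

Note that the two estimates are computed without reference to each other and over the full band, which is exactly the limitation the paragraph above points out: neither the coupling between the two cues nor the per-sub-band reliability of the time difference is modeled.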


Examples


Embodiment Construction

[0090] The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. It should be understood that the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those skilled in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.

[0091] In this implementation example, the CIPIC database of the University of California, Davis is used for testing; it covers the largest number of measured heads and the largest number of measured directions. The database is regarded as authoritative and is internationally one of the most widely used databases for sound source localization of humanoid robots. A total of 45 human heads were tested in the database, including 27 adult m...


Abstract

The invention discloses a novel binaural sound source positioning method based on a Bayesian hierarchical model. First, a reliable frequency band selection mechanism ensures that the frequency bands used to estimate the interaural time difference are reliable, which improves the accuracy of the time difference estimate. Second, the interaural intensity difference is used to shrink the candidate direction set obtained in the first layer. Third, the binaural matched filter is proposed as a novel binaural positioning feature in the third layer; it describes the differences between the binaural signals and fully reflects the relationship between the interaural time difference and the interaural intensity difference. Finally, the search space is reduced step by step over the three-layer positioning process, and the direction with the maximum probability is obtained by a Bayesian decision criterion. The hierarchical positioning system effectively reduces the number of feature matching operations, lowers the time complexity of the algorithm, and meets the real-time requirement of the sound source positioning system.
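The three-layer narrowing of the search space can be illustrated with a small sketch. Everything specific here is an assumption made for illustration, since the text above does not give the details: the template dictionary layout, the tolerances `itd_tol` and `ild_tol`, the uniform prior, and the Gaussian likelihood on the filter feature are all hypothetical, and the reliable frequency band selection step is not modeled.

```python
import numpy as np

def hierarchical_localize(itd_obs, ild_obs, filt_obs, templates,
                          itd_tol=1e-4, ild_tol=3.0, sigma=1.0):
    """Pick a direction by pruning candidates in three layers.

    templates maps a direction label to a dict with keys "itd" (seconds),
    "ild" (dB) and "filt" (feature vector); this layout is hypothetical.
    """
    directions = list(templates)

    # Layer 1: keep directions whose template time difference is close.
    layer1 = [d for d in directions
              if abs(templates[d]["itd"] - itd_obs) <= itd_tol]
    layer1 = layer1 or directions           # fall back if nothing survives

    # Layer 2: shrink the candidate set with the intensity difference.
    layer2 = [d for d in layer1
              if abs(templates[d]["ild"] - ild_obs) <= ild_tol]
    layer2 = layer2 or layer1

    # Layer 3: posterior over the remaining candidates, assuming a uniform
    # prior and an isotropic Gaussian likelihood on the filter feature.
    obs = np.asarray(filt_obs, dtype=float)
    log_post = np.array([
        -np.sum((obs - np.asarray(templates[d]["filt"], dtype=float)) ** 2)
        / (2.0 * sigma ** 2)
        for d in layer2
    ])
    post = np.exp(log_post - log_post.max())
    post /= post.sum()

    best = layer2[int(np.argmax(post))]     # maximum a posteriori decision
    return best, dict(zip(layer2, post))
```

The point of the layering is that the most expensive comparison, the one on the filter feature, is evaluated only for the candidates that survive the two cheap scalar checks, rather than for every direction in the template set.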

Description

Technical field

[0001] The invention belongs to the field of information technology and relates to a binaural sound source localization method applied in speech perception and speech enhancement, in particular to a binaural sound source localization method based on a binaural matched filter.

Background art

[0002] Binaural audio has many natural advantages for communication and multimedia experiences. In everyday interaction between people, auditory perception is one of the most effective and direct channels. In perceiving the world and acquiring information, people obtain about 70%-80% of the information through vision and about 10%-20% through hearing. Therefore, as the intelligence of robots continues to improve, auditory interaction for robots is an indispensable research direction. The auditory system of humans and other mammals has a strong sound so...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G01S5/18
CPC: G01S5/18
Inventors: 刘宏, 张结, 丁润伟
Owner: PEKING UNIV SHENZHEN GRADUATE SCHOOL