Sound source direction estimation method and device based on time-frequency masking and deep neural network

A technology of deep neural network and time-frequency masking, which is applied to the direction or direction finder using ultrasonic/sonic/infrasonic wave, etc., which can solve the problems of poor robustness and improve accuracy and stability. , the effect of strong robustness

Active Publication Date: 2022-03-01
ELEVOC TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to solve the technical problem of poor robustness of orientation estimation, the present disclosure provides a sound source direction estimation method, device, electronic equipment, and storage medium based on time-frequency masking and deep neural network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound source direction estimation method and device based on time-frequency masking and deep neural network
  • Sound source direction estimation method and device based on time-frequency masking and deep neural network
  • Sound source direction estimation method and device based on time-frequency masking and deep neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with aspects of the invention as recited in the appended claims.

[0060] figure 1 It is a flowchart of a sound source direction estimation method based on time-frequency masking and deep neural network according to an exemplary embodiment. The sound source orientation estimation method based on time-frequency masking and deep neural network can be used in electronic devices such as smart phones, smart homes, and computers. Such as figure 1 As shown, the sound s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The disclosure discloses a sound source orientation estimation method, device, electronic equipment, and storage medium based on time-frequency masking and deep neural networks, and belongs to the field of computer technology. The method includes: acquiring a multi-channel sound signal; performing framing, windowing and Fourier transform on each channel sound signal in the multi-channel sound signal to form a short-time Fourier transform of the multi-channel sound signal Spectrum; The short-time Fourier spectrum is iteratively calculated by the pre-trained neural network model, and the ratio film corresponding to the target signal in the multi-channel sound signal is calculated, and a plurality of ratio films are fused to form a single ratio film; The single ratio membrane performs masking and weighting on the multi-channel sound signal to determine the orientation of the target sound source. The above sound source direction estimation method and device based on time-frequency masking and deep neural network can have strong robustness in environments with low signal-to-noise ratio and strong reverberation, and improve the accuracy and stability of target sound source direction estimation.

Description

technical field [0001] The present disclosure relates to the field of computer application technology, and in particular to a sound source direction estimation method, device, electronic equipment, and storage medium based on time-frequency masking and deep neural networks. Background technique [0002] Sound source localization in noisy environments has many real-life applications, such as human-computer interaction, robotics, and beamforming. Traditionally, GCC-PHAT (Generalized Cross Correlation Phase Transform, generalized cross-correlation-phase transformation method), SRP-PHAT (Steered Response Power Phase Transform, phase transformation weighted controllable response power method) or MUSIC (Multiple Signal Classification, multiple Signal classification) and other sound source localization algorithms are the most common. However, these algorithms can only localize the loudest signal sources in the environment, which may not be the target speaker at all. For example, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G01S3/802
CPCG01S3/802
Inventor 不公告发明人
Owner ELEVOC TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products