Supercharge Your Innovation With Domain-Expert AI Agents!

Sound source direction determination method and device based on deep learning, equipment and medium

A technology of deep learning and determination method, applied in the field of deep learning, can solve the problem that the accuracy of determining the direction of the sound source cannot be effectively improved, and achieve the effect of improving the accuracy

Pending Publication Date: 2021-12-24
SHENZHEN EMEET TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to provide a sound source direction determination method, device, equipment and medium based on deep learning, aiming to solve the technical problem that the existing technology cannot effectively improve the accuracy of determining the sound source direction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound source direction determination method and device based on deep learning, equipment and medium
  • Sound source direction determination method and device based on deep learning, equipment and medium
  • Sound source direction determination method and device based on deep learning, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0054] refer to figure 1 , figure 1 It is a schematic diagram of the structure of the device for determining the direction of the sound source based on deep learning in the hardware operating environment involved in the solution of the embodiment of the present invention.

[0055] like figure 1As shown, the device for determining the sound source direction based on deep learning may include: a processor 1001, such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 10...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of deep learning, and discloses a sound source direction determination method and device based on deep learning, equipment and a medium, and the method comprises the steps: obtaining phase spectrum information according to a target mixed sound source signal; generating corresponding feature dimension information according to the phase spectrum information and the preset length frame sequence information; predicting the feature dimension information according to a preset convolutional recurrent neural network to obtain a DOA vector information set; determining direction information of the target mixed sound source according to the arrival vector information set. According to the invention, the feature dimension information is generated through the phase spectrum information and the preset length frame sequence information, the feature dimension information is predicted according to the preset convolutional recurrent neural network, and the direction information of the target mixed sound source is determined based on the predicted arrival vector information set, so that the direction of the target mixed sound source is determined. Compared with the prior art that the sound source direction is estimated through a traditional DOA algorithm, the accuracy of determining the sound source direction can be effectively improved.

Description

technical field [0001] The present invention relates to the technical field of deep learning, in particular to a sound source direction determination method, device, equipment and medium based on deep learning. Background technique [0002] The direction of sound source is also known as the direction of arrival (Direction of Arrival, DOA), with the recording device as the reference system, the purpose of DOA is to judge the direction of the speaker's sound source, it is usually used as the pre-processing of the speech system, and the speech The judgment of the direction of the human sound source is widely used. For example, the spatial information of the sound source needs to be obtained in advance in the beamforming algorithm, and the direction of the sound source needs to be determined in the sound source localization and sound source tracking tasks. Currently, the commonly used The technical solution for determining the direction of the sound source is to determine the di...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G01S5/18G06N3/04G06N3/08
CPCG01S5/18G06N3/08G06N3/044G06N3/045
Inventor 陈文明陈新磊张洁张世明
Owner SHENZHEN EMEET TECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More