Speaker counting method and device based on deep learning, equipment and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of deep learning and counting method, applied in the field of deep learning, can solve the problem that the accuracy rate of speaker counting cannot be effectively improved, and achieve the effect of improving the accuracy rate

Pending Publication Date: 2022-01-07

SHENZHEN EMEET TECH CO LTD

View PDF0 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The main purpose of the present invention is to provide a speaker counting method, device, equipment and storage medium based on deep learning, aiming to solve the technical problem that the prior art cannot effectively improve the accuracy of speaker counting

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0056] refer to figure 1 , figure 1 It is a schematic structural diagram of a speaker counting device based on deep learning of the hardware operating environment involved in the solution of the embodiment of the present invention.

[0057] Such as figure 1As shown, the speaker counting device based on deep learning may include: a processor 1001, such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of deep learning, and discloses a speaker counting method and device based on deep learning, equipment, and a storage medium. The method comprises the steps: obtaining corresponding amplitude spectrum information and phase spectrum information according to the time domain voice signals of multiple channels in a target region; generating corresponding feature dimension information according to the amplitude spectrum information, the phase spectrum information and the preset frame sequence length information; predicting the feature dimension information according to a preset convolutional recurrent neural network model; and determining the number of speakers in the target area based on the predicted voice signal probability distribution information. According to the invention, the voice signal probability distribution information is obtained through the preset convolutional recurrent neural network model and the feature dimension information, the number of speakers in the target area is determined according to the voice signal probability distribution information so as to count the speakers in the target area, and the speaker counting accuracy can be effectively improved.

Description

technical field [0001] The present invention relates to the technical field of deep learning, in particular to a speaker counting method, device, equipment and storage medium based on deep learning. Background technique [0002] Speaker number detection refers to the detection of the number of speakers in a speech signal. It is usually located in the preprocessing stage of speech-related systems and affects the performance of subsequent tasks to a certain extent, such as speech separation tasks and speaker distinction tasks. , sound source localization tasks, etc., and to realize speech tasks such as speaker distinction task and sound source localization task, it is necessary to determine the number of speakers contained in the speech signal within a period of time. Therefore, how to accurately and efficiently count the number of speakers is very important Speech-related systems are extremely important, and the currently commonly used technical solutions for counting the num...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/16G10L25/30G10L25/51

CPCG10L15/02G10L15/16G10L25/30G10L25/51

Inventor陈文明陈新磊张洁张世明

OwnerSHENZHEN EMEET TECH CO LTD

Speaker counting method and device based on deep learning, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology