Speaker counting method and device based on deep learning, equipment and storage medium

A deep-learning-based speaker counting technique, applied in the field of deep learning, that addresses the problem that speaker counting accuracy cannot be effectively improved and achieves higher counting accuracy.

Pending Publication Date: 2022-01-07
SHENZHEN EMEET TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present invention is to provide a speaker counting method, device, equipment and storage medium based

Examples

Embodiment Construction

[0055] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0056] Referring to Figure 1, Figure 1 is a schematic structural diagram of the deep-learning-based speaker counting device in the hardware operating environment involved in the embodiments of the present invention.

[0057] As shown in Figure 1, the deep-learning-based speaker counting device may include: a processor 1001, such as a central processing unit (CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. The communication bus 1002 provides connection and communication between these components. The user interface 1003 may include a display screen and an input unit such as a keyboard, and optionally may also include a standard wired interface and a wireless in...

Abstract

The invention relates to the technical field of deep learning and discloses a deep-learning-based speaker counting method, apparatus, device, and storage medium. The method comprises: obtaining amplitude spectrum information and phase spectrum information from the time-domain speech signals of multiple channels in a target area; generating feature dimension information from the amplitude spectrum information, the phase spectrum information, and preset frame sequence length information; running prediction on the feature dimension information with a preset convolutional recurrent neural network model to obtain speech signal probability distribution information; and determining the number of speakers in the target area from that probability distribution. Because the number of speakers is determined from the probability distribution that the preset convolutional recurrent neural network model produces from the feature dimension information, the speakers in the target area can be counted and the speaker counting accuracy can be effectively improved.
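The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the excerpt does not say how the amplitude and phase spectra are computed, so a Hann-windowed short-time Fourier transform is assumed here, and the names `stft`, `extract_features`, and `count_speakers`, the frame length, the hop size, and the feature layout are all hypothetical. The convolutional recurrent neural network itself is not shown; `count_speakers` only assumes that the model outputs one probability row per segment, where class index k means "k speakers".

```python
import numpy as np


def stft(signal, frame_len=512, hop=256):
    """Short-time Fourier transform of a 1-D signal (Hann window, assumed)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # One complex spectrum per frame: shape (n_frames, frame_len // 2 + 1).
    return np.fft.rfft(frames, axis=1)


def extract_features(multichannel, seq_len=100, frame_len=512, hop=256):
    """Build per-segment features from a (channels, samples) array.

    Returns an array of shape (n_segments, seq_len, freq_bins, 2 * channels):
    the first `channels` feature maps are amplitude spectra, the rest are
    phase spectra. `seq_len` plays the role of the preset frame sequence length.
    """
    spectra = np.stack([stft(ch, frame_len, hop) for ch in multichannel])  # (C, T, F)
    magnitude = np.abs(spectra)
    phase = np.angle(spectra)
    feats = np.concatenate([magnitude, phase], axis=0)  # (2C, T, F)
    feats = np.transpose(feats, (1, 2, 0))              # (T, F, 2C)
    n_segments = feats.shape[0] // seq_len
    feats = feats[: n_segments * seq_len]               # drop the incomplete tail
    return feats.reshape(n_segments, seq_len, feats.shape[1], feats.shape[2])


def count_speakers(probabilities):
    """Pick the most probable speaker count for each segment.

    `probabilities` has shape (n_segments, max_speakers + 1); column k is the
    assumed probability that k speakers are active in that segment.
    """
    return np.argmax(probabilities, axis=-1)
```

In this sketch, a model would consume the output of `extract_features` and produce the probability rows passed to `count_speakers`. Taking the arg-max class is one simple decision rule; the excerpt does not state which rule the method actually uses.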

Description

Technical field

[0001] The present invention relates to the technical field of deep learning, and in particular to a speaker counting method, apparatus, device, and storage medium based on deep learning.

Background

[0002] Speaker counting refers to detecting the number of speakers in a speech signal. It usually sits in the preprocessing stage of speech-related systems and affects, to some extent, the performance of downstream tasks such as speech separation, speaker diarization, and sound source localization. Carrying out tasks such as speaker diarization and sound source localization requires determining the number of speakers contained in the speech signal within a given period of time. Counting speakers accurately and efficiently is therefore extremely important for speech-related systems, and the currently commonly used technical solutions for counting the num...

Application Information

IPC(8): G10L15/02, G10L15/16, G10L25/30, G10L25/51
CPC: G10L15/02, G10L15/16, G10L25/30, G10L25/51
Inventor: 陈文明, 陈新磊, 张洁, 张世明
Owner SHENZHEN EMEET TECH CO LTD