Speaker recognition method and device based on clustering, equipment and storage medium

A speaker recognition technology applied in speech analysis, instruments, and related fields. It addresses the low efficiency of recognizing multiple speakers and improves that efficiency.

Pending Publication Date: 2021-12-28
PING AN TECH (SHENZHEN) CO LTD
Cites: 0; Cited by: 2

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a clustering-based speaker recognition method, device, equipment and storage medium, so as to solve the technical problem in the prior art that identifying multiple speakers is inefficient.

Examples

Detailed Description of the Embodiments

[0024] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0025] The embodiments of the present application may acquire and process relevant data based on artificial intelligence technology. Artificial intelligence (AI) is the theory, method, technology and application system that uses digital computers, or machines controlled by digital computers, to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge, and use that knowledge to obtain the best results.

[0026] Ar...

Abstract

The invention discloses a clustering-based speaker recognition method, device, equipment and storage medium, belonging to the technical field of artificial intelligence. The method comprises: segmenting the audio to be identified to obtain at least two target speech segments; extracting the Mel-frequency cepstral coefficients (MFCCs) of each target speech segment and inputting them into a time delay neural network for feature extraction to obtain the acoustic features of each target speech segment; inputting each acoustic feature into a pre-trained speech recognition model for embedding generation to obtain a speaker embedding for each target speech segment; and clustering the speaker embeddings with a clustering algorithm to obtain a clustering result, and determining the identity of the speaker according to the clustering result. The method and device are used to improve the efficiency of identifying multiple speakers.
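The final step of the abstract, clustering the per-segment speaker embeddings, can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm, so average-linkage agglomerative clustering on cosine distance is an assumption made here for illustration. The function names, the `threshold` value, and the toy three-dimensional embeddings are all hypothetical; real embeddings would come from the time delay neural network and the pre-trained model described above.

```python
import math
from typing import List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def cluster_embeddings(
    embeddings: List[Sequence[float]], threshold: float = 0.3
) -> List[int]:
    """Average-linkage agglomerative clustering (an assumed choice).

    Returns one cluster label per embedding; each label stands for
    one speaker. Merging stops once the closest pair of clusters is
    further apart than `threshold` (a hypothetical value).
    """
    clusters = [[i] for i in range(len(embeddings))]

    def linkage(c1: List[int], c2: List[int]) -> float:
        # Mean pairwise distance between the members of two clusters.
        total = sum(
            cosine_distance(embeddings[i], embeddings[j])
            for i in c1
            for j in c2
        )
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Find the closest pair of clusters.
        dist, p, q = min(
            (
                (linkage(clusters[p], clusters[q]), p, q)
                for p in range(len(clusters))
                for q in range(p + 1, len(clusters))
            ),
            key=lambda item: item[0],
        )
        if dist > threshold:
            break
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]

    # Number clusters by their earliest segment so labels are deterministic.
    labels = [0] * len(embeddings)
    for label, members in enumerate(sorted(clusters, key=min)):
        for i in members:
            labels[i] = label
    return labels


# Toy embeddings for five speech segments (made-up values).
segment_embeddings = [
    [0.90, 0.10, 0.00],
    [0.10, 0.90, 0.10],
    [0.85, 0.15, 0.05],
    [0.00, 0.10, 0.95],
    [0.12, 0.88, 0.08],
]
labels = cluster_embeddings(segment_embeddings)
print(labels)  # [0, 1, 0, 2, 1]
```

In this toy run, segments 0 and 2 fall into one cluster, segments 1 and 4 into another, and segment 3 stands alone, which would correspond to three distinct speakers. A distance threshold is used rather than a fixed cluster count because the number of speakers in a recording is usually not known in advance.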

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, and in particular to a clustering-based speaker recognition method, device, equipment and storage medium.

Background art

[0002] Voiceprint recognition (VPR), also known as speaker recognition (SR), is a biometric recognition technology that determines a speaker's identity from their voice. Compared with traditional identification technology, its advantage is that each person's voiceprint features are unique and are not easy to forge or counterfeit. Because voiceprint recognition is safe, reliable and convenient, it is widely used wherever identification is required.

[0003] In a scenario where multiple people speak alternately, such as a meeting, it is necessary to identify who the speaker is at the current time point, so as to determine ...
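To make the meeting scenario concrete, the sketch below shows one way per-segment speaker labels could be turned into a timeline that answers "who is speaking at time t". It is an illustration only: the segment boundaries, the labels, and the helper names `merge_turns` and `speaker_at` are hypothetical and do not come from the patent text.

```python
from typing import List, Optional, Tuple

# A turn is (start_seconds, end_seconds, speaker_label).
Turn = Tuple[float, float, int]


def merge_turns(
    segments: List[Tuple[float, float]], labels: List[int]
) -> List[Turn]:
    """Merge back-to-back segments that share a speaker label into turns."""
    turns: List[Turn] = []
    for (start, end), label in zip(segments, labels):
        previous = turns[-1] if turns else None
        if (
            previous is not None
            and previous[2] == label
            and abs(previous[1] - start) < 1e-9
        ):
            # Same speaker continues: extend the previous turn.
            turns[-1] = (previous[0], end, label)
        else:
            turns.append((start, end, label))
    return turns


def speaker_at(turns: List[Turn], t: float) -> Optional[int]:
    """Return the speaker label active at time t, or None if nobody is."""
    for start, end, label in turns:
        if start <= t < end:
            return label
    return None


# Four two-second segments with hypothetical cluster labels.
segments = [(0.0, 2.0), (2.0, 4.0), (4.0, 6.0), (6.0, 8.0)]
segment_labels = [0, 0, 1, 0]

turns = merge_turns(segments, segment_labels)
print(turns)                   # [(0.0, 4.0, 0), (4.0, 6.0, 1), (6.0, 8.0, 0)]
print(speaker_at(turns, 5.0))  # 1
print(speaker_at(turns, 9.0))  # None
```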

Application Information

IPC(8): G10L17/18, G10L17/14, G10L17/02, G10L25/24
CPC: G10L17/18, G10L17/14, G10L17/02, G10L25/24
Inventors: 张旭龙, 王健宗
Owner: PING AN TECH (SHENZHEN) CO LTD