Speaker recognition method and device based on clustering, equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speaker recognition and speaker technology, applied in speech analysis, instruments, etc., can solve the problem of low efficiency of multiple speaker recognition and achieve the effect of improving efficiency

Pending Publication Date: 2021-12-28

PING AN TECH (SHENZHEN) CO LTD

View PDF0 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The present invention provides a speaker recognition method, device, equipment and storage medium based on clustering, so as to solve the technical problem of low efficiency in identifying multiple speakers in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0024] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0025] The embodiments of the present application may acquire and process relevant data based on artificial intelligence technology. Among them, artificial intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .

[0026] Ar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speaker recognition method and device based on clustering, equipment and a storage medium, belonging to the technical field of artificial intelligence. The method provided by the invention comprises the steps of performing segmentation processing on a to-be-determined audio to obtain at least two target voice segments; extracting a Mel-frequency cepstral coefficient of each target voice segment, and inputting the Mel-frequency cepstral coefficients into a time delay neural network for feature extraction to obtain acoustic features of each target voice segment; inputting each acoustic feature into a pre-trained speech recognition model for embedding generation to obtain speaker embedding of each target speech segment; and clustering each speaker embedding through a clustering algorithm to obtain a clustering result, and determining the identity of a speaker according to the clustering result. The method and the device are used for improving the efficiency of identifying a plurality of speakers.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a speaker recognition method, device, equipment and storage medium based on clustering. Background technique [0002] Voiceprint Recognition (VPR) is a kind of biometric information recognition technology, also known as speaker recognition (Speaker Recognition, SR), which is a technology for judging the speaker's identity through sound. Compared with traditional identification technology, the advantage of voiceprint recognition is that each person's voiceprint features are unique, and it is not easy to forge and counterfeit. Because voiceprint recognition has the characteristics of safety, reliability, and convenience, it is widely used in occasions where identification is required. [0003] In the scene where multiple people speak alternately, such as a meeting, it is necessary to identify who the speaker is at the current time point, so as to determine ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/18G10L17/14G10L17/02G10L25/24

CPCG10L17/18G10L17/14G10L17/02G10L25/24

Inventor 张旭龙王健宗

Owner PING AN TECH (SHENZHEN) CO LTD

Speaker recognition method and device based on clustering, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology