A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A KNN algorithm and speaker recognition technology, applied in the field of human-computer interaction, can solve the problems of a large number of data, a large number of data samples, and low recognition accuracy, and achieve the effect of rapid recognition and improved simplicity

Active Publication Date: 2021-05-04

HANGZHOU DIANZI UNIV

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The method based on template matching is to pre-train the recognized voiceprint samples and match the voiceprint to be recognized with it. This method is simple to operate, but the recognition accuracy is not high and a large number of data samples are required.

In the method based on the speech statistical model, the recognition task is defined as the probability of calculating the variable. This method has high recognition accuracy, but requires a large amount of data for verification.

In the method based on deep learning technology, the neural network is used to capture the hidden features of the speaker so that it can better represent the speaker. This method not only requires a large amount of data, but also requires the neural network to be updated every time the data set is updated. Retraining is not conducive to new information entry

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0042] The technical solutions provided by the present invention will be further described below in conjunction with the accompanying drawings.

[0043] In real life, with the increase and departure of people, if you want to recognize the speaker based on the voice, you need to add their voice signals when adding people. With the addition of new voices, you need to re-identify the existing The model is retrained so that updates are not utilized. The present invention proposes a model based on a twin network so that only newly added speech signals need to be added to the network, and the similarity between the speech signal and the speech to be tested is obtained by using the twin network for discrimination.

[0044]The present invention provides a system based on Siamese network model and KNN classification speaker recognition such as figure 1 shown. On the whole, the present invention comprises two major steps: step S1: use the microphone to collect the voice information of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speaker recognition method based on a twin network model and a KNN algorithm. Step S1: use a microphone to collect voice information of the speaker as a data set to train the RNN network model; step S2: use the trained RNN network to construct a twin network The model is combined with the KNN algorithm to identify the speaker. Adopt the technical solution of the present invention, train the data set of the speaker in the database, determine that the input of each speech signal input into the Siamese network can output the characteristics of the speaker, and use the cosine distance to calculate the difference between the different output feature vectors. The distance between them and the use of the KNN algorithm to determine whether they belong to the same speaker, so that a small number of samples can also identify the speaker, and with the increase in the number of speakers, the network does not need to be retrained, which reduces the number of data samples for the neural network requirements, while effectively improving the real-time and accuracy of speaker recognition.

Description

technical field [0001] The invention belongs to the technical field of human-computer interaction, in particular to the technical field of speaker recognition, and specifically designs a speaker recognition method based on a twin network model and a KNN algorithm. Background technique [0002] In the field of human-computer interaction, with the rapid development of artificial intelligence, pattern recognition and other technologies, the interaction between humans and computers is getting closer and closer. The traditional contact interaction methods can no longer meet people's needs. The interactive way of communication habits has become a research hotspot in recent years. As one of the main channels of human-computer interaction, speaker recognition has gradually become an important research topic in the field of interaction. [0003] Existing methods for speaker recognition mainly include methods of speech feature extraction and template matching technology, methods of s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L17/00G10L17/02G10L17/04G10L25/27

CPCG10L17/00G10L17/02G10L17/04G10L25/27

Inventor 张莉李文钧李竹

Owner HANGZHOU DIANZI UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology