Supercharge Your Innovation With Domain-Expert AI Agents!

A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm

A KNN algorithm and speaker recognition technology, applied in the field of human-computer interaction, can solve the problems of a large number of data, a large number of data samples, and low recognition accuracy, and achieve the effect of rapid recognition and improved simplicity

Active Publication Date: 2021-05-04
HANGZHOU DIANZI UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method based on template matching is to pre-train the recognized voiceprint samples and match the voiceprint to be recognized with it. This method is simple to operate, but the recognition accuracy is not high and a large number of data samples are required.
In the method based on the speech statistical model, the recognition task is defined as the probability of calculating the variable. This method has high recognition accuracy, but requires a large amount of data for verification.
In the method based on deep learning technology, the neural network is used to capture the hidden features of the speaker so that it can better represent the speaker. This method not only requires a large amount of data, but also requires the neural network to be updated every time the data set is updated. Retraining is not conducive to new information entry

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm
  • A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm
  • A Speaker Recognition Method Based on Siamese Network Model and KNN Algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions provided by the present invention will be further described below in conjunction with the accompanying drawings.

[0043] In real life, with the increase and departure of people, if you want to recognize the speaker based on the voice, you need to add their voice signals when adding people. With the addition of new voices, you need to re-identify the existing The model is retrained so that updates are not utilized. The present invention proposes a model based on a twin network so that only newly added speech signals need to be added to the network, and the similarity between the speech signal and the speech to be tested is obtained by using the twin network for discrimination.

[0044]The present invention provides a system based on Siamese network model and KNN classification speaker recognition such as figure 1 shown. On the whole, the present invention comprises two major steps: step S1: use the microphone to collect the voice information of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speaker recognition method based on a twin network model and a KNN algorithm. Step S1: use a microphone to collect voice information of the speaker as a data set to train the RNN network model; step S2: use the trained RNN network to construct a twin network The model is combined with the KNN algorithm to identify the speaker. Adopt the technical solution of the present invention, train the data set of the speaker in the database, determine that the input of each speech signal input into the Siamese network can output the characteristics of the speaker, and use the cosine distance to calculate the difference between the different output feature vectors. The distance between them and the use of the KNN algorithm to determine whether they belong to the same speaker, so that a small number of samples can also identify the speaker, and with the increase in the number of speakers, the network does not need to be retrained, which reduces the number of data samples for the neural network requirements, while effectively improving the real-time and accuracy of speaker recognition.

Description

technical field [0001] The invention belongs to the technical field of human-computer interaction, in particular to the technical field of speaker recognition, and specifically designs a speaker recognition method based on a twin network model and a KNN algorithm. Background technique [0002] In the field of human-computer interaction, with the rapid development of artificial intelligence, pattern recognition and other technologies, the interaction between humans and computers is getting closer and closer. The traditional contact interaction methods can no longer meet people's needs. The interactive way of communication habits has become a research hotspot in recent years. As one of the main channels of human-computer interaction, speaker recognition has gradually become an important research topic in the field of interaction. [0003] Existing methods for speaker recognition mainly include methods of speech feature extraction and template matching technology, methods of s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/00G10L17/02G10L17/04G10L25/27
CPCG10L17/00G10L17/02G10L17/04G10L25/27
Inventor 张莉李文钧李竹
Owner HANGZHOU DIANZI UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More