Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker recognition method based on twin network model and KNN (K-nearest neighbor) algorithm

A KNN algorithm and speaker recognition technology, which is applied in the field of human-computer interaction, can solve problems such as large amounts of data, large data samples, and unfavorable information entry, and achieve the effect of rapid identification and improved simplicity

Active Publication Date: 2019-09-06
HANGZHOU DIANZI UNIV
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method based on template matching is to pre-train the recognized voiceprint samples and match the voiceprint to be recognized with it. This method is simple to operate, but the recognition accuracy is not high and a large number of data samples are required.
In the method based on the speech statistical model, the recognition task is defined as the probability of calculating the variable. This method has high recognition accuracy, but requires a large amount of data for verification.
In the method based on deep learning technology, the neural network is used to capture the hidden features of the speaker so that it can better represent the speaker. This method not only requires a large amount of data, but also requires the neural network to be updated every time the data set is updated. Retraining is not conducive to new information entry

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker recognition method based on twin network model and KNN (K-nearest neighbor) algorithm
  • Speaker recognition method based on twin network model and KNN (K-nearest neighbor) algorithm
  • Speaker recognition method based on twin network model and KNN (K-nearest neighbor) algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions provided by the present invention will be further described below in conjunction with the accompanying drawings.

[0043] In real life, with the increase and departure of people, if you want to recognize the speaker based on the voice, you need to add their voice signals when adding people. With the addition of new voices, you need to re-identify the existing The model is retrained so that updates are not utilized. The present invention proposes a model based on a twin network so that only newly added speech signals need to be added to the network, and the similarity between the speech signal and the speech to be tested is obtained by using the twin network for discrimination.

[0044]The present invention provides a system based on Siamese network model and KNN classification speaker recognition such as figure 1 shown. On the whole, the present invention comprises two major steps: Step S1: use the microphone to collect the voice information of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speaker recognition method based on a twin network model and a KNN (K-nearest neighbor)algorithm. The speaker recognition method based on the twin network model and the KNNalgorithmcomprises the steps that S1, voice information of a speaker is collectedby using a microphone and taken as a data set to train an RNN network model; and S2, the speaker is identified by using atrained RNN network model to construct the twin network model and combining the KNN algorithm. By adopting the technical scheme of the speaker recognition method based on the twin network model and the KNNalgorithm, the data set of the speaker in a database is trained, it is ensured that input of each speech signal inputted into a twinnetwork can output the characteristics representing the speaker, distances between different output characteristic vectors are calculated by cosine distance and used in the KNN algorithm for determining whether the speech signals belong to the same speaker or not, the speaker can be identified with asmall amount of samples, the network does not need to be retrained as the number of speakers increases, the requirement of data sample sizeofa neural network isreduced, and meanwhileinstantaneity and accuracy of speaker recognition are effectively improved.

Description

technical field [0001] The invention belongs to the technical field of human-computer interaction, in particular to the technical field of speaker recognition, and specifically designs a speaker recognition method based on a twin network model and a KNN algorithm. Background technique [0002] In the field of human-computer interaction, with the rapid development of artificial intelligence, pattern recognition and other technologies, the interaction between humans and computers is getting closer and closer. The traditional contact interaction methods can no longer meet people's needs. The interactive way of communication habits has become a research hotspot in recent years. As one of the main channels of human-computer interaction, speaker recognition has gradually become an important research topic in the field of interaction. [0003] Existing methods for speaker recognition mainly include methods of speech feature extraction and template matching technology, methods of s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/00G10L17/02G10L17/04G10L25/27
CPCG10L17/00G10L17/02G10L17/04G10L25/27
Inventor 张莉李文钧李竹
Owner HANGZHOU DIANZI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products