A Speaker Recognition Method Based on Mutual Information Estimation
A speaker recognition and mutual information technology, applied in the field of speaker recognition, can solve the problem of inability to judge the uniqueness of the speaker's feature representation, and achieve the effect of reducing EER and optimizing network training.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0034] The technical solution adopted by the present invention is a method for speaker identification based on mutual information estimation, which comprises the following steps:
[0035] Step 1. Preprocess all the voices in the dataset and extract spectrogram features;
[0036] Step 2. In the training phase, the spectrogram is first extracted from the speech and used as the input of the VGG-M network; then random triplet sampling is performed on the training data to obtain positive and negative sample pairs; finally, positive and negative sample pairs are obtained. Perform mutual information estimation, and use the objective function based on mutual information estimation to perform network training and update network parameters;
[0037] Step 3, using the trained VGG-M network to extract the embedded feature vector representing the speaker identity feature corresponding to the test voice and the target speaker voice;
[0038] Step 4. Calculate the cosine distance between th...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


