Training device of voiceprint recognition model

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voiceprint recognition and training device technology, applied in biological neural network models, speech analysis, instruments, etc., can solve problems such as complex channel differences, achieve good learning effects, and improve recall and accuracy

Active Publication Date: 2021-01-22

SOUTHWEST UNIVERSITY OF POLITICAL SCIENCE AND LAW +1

View PDF6 Cites 8 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] Of course, due to the personal information of the speaker such as gender and dialect contained in the voice, it appears as a different frequency distribution on the spectrum, and the channel difference is mainly reflected in the change in the frequency domain, so information such as gender and dialect will make the channel difference more complicated.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0036]Such asfigure 1As shown, a training device for a voiceprint recognition model of the present invention includes a sample collection and processing module (not shown in the figure), a feature input module 1, a feature extractor 2, a pooling layer 3, a speaker classifier 4, and a domain Classifier 5, gender classifier 6, dialect classifier 7 and optimization processing module (not shown in the figure), of which:

[0037]The sample acquisition and processing module is used to collect two-channel voice samples for voiceprint recognition and comparison training. The voice samples collected in one channel are labeled with feature labels according to the sample object, and the voice samples collected in the other channel are not Annotate feature labels, and pass the processed voice samples to feature input module 1;

[0038]The feature input module 1 is used to extract heuristic phonetic features and MFCC features from each voice sample, and merge the two to form input features and output ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

According to the training device of the voiceprint recognition model, the phonetic features containing the identity information of the speaker are extracted as the input features, multi-task trainingis carried out by using labels such as the gender of the speaker, the cross-channel problem is solved in combination with an adversarial training method, and finally, the stable features reflecting the identity nature of the speaker are extracted. According to the invention, the linguistic characteristics and the deep neural network are combined to simulate the learning mechanism of the human brain, so that the extraction capability, stability and interpretability of the identity essential characteristics of the speaker are improved, and finally, the accuracy and recall rate of automatic voiceprint recognition are improved.

Description

Technical field[0001]The invention relates to the field of automatic voiceprint recognition, in particular to a training device for a voiceprint recognition model for judicial voice evidence evaluation mode.Background technique[0002]In the task of speaker identity identification in the field of forensic speech, the current mainstream identification methods in China are based on several dimensions such as watching, listening, and testing, and rely on the personal experience of voiceprint experts. This method is time-consuming, labor-intensive, and contains the subjective judgment of appraisal experts, and cannot be quickly promoted in a larger group of practitioners. In addition, limited by the characteristics of this type of method, it can only be suitable for small-scale inspection materials and sample scenarios. When the inspection materials and samples to be compared are hundreds, thousands or more, voiceprint identification experts are not enough to deal with Such a huge task. F...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/00G10L17/02G10L17/04G10L17/14G10L17/18G10L25/24G10L25/69G06N3/04G06N3/08

CPCG10L17/02G10L17/04G10L17/14G10L17/18G10L25/24G10L25/69G06N3/049G06N3/08G06N3/045

Inventor张翠玲谭铁君李稀敏杨东升叶志坚肖龙源

OwnerSOUTHWEST UNIVERSITY OF POLITICAL SCIENCE AND LAW

Training device of voiceprint recognition model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology