Vocal print identification method, electronic device and computer readable storage medium

A voiceprint recognition and computer program technology, applied in the electronic field, can solve the problem of low accuracy of voiceprint recognition and achieve the effect of real-time evaluation of false positive rate

Active Publication Date: 2018-11-16
SHENZHEN UNIV
View PDF13 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Due to the complexity of the actual situation, different types of recognition subsystems in the prior art may not necessaril

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vocal print identification method, electronic device and computer readable storage medium
  • Vocal print identification method, electronic device and computer readable storage medium
  • Vocal print identification method, electronic device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0056] Example one

[0057] The embodiment of this application provides a voiceprint recognition method, please refer to Figure 1-a , The voiceprint recognition method mainly includes the following steps:

[0058] 101. Acquire voice data to be analyzed;

[0059] The embodiment of the present invention is applied to a voiceprint recognition system. The voiceprint recognition system includes K subsystems, where K is an integer greater than zero. The system architecture of the voiceprint recognition system of the embodiment of the present invention can refer to Figure 1-b .

[0060] Wherein, each subsystem in the voiceprint recognition system may correspond to different types of voiceprint recognition, and the types of voiceprint recognition include: emotion recognition, age recognition, and language recognition. Further, each subsystem may also correspond to each sub-category in a recognition scenario. For example, in speech recognition, a subsystem corresponds to a language (such as ...

Example Embodiment

[0096] Example two

[0097] In the embodiment of the present invention, the error-prone point classifier needs to be constructed, please refer to Figure 1-c Methods include:

[0098] 201. Establish a training database;

[0099] Take the short-term speech data set as the test data set of each subsystem, and mark all misjudged speech segments in the test process as N different labels according to different subsystems, as the training database, where N is an integer greater than zero .

[0100] 202. Extract MFCC Mel frequency cepstrum coefficient characteristics;

[0101] For each short-term speech data in the training database, Mel Frequency Cepstrum Coefficient (MFCC) features are extracted.

[0102] 203. Training overall change matrix;

[0103] According to the extracted MFCC features, the Universal Background Model (UBM) is trained, and the overall change matrix T is trained.

[0104] 204. Obtain the change factor characteristics of the short-term voice data;

[0105] Obtain the change f...

Example Embodiment

[0114] Example three

[0115] The embodiment of the present invention takes a hybrid system for language recognition as an example to describe in detail the voiceprint recognition method in the embodiment of the present invention, including:

[0116] 1. The architecture of the hybrid system for language recognition in the embodiment of the present invention can be referred to Figure 1-b , Each subsystem independently gives probability values ​​of N different languages.

[0117] 2. Let x be a certain input voice, the output of each subsystem is shown in the following table:

[0118] Language code

[0119] P f (L j |x)(i=1,2,...,N) Each subsystem independently gives a certain input speech belonging to a certain language L j (j = 1, 2,..., N), and the sum of all probabilities is also 1, namely:

[0120]

[0121] Arrange the probabilities of all languages ​​given by the K subsystems into a matrix:

[0122]

[0123] 3. For the operation process of executing the dynamic weight sub-module, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a vocal print identification method, an electronic device and a computer readable storage medium. The vocal print identification method comprises the steps of: obtainingvoice data to be analyzed; extracting change factor features in the voice data; performing misjudgment classification of the voice data according to the change factor features through a fallible point classifier to obtain the relative misjudgment probability of the voice data being misjudged in K sub systems; determining the offset of the relative misjudgment probability corresponding to any onesub system and the average relative misjudgment probability of the K sub systems, and calculating the final fusion weights of the corresponding sub systems according to the offset; and performing weighing of the identification result of each sub system according to the final fusion weight, and obtaining an integrated identification result according to the identification result of each sub system after weighing.

Description

technical field [0001] The present application relates to the field of electronic technology, and in particular to a voiceprint recognition method, an electronic device, and a computer-readable storage medium. Background technique [0002] With the popularity of smart devices and related hardware facilities, voice interaction has become an indispensable part of human-computer interaction. There are more and more application scenarios related to voiceprint in voice interaction, including but not limited to: voiceprint attendance check-in, software login, bank transfer and account opening verification, wake-up of virtual voice assistant, personalized interaction for different user groups Wait, these systems all use voiceprint without exception. The so-called voiceprint refers to the unique voice characteristics of each person. In real life, everyone's voice has its own characteristics when speaking. Generally speaking, voiceprint recognition is divided into the following ca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L17/06G10L25/24G10L25/51
CPCG10L17/06G10L25/24G10L25/51
Inventor 郑能恒林吉
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products