Method and apparatus for training voiceprint recognition system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of voiceprint recognition and computer system, which is applied in the field of training voiceprint recognition systems, can solve the problems of low accuracy rate and low improvement of voiceprint recognition accuracy rate, and achieve the effect of improving accuracy rate

Active Publication Date: 2017-01-04

TENCENT TECH (SHENZHEN) CO LTD +1

View PDF8 Cites 5 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In order to solve the problem in the prior art that the accuracy of voiceprint recognition is low by using identity vectors processed by LDA, an embodiment of the present invention provides a method and device for training a voiceprint recognition system

[0007] Since the regularization matrix determined by the computer system maximizes the sum of the first numerical values of each category, the identity vectors of the voices of different segments of the same user are improved after being regularized by the regularization matrix, which solves the problem of using LDA in related technologies. The problem of the low degree of improvement in the accuracy of voiceprint recognition for the processed identity vector improves the accuracy of voiceprint recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0021] figure 1 It is a flowchart of a method for training a voiceprint recognition system provided in an embodiment of the present invention, such as figure 1 As shown, the method for training the voiceprint recognition system may include the following steps:

[0022] In step 101, the computer system determines the identity vector of each segment of speech in the speech training set, and divides the identity vectors of the same user's speech in the determined identity vector into one category.

[0023] Step 102, the computer system establishes a first function for calculating the first value corresponding to each category, the first value is the first identity vector in the corresponding category after regularization using the regularization matrix and the first identity vector in the corresponding category after regularization using the regularization matrix The sum of similarities between other identity vectors of , the random variable of the first function is a regular ma...

Embodiment 2

[0028] Figure 2A is a flow chart of a method for training a voiceprint recognition system provided in another embodiment of the present invention, as Figure 2A As shown, the method for training the voiceprint recognition system may include the following steps:

[0029] Step 201, the computer system determines the identity vector of each segment of speech in the speech training set, and classifies the identity vectors of the same user's speech in the determined identity vector into one category.

[0030] Generally speaking, for a user, at least two speeches of the user are recorded or collected, and these recorded or collected speeches are added to the speech training set, and the speech training set includes at least two speeches of the user.

[0031] Further, the computer system processes each segment of speech in the speech training set, generates an identity vector indicating the identity information of the person who entered the speech, and divides the identity vector o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method and apparatus for training voiceprint recognition system, belonging to the technical field of voiceprint recognition. The method comprises a step of determining the identity vector of each section of speech in a speed training set, and dividing the identity vectors of the speech of a same user in the determined identity vectors into a category, a step of establishing a first function for calculating the first value corresponding to each category, wherein the first value is the sum of the similarities of a first identity vector normalized by using a normalization matrix in a corresponding category and other identity vectors normalized by using the normalization matrix in the corresponding category, a step of determining the normalization matrix which allows the sum of the first value of each category to be maximum, and a step of normalizing the identity vector of the speech obtained in the voiceprint recognition system by using the determined normalization matrix. The problem of the low improvement of the accuracy of the voiceprint recognition by using an identity vector which is subjected to linear discrimination analysis processing in the related technology is solved, and the accuracy of the voiceprint recognition is improved.

Description

technical field [0001] The invention relates to the technical field of voiceprint recognition, in particular to a method and device for training a voiceprint recognition system. Background technique [0002] Voiceprint recognition is a kind of biometric recognition technology. By processing the voice, an identity vector indicating the identity information of the voice inputter can be generated. The two voices can be determined by calculating the similarity between the identity vectors of the two voices. Whether the voice inputter is the same user. [0003] Speech is easily disturbed by channel variability and environment variability, resulting in distortion of its identity vector. In related technologies, it is assumed that the distribution of several segments of speech of the same user in space is a multidimensional Gaussian distribution, and the identity vector is processed by using linear discriminant analysis (English: linear discriminant analysis, LDA), which compensat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/04G10L17/02G10L17/14

CPCG10L17/02G10L17/04G10L17/14G10L17/08G10L17/00G10L17/22

Inventor 李为钱柄桦金星明李科吴富章吴永坚黄飞跃

Owner TENCENT TECH (SHENZHEN) CO LTD

Method and apparatus for training voiceprint recognition system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology