Speaker confirmation method and speaker confirmation device used in short voice condition

A speaker confirmation technology, applied in the field of speaker confirmation, that addresses the decline in system recognition rate and performance under short speech and users' unwillingness to speak for a long time, improving recognition performance and reducing time and space complexity.

Inactive; Publication Date: 2016-08-10

SPEAKIN TECH CO LTD

3 Cites, 21 Cited by


Problems solved by technology

1. Terrorists or monitored subjects, wary of interception, often speak very briefly, sometimes only two or three words. Although text-dependent techniques are effective for short-speech speaker recognition, they cannot be applied in such situations.

2. Because of problems in the call transmission channel, many phone calls are of very poor quality, with serious voice interruptions. The usual solution is to remove the discontinuous speech segments that carry little or no speaker information, which inevitably shortens the effective speech.

3. When handling multiple speakers, because current voice segmentation technology is immature or voices overlap, the affected speech has to be cut out before it is sent to the recognizer, which inevitably shortens the effective speech.

4. In some commercial settings, users are unwilling to speak for long.

[0013] At present, the Gaussian mixture model (GMM) system is the most commonly used speaker recognition system. It is based on statistical modeling and requires the training and test speech to reach a certain length; otherwise system performance drops sharply. In other words, the recognition rate falls sharply for short speech.
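As a rough illustration of why a GMM system depends on speech length, the sketch below scores a sequence of feature frames against a diagonal-covariance GMM by averaging the per-frame log-likelihood. The diagonal covariance, the function names, and the `(weight, mean, variance)` tuple layout are assumptions made for this example and are not taken from the patent. Because the score is an average over frames, a short utterance rests on very few samples.

```python
import math
from typing import List, Sequence, Tuple

# One mixture component: (weight, mean vector, per-dimension variance).
# This layout is a hypothetical choice for the example.
Component = Tuple[float, Sequence[float], Sequence[float]]


def log_gaussian_diag(x: Sequence[float], mean: Sequence[float],
                      var: Sequence[float]) -> float:
    """Log density of a Gaussian with diagonal covariance at point x."""
    total = 0.0
    for xi, mi, vi in zip(x, mean, var):
        total += math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi
    return -0.5 * total


def frame_log_likelihood(x: Sequence[float], gmm: List[Component]) -> float:
    """Log of the weighted sum of component densities (log-sum-exp for stability)."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def average_log_likelihood(frames: List[Sequence[float]],
                           gmm: List[Component]) -> float:
    """Mean per-frame log-likelihood of an utterance under one speaker model."""
    return sum(frame_log_likelihood(x, gmm) for x in frames) / len(frames)
```

A claimed speaker's model would typically be compared against a threshold or a competing model on this score; how the patent makes that decision is not stated in this excerpt.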



Embodiment Construction

[0055] Specific embodiments of the present invention are further described below with reference to the accompanying drawings.

[0056] In the speaker recognition method, a few seconds of speech generate a large amount of data once the speech signal has been preprocessed. Extracting speaker feature parameters is in effect a process of removing redundant information from the original speech and reducing the amount of data. Linear Prediction Cepstral Coefficients (LPCC) and Mel Frequency Cepstral Coefficients (MFCC) are the two most commonly used feature parameters in speaker identification. The former models the vocal tract, and the latter models human auditory perception. However, both features consider only the information within a speech frame, not the information between speech frames. Because the speech signal is time-sequential, obtaining time-varying information between speech frames can improve the perfor...
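The between-frame information mentioned above is commonly captured with Delta coefficients. The sketch below uses the standard regression formula over neighbouring frames, assuming a window of two frames on each side and edge padding by repeating the first and last frame. The patent's exact Delta definition is cut off in this excerpt, so the formula, the window size, and the function names are assumptions.

```python
from typing import List

Frame = List[float]


def delta_features(frames: List[Frame], window: int = 2) -> List[Frame]:
    """Regression-based delta coefficients over +/- `window` neighbouring frames.

    Frames requested beyond either end are replaced by the nearest edge frame.
    """
    if not frames:
        return []
    last = len(frames) - 1
    # Normalising constant of the regression formula: 2 * sum(n^2).
    norm = 2.0 * sum(n * n for n in range(1, window + 1))
    deltas = []
    for t in range(len(frames)):
        row = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = frames[min(t + n, last)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / norm)
        deltas.append(row)
    return deltas


def append_deltas(frames: List[Frame]) -> List[Frame]:
    """Attach each frame's delta coefficients to its static coefficients."""
    return [f + d for f, d in zip(frames, delta_features(frames))]
```

For a coefficient that rises by a constant amount per frame, the delta in the interior of the sequence equals that per-frame increase, which is a quick sanity check on the formula.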


Abstract

The invention relates to a speaker confirmation method and a speaker confirmation device for short-speech conditions. The method comprises the steps of: extracting linear prediction cepstral coefficients, Mel frequency cepstral coefficients and Delta features from a target speech signal; combining the linear prediction cepstral coefficients, the Mel frequency cepstral coefficients and the Delta features to obtain a plurality of effective feature vectors; reducing the dimensionality of the effective feature vectors by means of a local fuzzy PCA method; and modeling the dimension-reduced effective feature vectors with a Gaussian mixture model to identify the speaker of the target speech signal. Compared with the prior art, the method and device replace a single feature with a feature combination, which increases the dimensionality of the effective features and compensates for the shortage of feature samples. In addition, local fuzzy PCA performs effective dimension reduction on the combined features, reducing the time and space complexity of the system with little effect on the recognition rate.
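The combination and dimension-reduction steps can be sketched roughly as follows. The sketch concatenates per-frame LPCC, MFCC and Delta vectors and then applies ordinary PCA, computed by power iteration, as a stand-in: it is not the local fuzzy PCA named in the abstract, whose details are not given in this excerpt. All function names and the choice of power iteration are assumptions made for illustration.

```python
import math
from typing import List

Vector = List[float]


def combine(lpcc: Vector, mfcc: Vector, delta: Vector) -> Vector:
    """Concatenate the per-frame feature vectors into one combined vector."""
    return list(lpcc) + list(mfcc) + list(delta)


def mat_vec(matrix: List[Vector], vec: Vector) -> Vector:
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, vec)) for row in matrix]


def covariance(data: List[Vector], mean: Vector) -> List[Vector]:
    """Sample covariance matrix of the rows of `data`."""
    dim = len(mean)
    cov = [[0.0] * dim for _ in range(dim)]
    for row in data:
        centred = [x - m for x, m in zip(row, mean)]
        for i in range(dim):
            for j in range(dim):
                cov[i][j] += centred[i] * centred[j] / (len(data) - 1)
    return cov


def top_components(cov: List[Vector], k: int, iterations: int = 200) -> List[Vector]:
    """Approximate the k leading eigenvectors by power iteration with deflation."""
    dim = len(cov)
    work = [row[:] for row in cov]
    components = []
    for c in range(k):
        start = [1.0 / (i + 1 + c) for i in range(dim)]  # fixed start vector
        scale = math.sqrt(sum(x * x for x in start))
        vec = [x / scale for x in start]
        for _ in range(iterations):
            nxt = mat_vec(work, vec)
            norm = math.sqrt(sum(x * x for x in nxt))
            if norm < 1e-12:  # no variance left to extract
                break
            vec = [x / norm for x in nxt]
        eigval = sum(v * w for v, w in zip(vec, mat_vec(work, vec)))
        for i in range(dim):
            for j in range(dim):
                work[i][j] -= eigval * vec[i] * vec[j]  # remove found direction
        components.append(vec)
    return components


def reduce_dimension(data: List[Vector], k: int) -> List[Vector]:
    """Project centred feature vectors onto the k leading principal components."""
    mean = [sum(col) / len(data) for col in zip(*data)]
    components = top_components(covariance(data, mean), k)
    return [[sum((x - m) * c for x, m, c in zip(row, mean, comp))
             for comp in components] for row in data]
```

The reduced vectors would then be the input to GMM training and scoring. Plain PCA uses one global projection for all data, so it only approximates the role that the local fuzzy variant plays in the described method.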

Description

Technical field

[0001] The invention belongs to the technical field of voice recognition, and in particular relates to a speaker confirmation method and device for short-speech conditions.

Background art

[0002] As speaker recognition technology moves toward practical application, the following situations are often encountered:

1. Terrorists or monitored subjects, wary of interception, often speak very briefly, sometimes only two or three words. Although text-dependent techniques are effective for short-speech speaker recognition, text-dependent speaker recognition cannot be used in such situations.

2. Because of problems in the call transmission channel, many phone calls are of very poor quality, with serious voice interruptions. The usual solution is to remove the discontinuous speech segments that contain almost no speaker information, or from which it can hardly be extracted, so the effective speech inevitably becomes shorter. ...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L17/00, G10L17/02, G10L17/04

CPC: G10L17/00, G10L17/02, G10L17/04

Inventor: 陈昊亮

Owner: SPEAKIN TECH CO LTD
