Method for fast speaker verification based on ordinal-number comparison of anchor model space projections

A speaker verification and spatial projection technology, applied in speech analysis, instruments, and related fields, that can solve problems such as a lack of rationality and achieves the effects of simplifying the training process, overcoming incompleteness, and broadening security and adaptability.

Status: Inactive | Publication Date: 2006-06-14

ZHEJIANG UNIV

0 Cites, 13 Cited by


AI Technical Summary

Problems solved by technology

Speaker identification technology based on the anchor model still has many deficiencies, and the method of directly comparing model scores is unreasonable.



Embodiment Construction

[0013] The present invention is further described below with reference to an embodiment. The method of the invention is divided into six steps.

[0014] Step 1: Audio Preprocessing

[0015] Audio preprocessing is divided into three parts: sampling and quantization; zero-drift removal; and pre-emphasis and windowing.

[0016] 1. Sampling and quantization

[0017] A) Filter the audio signal with a sharp cut-off filter so that its Nyquist frequency $F_N$ is 4 kHz;

[0018] B) Set the audio sampling rate $F = 2F_N$;

[0019] C) Sample the audio signal $s_a(t)$ periodically to obtain the amplitude sequence of the digital audio signal, $s(n) = s_a(n/F)$;

[0020] D) $s(n)$ is quantized and coded by pulse code modulation (PCM), and the quantized representation $s'(n)$ of the amplitud...
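
As a rough illustration of steps B) to D), the following Python sketch samples a continuous-time signal at F = 2F_N and applies linear PCM quantization. It is a minimal sketch under stated assumptions, not the patent's implementation: the 16-bit depth, the amplitude range [-1, 1], the function names, and the 1 kHz test tone are all hypothetical, and the sharp cut-off filter of step A) is not shown.

```python
import math

F_N = 4000       # Nyquist frequency in Hz (step A)
F = 2 * F_N      # sampling rate in Hz (step B)


def sample(s_a, num_samples, rate=F):
    """Step C: s(n) = s_a(n / F) for n = 0 .. num_samples - 1."""
    return [s_a(n / rate) for n in range(num_samples)]


def quantize_pcm(samples, bits=16):
    """Step D, assuming linear PCM: map amplitudes in [-1, 1] to signed integers.

    The 16-bit default is an assumption; the source text does not state a bit depth.
    """
    max_code = 2 ** (bits - 1) - 1
    codes = []
    for x in samples:
        clipped = max(-1.0, min(1.0, x))   # guard against out-of-range amplitudes
        codes.append(int(round(clipped * max_code)))
    return codes


def tone(t):
    """Hypothetical input: a 1 kHz sine at half of full scale."""
    return 0.5 * math.sin(2 * math.pi * 1000 * t)


s = sample(tone, 8)        # amplitude sequence s(n)
s_q = quantize_pcm(s)      # quantized sequence s'(n)
```

Here `s_q` holds one integer code per sample; a real front end would read PCM audio from a file or device rather than synthesize it.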


Abstract

The invention relates to a fast speaker verification method based on ordinal-number comparison of anchor model space projections. The test utterance is first mapped into the anchor model space, and the mapped test utterance is then compared by ordinal number with the speaker claimed for that utterance. In the anchor model mapping, features are extracted from the test utterance to obtain a feature vector sequence, and the probability density of each Gaussian mixture model in the anchor model set and of the background model is then estimated to obtain a mapped score vector. In the ordinal comparison, the score components of the vector are ranked, the score ranks of the test utterance and of the claimed speaker are compared by calculating the Euclidean distance between them, and the distance is finally compared with a threshold to obtain the result. The invention offers broader security and adaptability.
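
The ordinal comparison described in the abstract might be sketched in Python as below. This is an illustrative sketch only: it takes the mapped score vectors as given (the anchor model mapping itself is not shown), and the ranking direction (rank 1 for the highest score), the tie handling, the accept-when-distance-is-at-most-threshold rule, and all function names are assumptions not confirmed by the source.

```python
import math


def ordinal_ranks(scores):
    """Replace each score by its ordinal number (1 = highest score, an assumption).

    Ties are broken by position, which the source does not specify.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ranks = [0] * len(scores)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def rank_distance(test_scores, claimed_scores):
    """Euclidean distance between the ordinal-number vectors of two score vectors."""
    r_test = ordinal_ranks(test_scores)
    r_claim = ordinal_ranks(claimed_scores)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(r_test, r_claim)))


def verify(test_scores, claimed_scores, threshold):
    """Accept the identity claim when the rank distance does not exceed the threshold."""
    return rank_distance(test_scores, claimed_scores) <= threshold
```

Because only the ordering of the scores enters the distance, two score vectors that differ in scale but rank the anchor models identically have distance zero, which is one way to read the abstract's objection to comparing raw model scores directly.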

Description

Technical field

[0001] The invention relates to biometric technology, and in particular to a fast speaker verification method based on ordinal-number comparison of anchor model space projections.

Background technique

[0002] Biometric identification technology uses a person's own physiological or behavioral characteristics to establish identity by computer. Based on behavioral characteristics (voice, keystroke, gait, signature, etc.), it applies the capabilities of computers and network technology to image processing and pattern recognition in order to identify people. Speaker recognition technology automatically identifies the speaker from speech parameters that reflect the speaker's physiological and behavioral characteristics. Speaker recognition is based on speech, which includes not only human physiological characteristics, that is, innate anatomical differences, but also ...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L17/16

Inventor: 杨莹春, 吴朝晖, 杨旻

Owner: ZHEJIANG UNIV
