Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for fast identifying speaker based on comparing ordinal number of archor model space projection

A technology for speaker confirmation and spatial projection, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as lack of rationality, and achieve the effects of simplifying the training process, enhancing reliability, extensive security and adaptability

Inactive Publication Date: 2009-12-16
ZHEJIANG UNIV
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the speaker identification technology based on the anchor model still has many deficiencies, and the method of directly comparing model scores is unreasonable.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for fast identifying speaker based on comparing ordinal number of archor model space projection
  • Method for fast identifying speaker based on comparing ordinal number of archor model space projection
  • Method for fast identifying speaker based on comparing ordinal number of archor model space projection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] The present invention will be further described below in conjunction with embodiment. The method of the present invention is divided into six steps.

[0014] Step 1: Audio Preprocessing

[0015] Audio preprocessing is divided into three parts: sampling quantization, zero drift removal, pre-emphasis and windowing.

[0016] 1. Sampling and quantization

[0017] A), filter the audio signal with a sharp cut-off filter to make its Nyquist frequency F N 4KHZ;

[0018] B), set the audio sampling rate F=2F N ;

[0019] C), for audio signal s a (t) Sampling by period to obtain the amplitude sequence of the digital audio signal s ( n ) = s a ( n F ) ;

[0020] D), s(n) is quantized and coded by pulse code modulation (PCM), and the quantized representation s'(n) of the amplitud...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a fast speaker confirming method based on the ordinal number comparison of anchor model spatial projection, firstly making anchor model mapping on the test voice, and then making ordinal number comparison between the mapped test voice and the speaker declared by the test voice. The anchor model mapping: firstly extracting the characteristics of the test voice to obtain an eigenvector sequence, then estimating the probability density of each Gauss mixed model in the anchor model and the background model to obtain a mapped score vector. And the ordinal number comparison arranges the scores in the vector components and compares the score ordinal numbers of the test voice and the declared speaker and calculates Euclidian distance between the ordinal numbers, and finally compares the distance with a threshold value to obtain the final result. The invention has wider safety and adaptivity.

Description

technical field [0001] The invention relates to a biometric technology, mainly a fast speaker confirmation method based on comparison of anchor model space projection ordinal numbers. Background technique [0002] Biometric identification technology refers to a technology that uses the human's own physiological or behavioral characteristics to identify the identity through the computer. Based on behavioral characteristics (voice, keystroke, gait, signature, etc.), the powerful functions of computers and network technology are used for image processing and pattern recognition to identify people's identities. Speaker recognition technology is a technology that automatically identifies the identity of the speaker based on the speech parameters that reflect the speaker's physiological and behavioral characteristics in the speech. Speaker recognition is based on speech, which includes not only human physiological characteristics, that is, innate anatomical differences, but also ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/00G10L15/00G10L17/16
Inventor 杨莹春吴朝晖杨旻
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products