Method for recognizing speaker based on conversion of neutral and affection sound-groove model

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speaker recognition and model conversion technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems affecting system recognition performance and achieve the effect of improving recognition rate

Inactive Publication Date: 2008-07-23

ZHEJIANG UNIV

View PDF0 Cites 44 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Traditional speaker recognition methods require users to provide neutral speech for user model training and user testing, but in daily life, people's speech will be affected by their own emotional fluctuations, which will affect the recognition performance of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0011] The present invention will be further introduced below in conjunction with accompanying drawing and embodiment: the method of the present invention is divided into three steps altogether.

[0012] The first step feature extraction

[0013] I. Audio preprocessing

[0014] Audio preprocessing is divided into three parts: sampling quantization, zero drift removal, pre-emphasis and windowing.

[0015] A), sampling quantization

[0016] Filter the audio signal with a sharp cut-off filter so that its Nyquist frequency FN is 4KHZ;

[0017] Set the audio sampling rate F=2FN; the audio signal sa(t) is sampled periodically to obtain the amplitude sequence of the digital audio signal s ( n ) = sa ( n F ) ;

[0018] Quantize and code s(n) with pulse code modulation (PCM) to obtain the quantized representation...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a speaker identification method based on neutralization and sound-groove model conversion, the steps comprises (1) extracting voice feature, firstly conducting voice frequency pre-treating which is divided into three parts of sample-taking quantification, zero drift elimination, then extracting reverse spectrum signature MFCC, (2) building emotion model library, conducting Gaussian compound model training, training neutral model according to the neutral voice training of the users, conducting neutralization-emotion model conversion and obtaining emotion voice model by algorithm approach of neutralization-emotion voice conversion and (3) scoring for the voice test to identify the speakers. The invention has the advantages that the technique uses the algorithm approach of neutralization-emotion model conversion to increase the identification rate of the emotive speaker identifying. The technique trains out emotion voice model of the users according to the neutralization voice model of the users and increases the identification rate of the system.

Description

technical field [0001] The invention relates to biological feature recognition technology, and mainly relates to a speaker recognition method based on conversion of neutral and emotional voiceprint models. Background technique [0002] Biometric authentication technology uses people's own physical characteristics as the basis for identity authentication, which is fundamentally different from traditional authentication technologies based on "what you have" or "what you know", and truly uses people themselves as the basis for identity authentication , who truly represent themselves. Among them, the technology of identity authentication based on human voice is called speaker recognition technology. [0003] Speaker recognition is divided into two steps: user model training and user voice testing. During the training process, the user is required to provide a user model for voice training and user identity matching. During the test, the user is required to provide voice for i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/00G10L15/02G10L15/06G10L15/08G10L17/02G10L17/04G10L25/24

Inventor吴朝晖杨莹春单振宇

OwnerZHEJIANG UNIV

Method for recognizing speaker based on conversion of neutral and affection sound-groove model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology