Method for recognizing speaker based on multivariate kernel logistic regression model

A regression model and its implementation method, applied in speech analysis and related fields, that addresses slow speed, low recognition rate, and complex model construction.

Status: Inactive; Publication Date: 2011-11-23

ZHEJIANG UNIV OF TECH

Cites: 7; Cited by: 0


Problems solved by technology

[0005] To overcome the low recognition rate, complex model construction, and slow speed of existing speaker identification methods, the present invention provides a speaker identification method based on a multivariate kernel logistic regression model that offers a high recognition rate, simple model construction, and good speed.



Embodiment Construction

[0042] The present invention will be further described below.

[0043] A method for implementing speaker discrimination based on a multivariate kernel logistic regression model, comprising the following steps:

[0044] A) Speaker speech feature extraction: collect the speech signal of the speaker to be identified and preprocess it; then extract the Mel cepstrum parameters. The Mel cepstrum parameters are 13th-order cepstral parameters; the zeroth-order coefficient, which only weakly describes the speaker's individual characteristics, is removed, and the remaining 12-dimensional feature vector is used as the speaker identification input vector;

[0045] B) Speaker model construction: a multivariate kernel logistic regression model is used as the speaker identification model,

[0046] $p(c_i = k \mid \bar{x}; \ldots$
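Steps A and B can be illustrated with a small sketch. It is not the patent's implementation: the source formula is truncated, so the posterior below uses one common formulation of multiclass kernel logistic regression (a softmax over per-speaker kernel expansions). The RBF kernel, the `gamma` value, the function names, and every number are assumptions chosen for illustration, and the example inputs are placeholders rather than real MFCC data.

```python
import math
from typing import List, Sequence


def drop_zeroth_coefficient(frames: Sequence[Sequence[float]]) -> List[List[float]]:
    """Step A: reduce 13 cepstral coefficients (c0..c12) to 12 (c1..c12) per frame."""
    out = []
    for frame in frames:
        if len(frame) != 13:
            raise ValueError(f"expected 13 coefficients per frame, got {len(frame)}")
        out.append(list(frame[1:]))
    return out


def rbf_kernel(x: Sequence[float], z: Sequence[float], gamma: float = 0.5) -> float:
    """Gaussian (RBF) kernel; the kernel choice is an assumption, not from the source."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


def speaker_posteriors(
    x: Sequence[float],
    train_vectors: Sequence[Sequence[float]],
    alphas: Sequence[Sequence[float]],  # alphas[k][j]: weight of training vector j for speaker k
    biases: Sequence[float],
    gamma: float = 0.5,
) -> List[float]:
    """Step B (assumed form): softmax over per-speaker kernel expansions."""
    k_vals = [rbf_kernel(x, t, gamma) for t in train_vectors]
    scores = [b + sum(a * kv for a, kv in zip(row, k_vals))
              for row, b in zip(alphas, biases)]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Placeholder data: two 13-coefficient frames and a toy two-speaker model.
raw_frames = [[float(i) for i in range(13)], [float(10 * i) for i in range(13)]]
features = drop_zeroth_coefficient(raw_frames)

toy_train = [[0.0, 0.0], [1.0, 1.0]]
toy_alphas = [[2.0, -2.0], [-2.0, 2.0]]
toy_biases = [0.0, 0.0]
probs = speaker_posteriors([0.0, 0.0], toy_train, toy_alphas, toy_biases)
```

In this toy model the query point coincides with the first training vector, so the first speaker receives the larger posterior. The toy model uses 2-dimensional vectors for readability; a model following the patent's description would take the 12-dimensional vectors produced by step A.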


Abstract

The invention discloses a method for recognizing a speaker based on a multivariate kernel logistic regression model, comprising the following steps: (A) extracting the speaker's voice features: collecting and preprocessing the voice signals of the speaker to be recognized, then extracting Mel cepstrum parameters; (B) constructing a speaker model: using a multivariate kernel logistic regression model as the speaker recognition model; (C) training the speaker recognition model: using the feature vectors extracted in step A as input training samples and iteratively training with a sequential minimal optimization algorithm to optimize the model parameters; (D) recognizing the speaker: extracting the feature vectors of the voice signals of the speaker to be recognized and inputting them to the trained speaker recognition model, which outputs a posterior probability for each speaker, the speaker with the highest probability being the recognition result. The invention offers a high recognition rate, simple model construction, and good speed.
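The decision rule in step D can be sketched as follows. The abstract does not say how posteriors from several speech frames are combined, so summing per-frame log-posteriors is an assumption made here for illustration; the function name, the speaker labels, and the probability values are likewise hypothetical.

```python
import math
from typing import Sequence


def identify_speaker(
    frame_posteriors: Sequence[Sequence[float]],
    speakers: Sequence[str],
) -> str:
    """Return the speaker with the highest combined posterior.

    frame_posteriors[t][k] is the model's posterior for speaker k on frame t.
    Frames are combined by summing log-probabilities (an assumed rule).
    """
    log_scores = [0.0] * len(speakers)
    for posterior in frame_posteriors:
        if len(posterior) != len(speakers):
            raise ValueError("each frame needs one posterior per speaker")
        for k, p in enumerate(posterior):
            # Floor tiny probabilities so log() never receives zero.
            log_scores[k] += math.log(max(p, 1e-12))
    best = max(range(len(speakers)), key=lambda k: log_scores[k])
    return speakers[best]


# Hypothetical posteriors for three frames and three enrolled speakers.
example_posteriors = [
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
    [0.5, 0.4, 0.1],
]
result = identify_speaker(example_posteriors, ["spk_a", "spk_b", "spk_c"])
```

With a single frame this reduces to picking the highest posterior, as the abstract describes. Here `spk_a` wins two of the three frames, yet `spk_b` has the larger product of posteriors (0.084 versus 0.06) and is selected, which shows how the combination rule can matter.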

Description

Technical field

[0001] The invention relates to the fields of signal processing, machine learning, and pattern recognition, and in particular to a method for implementing speaker identification.

Background

[0002] Speaker identification means automatically determining, by analyzing and processing a speaker's voice signal and extracting features, whether the speaker belongs to a specified closed set of speakers, and then confirming the speaker's specific identity. The basic principle of speaker identification is to build, for each speaker, a classification model that describes that speaker's individual characteristics. Good model construction is therefore one of the key technologies for speaker identification.

[0003] Traditional speaker identification models include generative models such as the Gaussian Mixture Model (GMM) and the Hidden Markov Model (HMM). Although these models can achieve good recognition efficiency, a large number of training samples are required to...


Application Information


Patent Type & Authority: Patents (China)

IPC(8): G10L17/00, G10L17/04

Inventors: 王万良, 郑建炜, 郑泽萍, 韩姗姗, 蒋一波, 王震宇, 王磊, 陈胜勇

Owner: ZHEJIANG UNIV OF TECH
