Speaker recognition systems

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for identifying systems and speakers, applied in speech analysis, instruments, etc., can solve the problems of HMM model that does not allow model-to-model comparison, cannot be easily world model/impersonation group, cannot be effectively analyzed and used, etc.

Inactive Publication Date: 2004-11-03

SECURIVOX

View PDF0 Cites 15 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Third, HMM models inherently do not allow efficient model-to-model comparisons

Therefore, the important structural details contained in the enrolled speaker data set cannot be efficiently analyzed and used to improve the performance of the system

Fourth, HMM technology uses temporal information to construct models, so it is vulnerable to pretenders who imitate other people's voices by temporarily changing pitch, etc.

Fifth, the required speaker cannot easily optimize the world model / fake cohort used by the system for the purpose of testing utterances

However, the mean of the true speaker distribution will be considerably lower than the mean of the impostor distribution tested with the same model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0035]In order to provide an improved speaker recognition (authentication and / or discrimination) system, the present invention comprises various aspects and features which can be combined in various ways. Some of these aspects relate to the way speech samples are modeled during speaker registration and subsequent recognition of input speech samples. Other aspects concern how the input speech model is classified in order to enable decisions about speaker identity. Yet another aspect concerns the normalization of the speech signal input to the speaker recognition system (channel normalization). Another aspect concerns the application of speaker recognition systems.

[0036] now refer to Figure 4 to Figure 6 The basic structure used by the system incorporating the different aspects of the invention is described. It will be understood that the input to the embodiments of the invention described herein are all digital signals including speech samples, which have been previously...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Speaker recognition (identification and / or verification) methods and systems, in which speech models for enrolled speakers consist of sets of feature vectors representing the smoothed frequency spectrum of each of a plurality of frames and a clustering algorithm is applied to the feature vectors of the frames to obtain a reduced data set representing the original speech sample, and wherein the adjacent frames are overlapped by at least 80%. Speech models of this type model the static components of the speech sample and exhibit temporal independence. An identifier strategy is employed in which modelling and classification processes are selected to give a false rejection rate substantially equal to zero. Each enrolled speaker is associated with a cohort of a predetermined number of other enrolled speakers and a test sample is always matched with either the claimed identity or one of its associated cohort. This makes the overall error rate of the system dependent only on the false acceptance rate, which is determined by the cohort size. The false error rate is further reduced by use of multiple parallel modelling and / or classification processes. Speech models are normalised prior to classification using a normalisation model derived from either the test speech sample or one of the enrolled speaker samples (most preferably from the claimed identity enrolment sample).

Description

technical field [0001] The present invention relates to a system, method and apparatus for performing speaker recognition. Background technique [0002] Speaker recognition includes the related fields of speaker verification and speaker identification. Its main purpose is to confirm the speaker's claimed identity from his / her voice, which is called authentication, or to identify the speaker from his / her voice, which is called discrimination. Both of these use the human voice as a biometric measure and infer the unique relationship between a voice and the person who produced it. This unique relationship enables verification and discrimination. Speaker recognition techniques analyze test utterances and compare them to known templates or models for the person being identified or authenticated. The efficiency of this system depends on the quality of the algorithms used in the processing. [0003] Speaker recognition systems have a variety of applications. According to anoth...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L17/02G10L17/12G10L17/20

CPCG10L17/12G10L17/20G10L17/02

Inventor 安德鲁·托马斯·萨佩利克

Owner SECURIVOX

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speaker recognition systems

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology