Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker recognition systems

A technology for identifying systems and speakers, applied in speech analysis, instruments, etc., can solve the problems of HMM model that does not allow model-to-model comparison, cannot be easily world model/impersonation group, cannot be effectively analyzed and used, etc.

Inactive Publication Date: 2004-11-03
SECURIVOX
View PDF0 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Third, HMM models inherently do not allow efficient model-to-model comparisons
Therefore, the important structural details contained in the enrolled speaker data set cannot be efficiently analyzed and used to improve the performance of the system
Fourth, HMM technology uses temporal information to construct models, so it is vulnerable to pretenders who imitate other people's voices by temporarily changing pitch, etc.
Fifth, the required speaker cannot easily optimize the world model / fake cohort used by the system for the purpose of testing utterances
However, the mean of the true speaker distribution will be considerably lower than the mean of the impostor distribution tested with the same model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker recognition systems
  • Speaker recognition systems
  • Speaker recognition systems

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035]In order to provide an improved speaker recognition (authentication and / or discrimination) system, the present invention comprises various aspects and features which can be combined in various ways. Some of these aspects relate to the way speech samples are modeled during speaker registration and subsequent recognition of input speech samples. Other aspects concern how the input speech model is classified in order to enable decisions about speaker identity. Yet another aspect concerns the normalization of the speech signal input to the speaker recognition system (channel normalization). Another aspect concerns the application of speaker recognition systems.

[0036] now refer to Figure 4 to Figure 6 The basic structure used by the system incorporating the different aspects of the invention is described. It will be understood that the input to the embodiments of the invention described herein are all digital signals including speech samples, which have been previously...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Speaker recognition (identification and / or verification) methods and systems, in which speech models for enrolled speakers consist of sets of feature vectors representing the smoothed frequency spectrum of each of a plurality of frames and a clustering algorithm is applied to the feature vectors of the frames to obtain a reduced data set representing the original speech sample, and wherein the adjacent frames are overlapped by at least 80%. Speech models of this type model the static components of the speech sample and exhibit temporal independence. An identifier strategy is employed in which modelling and classification processes are selected to give a false rejection rate substantially equal to zero. Each enrolled speaker is associated with a cohort of a predetermined number of other enrolled speakers and a test sample is always matched with either the claimed identity or one of its associated cohort. This makes the overall error rate of the system dependent only on the false acceptance rate, which is determined by the cohort size. The false error rate is further reduced by use of multiple parallel modelling and / or classification processes. Speech models are normalised prior to classification using a normalisation model derived from either the test speech sample or one of the enrolled speaker samples (most preferably from the claimed identity enrolment sample).

Description

technical field [0001] The present invention relates to a system, method and apparatus for performing speaker recognition. Background technique [0002] Speaker recognition includes the related fields of speaker verification and speaker identification. Its main purpose is to confirm the speaker's claimed identity from his / her voice, which is called authentication, or to identify the speaker from his / her voice, which is called discrimination. Both of these use the human voice as a biometric measure and infer the unique relationship between a voice and the person who produced it. This unique relationship enables verification and discrimination. Speaker recognition techniques analyze test utterances and compare them to known templates or models for the person being identified or authenticated. The efficiency of this system depends on the quality of the algorithms used in the processing. [0003] Speaker recognition systems have a variety of applications. According to anoth...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L17/02G10L17/12G10L17/20
CPCG10L17/12G10L17/20G10L17/02
Inventor 安德鲁·托马斯·萨佩利克
Owner SECURIVOX
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products