Background learning of speaker voices
A speaker and speaker model technology, applied in speech analysis, speech recognition, measuring devices, etc., can solve problems such as system difficulty and achieve fast and simple registration
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0029] figure 1 A block diagram of a speaker recognition system according to the present invention is shown. The system consists of three main units that execute consecutively in time: background learning 110 , speaker registration 120 and speaker recognition 130 . Background learning includes speech data acquisition 112, followed by blind clustering of speech utterances based on speaker characteristics. The goal of blind utterance clustering is to group unknown utterances when no initial information is available about speaker identity or even about speaker group size. The details of this part will be described below. Once the clusters are generated, speaker model 116 ensures that the utterances in each of these clusters are used to train models each belonging to a possible speaker. The model is best trained using traditional Gaussian Mixture Model (GMM) techniques, where a set of M clusters is defined by the GMM's {λ 1 c ,λ 21 c ,...,λ M c}express. Those familiar wi...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 