Abstract
The cohort and world models are commonly used for scoring normalization in speaker verification. As these models represent different regions of the feature space, a better solution could be obtained by integrating them into a single framework. In this paper, we embed the two models in elliptical basis function networks and propose a two-stage decision procedure for improving verification performance. In the first stage, the score of an unknown utterance is normalized by a world model. If the difference between the resulting normalized score and a world threshold is sufficiently large, the claimant is accepted or rejected immediately. Otherwise, the score will be normalized by a cohort model and compared with a cohort threshold to make a final accept/reject decision. Experimental evaluations based on the YOHO corpus suggest that the two-stage method achieves a lower error rate as compared to the case where only one background model is used.
Original language | English |
---|---|
Title of host publication | Signal Processing Theory and Methods IIAudio and ElectroacusticsSpeech Processing I |
Publisher | IEEE |
Pages | 1193-1196 |
Number of pages | 4 |
Volume | 2 |
ISBN (Electronic) | 0780362934 |
DOIs | |
Publication status | Published - 1 Jan 2000 |
Event | 25th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000 - Hilton Hotel and Convention Center, Istanbul, Turkey Duration: 5 Jun 2000 → 9 Jun 2000 |
Conference
Conference | 25th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000 |
---|---|
Country/Territory | Turkey |
City | Istanbul |
Period | 5/06/00 → 9/06/00 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering