Abstract
Conventional PLDA scoring in i-vector speaker verification involves the i-vectors of target speakers and claimants only. We have previously demonstrated that better performance can be achieved by incorporating the information of background speakers in the scoring process via speaker-dependent SVMs. This is achieved by defining a PLDA score space with dimension equal to the number of training i-vectors for each target speaker. The new protocol in NIST 2012 SRE permits systems to use the information of other target speakers (called known non-targets) in each verification trial. In this paper, we exploit this new protocol to enhance the performance of PLDA-SVM scoring by using the score vectors of both known and unknown non-targets as the impostor-class data to train the speaker-dependent SVMs. However, some target speakers have only one enrollment utterance, which results in a severe imbalance between the speaker-class and impostor-class data for SVM training. This paper shows that if the enrollment utterance is sufficiently long, a number of target-speaker i-vectors can be generated by an utterance partitioning and resampling technique, resulting in much better scoring SVMs. Results on NIST 2012 SRE demonstrate the advantages of pooling the known and unknown non-targets for training the SVMs, and show that the resampling technique can help the SVM training algorithm find better decision boundaries for speakers with only a small number of enrollment utterances.
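The two ideas in the abstract, generating several target-speaker segments from one long utterance and mapping an i-vector into a score space whose dimension equals the number of enrollment i-vectors, can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the default parameter values, and the choice to resample by shuffling frame order are assumptions made for illustration. The cosine scorer is only a stand-in for a PLDA likelihood-ratio scorer, which the abstract does not specify.

```python
import numpy as np


def partition_and_resample(frames, num_partitions=4, num_resamples=2, seed=0):
    """Return frame-index arrays for the full utterance plus sub-utterances.

    Assumption: resampling is done by randomly permuting the frame order
    before each partitioning pass, so each pass yields different segments.
    """
    rng = np.random.default_rng(seed)
    num_frames = len(frames)
    segments = [np.arange(num_frames)]  # the full-length utterance itself
    for _ in range(num_resamples):
        order = rng.permutation(num_frames)  # shuffle frame indices
        # Split the shuffled indices into roughly equal-sized partitions.
        segments.extend(np.array_split(order, num_partitions))
    return segments


def cosine_score(a, b):
    """Stand-in scorer (NOT PLDA): cosine similarity of two i-vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def score_vector(x, enrol_ivectors, scorer=cosine_score):
    """Map i-vector x to a score vector with one score per enrollment i-vector."""
    return np.array([scorer(x, e) for e in enrol_ivectors])
```

In such a setup, each index array would select frames to pass to an i-vector extractor (not shown). The score vectors of the resulting target-speaker i-vectors would form the speaker class, and the score vectors of known and unknown non-targets would form the impostor class for training a speaker-dependent SVM.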
Original language | English |
---|---|
Title of host publication | 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 |
Publisher | IEEE |
Pages | 4012-4016 |
Number of pages | 5 |
ISBN (Print) | 9781479928927 |
Publication status | Published - 1 Jan 2014 |
Event | 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 - Florence, Italy; Duration: 4 May 2014 → 9 May 2014 |
Conference
Conference | 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 |
---|---|
Country/Territory | Italy |
City | Florence |
Period | 4/05/14 → 9/05/14 |
Keywords
- empirical kernel maps
- I-vectors
- likelihood ratio kernels
- NIST 2012 SRE
- probabilistic linear discriminant analysis
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering