Abstract
Previous studies have demonstrated the benefits of PLDA-SVM scoring with empirical kernel maps for i-vector/PLDA speaker verification. The method not only performs significantly better than the conventional PLDA scoring and utilizes the multiple enrollment utterances of target speakers effectively, but also opens up opportunity for adopting sparse kernel machines in PLDA-based speaker verification systems. This paper proposes taking the advantages of empirical kernel maps by incorporating them into a more advanced kernel machine called relevance vector machines (RVMs). The paper reports extensive analyses on the behaviors of RVMs and provides insight into the properties of RVMs and their applications in i-vector/PLDA speaker verification. Results on NIST 2012 SRE demonstrate that PLDA-RVM outperforms the conventional PLDA and that it achieves a comparable performance as PLDA-SVM. Results also show that PLDA-RVM is much sparser than PLDA-SVM.
Original language | English |
---|---|
Pages (from-to) | 104-121 |
Number of pages | 18 |
Journal | Computer Speech and Language |
Volume | 38 |
DOIs | |
Publication status | Published - 1 Jul 2016 |
Keywords
- Empirical kernel maps
- I-vectors
- NIST SRE
- Probabilistic linear discriminant analysis
- Relevance vector machines
ASJC Scopus subject areas
- Software
- Theoretical Computer Science
- Human-Computer Interaction