Abstract
The construction of kernel functions to handle sequences of speech feature vectors is crucial in using support vector machine (SVM) for speaker verification. Previous studies have reported the idea of representing speech signals as sequences of discrete acoustic or phonotactic events. This paper introduces a class of SVM kernels derived based on the expected likelihood measure between the probability distributions of discrete event sequences. We investigate and compare the effectiveness of three expected likelihood kernels using the universal background model (UBM) as the discrete event detector. Experiments conducted on the NIST 2006 speaker verification task indicate that the proposed kernel outperforms the popular rank-normalized kernel.
Original language | English |
---|---|
Pages (from-to) | 591-595 |
Number of pages | 5 |
Journal | European Signal Processing Conference |
Publication status | Published - Oct 2010 |
Externally published | Yes |
Event | 18th European Signal Processing Conference, EUSIPCO 2010 - Aalborg, Denmark Duration: 23 Aug 2010 → 27 Aug 2010 |
ASJC Scopus subject areas
- Signal Processing
- Electrical and Electronic Engineering