Abstract
This paper presents the HKCUPU speaker recognition system submitted to NIST 2010 speaker recognition evaluation (SRE). The system comprises five subsystems, each with different acoustic features, session-variability reduction methods, speaker modeling and scoring methods and classifiers. This paper reports the results of individual and fusion systems for the core test and highlights the improvements made by our newly proposed JFA-Fishervoice (FSH) subsystem. Results show that FSH outperforms JFA when its projection matrix is channel-dependent (telephone or microphone) and that FSH is complementary to other state-of-the-art techniques. It was also found that VAD is an important pre-processing step for interview speech.
Original language | English |
---|---|
Title of host publication | 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings |
Pages | 5288-5291 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 18 Aug 2011 |
Event | 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Prague, Czech Republic Duration: 22 May 2011 → 27 May 2011 |
Conference
Conference | 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 |
---|---|
Country/Territory | Czech Republic |
City | Prague |
Period | 22/05/11 → 27/05/11 |
Keywords
- discriminative models
- factor analysis
- Fishervoice
- NIST SRE 2010
- speaker recognition
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering