Abstract
Device, language and environmental mismatch adversely affect speaker verification (SV) performance. We investigate such effects empirically based on the M3 (multibiometric, multilingual and multi-device) Corpus [1]. Device mismatch (among 3G phone, PocketPC and a desktop PC plug-in microphone) brings relative performance degradation of 523%; language mismatch (between English and Cantonese) brings 284% and environmental mismatch (between office environment and recording studio) brings 109%. In particular, verification with wide-band models on narrow-band test data outperforms narrow-band models on wide-band test data. The 3G phone's SV performance is generally low, but remains stable across environments. Additionally, durational variations within two-second utterances may cause a relative change of 633% in SV performance.
| Original language | English |
|---|---|
| Title of host publication | 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 |
| Volume | 4 |
| DOIs | |
| Publication status | Published - 6 Aug 2007 |
| Event | 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, HI, United States Duration: 15 Apr 2007 → 20 Apr 2007 |
Conference
| Conference | 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 |
|---|---|
| Country/Territory | United States |
| City | Honolulu, HI |
| Period | 15/04/07 → 20/04/07 |
Keywords
- Biometrics corpus
- M3 speaker verification evaluation
- Speaker verification
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'Effects of device mismatch, language mismatch and environmental mismatch on speaker verification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver