Abstract
This paper proposes a two-level fusion strategy for audio-visual biometric authentication. Specifically, fusion is performed at two levels: intramodal and intermodal. In intramodal fusion, the scores of multiple samples (e.g. utterances or video shots) obtained from the same modality are linearly combined, where the combination weights depend on the difference between the score values and a client-dependent reference score obtained during enrollment. This is followed by intermodal fusion in which the means of intramodal fused scores obtained from different modalities are either linearly combined or fused by a support vector machine (SVM). Experimental results based on the XM2VTSDB corpus show that intramodal and intermodal fusion are complementary to each other and that SVM-based intermodal fusion is superior to linear combination.
Original language | English |
---|---|
Title of host publication | 2005 IEEE ICASSP '05 - Proc. - Design and Implementation of Signal Proces.Syst.,Indust. Technol. Track,Machine Learning for Signal Proces. Education, Spec. Sessions |
Publisher | IEEE |
Volume | V |
ISBN (Print) | 0780388747, 9780780388741 |
DOIs | |
Publication status | Published - 1 Jan 2005 |
Event | 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States Duration: 18 Mar 2005 → 23 Mar 2005 |
Conference
Conference | 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 |
---|---|
Country/Territory | United States |
City | Philadelphia, PA |
Period | 18/03/05 → 23/03/05 |
ASJC Scopus subject areas
- Electrical and Electronic Engineering
- Signal Processing
- Acoustics and Ultrasonics