Abstract
Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an 'unseen' handset. This paper proposes a divergence-based handset selector with out-of-handset (OOH) rejection capability to identify the 'unseen' handsets. This is achieved by measuring the Jensen difference between the selector's output and a constant vector with identical elements. The resulting handset selector is combined with a feature-based channel compensation algorithm for telephonebased speaker verification. Utterances whose handsets were identified as 'unseen' are either transformed by a global bias vector or normalized by cepstral mean subtraction (CMS). On the other hand, if the handset can be identified (considered as 'seen'), its corresponding transformation parameters will be used to transform the utterances. Experiments based on ten handsets of the HTIMIT corpus show that using the transformation parameters of the 'seen' handsets to transform the utterances with correctly identified handsets and processing those utterances with 'unseen' handsets by CMS achieve the best result.
Original language | English |
---|---|
Title of host publication | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
Publisher | International Speech Communication Association |
Pages | 2329-2332 |
Number of pages | 4 |
Publication status | Published - 1 Jan 2002 |
Event | 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States Duration: 16 Sept 2002 → 20 Sept 2002 |
Conference
Conference | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
---|---|
Country/Territory | United States |
City | Denver |
Period | 16/09/02 → 20/09/02 |
ASJC Scopus subject areas
- Language and Linguistics
- Linguistics and Language