Cross-Domain adaptation in Distance Space for Speaker Verification

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Significant performance degradation often occurs when a well-trained speaker verification system is applied to an unseen domain. Data augmentation and domain adaptation are two common approaches to tackling this problem. However, data augmentation would not be helpful when language mismatch rather than environment noise causes the domain shift. Domain adaptation also suffers from a label mismatch problem, making feature distribution alignment unreliable. We propose incorporating a distance metric space into model adaptation to address these issues. The idea is to align not only the embeddings across domains but also the distributions of their pairwise distances, resulting in embeddings tolerant to domain shift. To validate the idea, we used the non-Chinese utterances in VoxCeleb2 and the Chinese utterances in CN-Celeb2 as the source and target domain training data, respectively. Results show that the alignments reduce the EER on the CN-Celeb1 test set by 15.2%.

Original languageEnglish
Title of host publication2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2238-2243
Number of pages6
ISBN (Electronic)9798350300673
DOIs
Publication statusPublished - Nov 2023
Event2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, Taiwan
Duration: 31 Oct 20233 Nov 2023

Publication series

Name2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

Conference

Conference2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Country/TerritoryTaiwan
CityTaipei
Period31/10/233/11/23

Keywords

  • distance alignment
  • distance metric space
  • Domain shift
  • speaker embedding

ASJC Scopus subject areas

  • Hardware and Architecture
  • Signal Processing
  • Artificial Intelligence
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Cross-Domain adaptation in Distance Space for Speaker Verification'. Together they form a unique fingerprint.

Cite this