SNR-invariant PLDA modeling in nonparametric subspace for robust speaker verification

Research output: Journal article publicationJournal articleAcademic researchpeer-review

28 Citations (Scopus)

Abstract

While i-vector/PLDA framework has achieved great success, its performance still degrades dramatically under noisy conditions. To compensate for the variability of i-vectors caused by different levels of background noise, this paper proposes an SNR-invariant PLDA framework for robust speaker verification. First, nonparametric feature analysis (NFA) is employed to suppress intra-speaker variation and emphasize the discriminative information inherited in the boundaries between speakers in the i-vector space. Then, in the NFA-projected subspace, SNR-invariant PLDA is applied to separate the SNR-specific information from speaker-specific information using an identity factor and an SNR factor. Accordingly, a projected i-vector in the NFA subspace can be represented as a linear combination of three components: speaker, SNR, and channel. During verification, the variability due to SNR and channels are integrated out when computing the marginal likelihood ratio. Experiments based on NIST 2012 SRE show that the proposed framework achieves superior performance when compared with the conventional PLDA and SNR-dependent mixture of PLDA.
Original languageEnglish
Article number7120100
Pages (from-to)1648-1659
Number of pages12
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume23
Issue number10
DOIs
Publication statusPublished - 1 Oct 2015

Keywords

  • i-vector
  • nonparametric feature analysis
  • probabilistic linear discriminant analysis (PLDA)
  • SNR-invariant
  • speaker verification

ASJC Scopus subject areas

  • Acoustics and Ultrasonics
  • Electrical and Electronic Engineering

Cite this