Neural networks based channel compensation for i-vector speaker verification

Wei Rao, Xiong Xiao, Chenglin Xu, Haihua Xu, Kongaik Lee, Eng Siong Chng, Haizhou Li

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Linear discriminant analysis (LDA) and Gaussian probabilistic LDA (PLDA) have been shown to effectively suppress channel-and session-variability of i-vectors. But they suffer the following limitations: 1) In LDA, a single linear transformation may not be adequate to describe the nonlinear relationship of features and 2) Gaussian-PLDA assumes the speaker and channel factors follow a Gaussian distribution, but they are actually non-Gaussians. We consider neural networks (NN) as a way to overcome the limitations, that captures the nonlinear relationship of features and does not require prior assumptions. This paper investigates three NN based channel compensation methods: deep metric learning, NN classifier, and deep denoising autoencoder and compares their performance with LDA and PLDA. Experiments conducted on NIST 2010 speaker recognition evaluation suggest that NN-based channel compensation methods are superior to LDA and that the performance of NN classifier is better than that of PLDA under most of common evaluation conditions. Additionally, this paper also helps us understand the relationships among LDA, PLDA, and NN based methods.

Original languageEnglish
Title of host publicationProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
EditorsHsin-Min Wang, Qingzhi Hou, Yuan Wei, Tan Lee, Jianguo Wei, Lei Xie, Hui Feng, Jianwu Dang, Jianwu Dang
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509042937
DOIs
Publication statusPublished - 2 May 2017
Externally publishedYes
Event10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016 - Tianjin, China
Duration: 17 Oct 201620 Oct 2016

Publication series

NameProceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016

Conference

Conference10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016
Country/TerritoryChina
CityTianjin
Period17/10/1620/10/16

Keywords

  • I-vector
  • LDA
  • Neural networks
  • PLDA
  • Speaker recognition

ASJC Scopus subject areas

  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Neural networks based channel compensation for i-vector speaker verification'. Together they form a unique fingerprint.

Cite this