Factor analysis based spatial correlation modeling for speaker verification

Er Yu Wang, Wu Guo, Li Rong Dai, Kong Aik Lee, Bin Ma, Hai Zhou Li

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

5 Citations (Scopus)

Abstract

Gaussian mixture models (GMMs) are commonly used in text-independent speaker verification for modeling the spectral distribution of speech. Recent studies have shown the effectiveness of characterizing speaker information using the mean super-vector obtained by concatenating the mean vectors of the GMM. This paper proposes to use the spatial correlation captured by the covariance matrix of the mean super-vector for speaker verification. Factor analysis method is adopted to estimate the covariance of the super-vector. For measuring the similarity between speech utterances in terms of the spatial correlation, we propose two kernel metrics, namely, log-Euclidean inner product and Frobenius angle. For computational simplicity, we introduce an inner product classifier (IPC) with equivalent performance compared to the commonly used support vector machine (SVM). Experiments conducted on the 2006 NIST speaker recognition evaluation (SRE) dataset confirm the efficacy of the proposed factor analysis based spatial modeling technique.

Original languageEnglish
Title of host publication2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings
Pages166-170
Number of pages5
DOIs
Publication statusPublished - Jan 2010
Externally publishedYes
Event2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Tainan, Taiwan
Duration: 29 Nov 20103 Dec 2010

Publication series

Name2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

Conference

Conference2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010
Country/TerritoryTaiwan
CityTainan
Period29/11/103/12/10

Keywords

  • Factor analysis
  • Frobenius angle
  • Inner product classifier
  • Log-Euclidean distance

ASJC Scopus subject areas

  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Factor analysis based spatial correlation modeling for speaker verification'. Together they form a unique fingerprint.

Cite this