Fusion of acoustic and tokenization features for speaker recognition

  • Rong Tong
  • , Bin Ma
  • , Kong Aik Lee
  • , Changhuai You
  • , Donglai Zhu
  • , Tomi Kinnunen
  • , Hanwu Sun
  • , Minghui Dong
  • , Eng Siong Chng
  • , Haizhou Li

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

This paper describes our recent efforts in exploring effective discriminative features for speaker recognition. Recent researches have indicated that the appropriate fusion of features is critical to improve the performance of speaker recognition system. In this paper we describe our approaches for the NIST 2006 Speaker Recognition Evaluation. Our system integrated the cepstral GMM modeling, cepstral SVM modeling and tokenization at both phone level and frame level. The experimental results on both NIST 2005 SRE corpus and NIST 2006 SRE corpus are presented. The fused system achieved 8.14% equal error rate on 1conv4w-1conv4w test condition of the NIST 2006 SRE.

Original languageEnglish
Title of host publicationChinese Spoken Language Processing - 5th International Symposium, ISCSLP 2006, Proceedings
Pages566-577
Number of pages12
DOIs
Publication statusPublished - 2006
Externally publishedYes
Event5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006 - Singapore, Singapore
Duration: 13 Dec 200616 Dec 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4274 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006
Country/TerritorySingapore
CitySingapore
Period13/12/0616/12/06

Keywords

  • Cepstral feature
  • Gaussian mixture model
  • Phonotactic feature
  • Speaker recognition
  • Support vector machine
  • Tokenization, fusion

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Fusion of acoustic and tokenization features for speaker recognition'. Together they form a unique fingerprint.

Cite this