Posterior weights and Gaussian selection for spoken language recognition

Kong Aik Lee, Changhuai You, Haizhou Li

Research output: Unpublished conference presentation (presented paper, abstract, poster)Conference presentation (not published in journal/proceeding/book)Academic researchpeer-review

Abstract

This paper investigates the use of the posterior weights of GMMs for spoken language recognition, the goal of which is to determine the language spoken in speech utterances. Since the modeling of distribution is based on the component weights, the number of components in the GMM has to be sufficiently large so as to provide enough degree of freedom. To this end, Gaussian selection technique is applied to speed up the run-time computation. Another problem in using posterior weights is the distance metric. We treat the posterior weights as the probability mass function of a discrete random variable, for which rigorous similarity measures can be easily defined. The proposed language recognition system achieves state-of-the-art performance on the 1996, 2003, 2005 and 2007 National Institute of Standards and Technology (NIST) language recognition tasks.

Original languageEnglish
Pages801-804
Number of pages4
Publication statusPublished - Oct 2009
Externally publishedYes
EventAsia-Pacific Signal and Information Processing Association 2009 Annual Summit and Conference, APSIPA ASC 2009 - Sapporo, Japan
Duration: 4 Oct 20097 Oct 2009

Conference

ConferenceAsia-Pacific Signal and Information Processing Association 2009 Annual Summit and Conference, APSIPA ASC 2009
Country/TerritoryJapan
CitySapporo
Period4/10/097/10/09

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Electrical and Electronic Engineering
  • Communication

Fingerprint

Dive into the research topics of 'Posterior weights and Gaussian selection for spoken language recognition'. Together they form a unique fingerprint.

Cite this