Target-aware language models for spoken language recognition

Rong Tong, Bin Ma, Haizhou Li, Eng Siong Chng, Kong Aik Lee

Research output: Journal article publicationConference articleAcademic researchpeer-review

7 Citations (Scopus)

Abstract

This paper studies a new way of constructing multiple phone tokenizers for language recognition. In this approach, each phone tokenizer for a target language will share a common set of acoustic models, while each tokenizer will have a unique phone-based language model (LM) trained for a specific target language. The target-aware language models (TALM) are constructed to capture the discriminative ability of individual phones for the desired target languages. The parallel phone tokenizers thus formed are shown to achieve better performance than the original phone recognizer. The proposed TALM is very different from the LM in the traditional PPRLM technique. First of all, the TALM applies the LM information in the front-end as opposed to PPRLM approach which uses a LM in the system back-end; Furthermore, the TALM exploits the discriminative phones occurrence statistics, which are different from the traditional n-gram statistics in PPRLM approach. A novel way of training TALM is also studied in this paper. Our experimental results show that the proposed method consistently improves the language recognition performance on NIST 1996, 2003 and 2007 LRE 30-second closed test sets.

Original languageEnglish
Pages (from-to)200-203
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - Sept 2009
Externally publishedYes
Event10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom
Duration: 6 Sept 200910 Sept 2009

Keywords

  • Parallel phone tokenizer
  • Spoken language recognition
  • Target-aware language model
  • Target-oriented phone tokenizer
  • Universal phone recognizer

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Sensory Systems

Fingerprint

Dive into the research topics of 'Target-aware language models for spoken language recognition'. Together they form a unique fingerprint.

Cite this