Spoken language recognition: From fundamentals to practice

Haizhou Li, Bin Ma, Kong Aik Lee

Research output: Journal article publicationJournal articleAcademic researchpeer-review

272 Citations (Scopus)

Abstract

Spoken language recognition refers to the automatic process through which we determine or verify the identity of the language spoken in a speech sample. We study a computational framework that allows such a decision to be made in a quantitative manner. In recent decades, we have made tremendous progress in spoken language recognition, which benefited from technological breakthroughs in related areas, such as signal processing, pattern recognition, cognitive science, and machine learning. In this paper, we attempt to provide an introductory tutorial on the fundamentals of the theory and the state-of-the-art solutions, from both phonological and computational aspects. We also give a comprehensive review of current trends and future research directions using the language recognition evaluation (LRE) formulated by the National Institute of Standards and Technology (NIST) as the case studies.

Original languageEnglish
Article number6451097
Pages (from-to)1136-1159
Number of pages24
JournalProceedings of the IEEE
Volume101
Issue number5
DOIs
Publication statusPublished - Feb 2013
Externally publishedYes

Keywords

  • Acoustic features
  • calibration
  • classifier
  • fusion
  • language recognition evaluation (LRE)
  • phonotactic features
  • spoken language recognition
  • tokenization
  • vector space modeling

ASJC Scopus subject areas

  • General Computer Science
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Spoken language recognition: From fundamentals to practice'. Together they form a unique fingerprint.

Cite this