Advances in mandarin broadcast speech recognition

Mei Yuh Hwang, Wen Wang, Xin Lei, Jing Zheng, Ozgur Cetin, Gang Peng

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

6 Citations (Scopus)

Abstract

We describe our continuing efforts to improve the UW-SRI-ICSI Mandarin broadcast speech recognizer. This includes increasing acoustic and text training data, adding discriminative features, incorporating frame-level discriminative training criterion, multiple-pass acoustic model (AM) cross adaptation, language model (LM) genre adaptation and system combination. The net effect without LM adaptation was a 24%-64% relative reduction in character error rates (CERs) on a variety of test sets. In addition, LM adaptation gave us another 6% of relative CER reduction on broadcast conversations.
Original languageEnglish
Title of host publicationInternational Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Pages2876-2879
Number of pages4
Volume4
Publication statusPublished - 1 Dec 2007
Externally publishedYes
Event8th Annual Conference of the International Speech Communication Association, Interspeech 2007 - Antwerp, Belgium
Duration: 27 Aug 200731 Aug 2007

Conference

Conference8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Country/TerritoryBelgium
CityAntwerp
Period27/08/0731/08/07

Keywords

  • Character error rates
  • Discriminative features
  • Discriminative training
  • LM adaptation
  • Mandarin

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Modelling and Simulation
  • Linguistics and Language
  • Communication

Cite this