Abstract
Digit recognition is important in applications such as automated banking systems and database information retrieval systems. To design a high-performance Mandarin digit recognizer, a Mandarin phonetic question set was first carefully designed and then used to cluster 846 gender-dependent cross-word triphones. To model the fine differences in the high-frequency region of Mandarin initials, inverse mel-frequency warping was used to calculate the IMFCC feature. The IMFCC feature was shown to be quite effective in correcting the substitution errors caused by the similarity of the Mandarin initials. Combined with triphone duration modeling, the recognizer achieved a 98.81% word accuracy rate and a 95.20% sentence correct rate.
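
The abstract does not state the warping formula, so the following is only a minimal sketch of one way an inverse mel-frequency warping could be realized, not the authors' implementation. It assumes the common mel formula `2595 * log10(1 + f / 700)` and builds the inverse scale by mirroring the mel scale about the band edge, so that filter centers become dense at high frequencies instead of low ones. The function names, the 4000 Hz band edge, and the 20 filters are all hypothetical choices made for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Standard mel warping (assumed formula, not taken from the paper)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def hz_to_inverse_mel(f_hz, f_max_hz):
    """Assumed inverse mel warping: the mel scale mirrored about f_max_hz."""
    return hz_to_mel(f_max_hz) - hz_to_mel(f_max_hz - f_hz)


def inverse_mel_to_hz(m, f_max_hz):
    """Inverse of hz_to_inverse_mel."""
    return f_max_hz - mel_to_hz(hz_to_mel(f_max_hz) - m)


def filter_centers(num_filters, f_max_hz, inverse=False):
    """Center frequencies (Hz) of filters equally spaced on the warped axis."""
    top = hz_to_mel(f_max_hz)  # both scales span the same warped range
    step = top / (num_filters + 1)
    points = [step * i for i in range(1, num_filters + 1)]
    if inverse:
        return [inverse_mel_to_hz(p, f_max_hz) for p in points]
    return [mel_to_hz(p) for p in points]


# Hypothetical settings: 20 filters over a 0-4000 Hz band.
mel_centers = filter_centers(20, 4000.0)
imel_centers = filter_centers(20, 4000.0, inverse=True)
```

Under these assumptions, `mel_centers` crowds filters toward low frequencies while `imel_centers` crowds them toward the band edge, which is the region the abstract identifies as carrying the fine differences between Mandarin initials. Cepstral coefficients computed from such a filter bank would play the role of the IMFCC feature.
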
Original language | English |
---|---|
Title of host publication | ISSPA 1999 - Proceedings of the 5th International Symposium on Signal Processing and Its Applications |
Publisher | IEEE Computer Society |
Pages | 629-632 |
Number of pages | 4 |
Volume | 2 |
ISBN (Print) | 1864354518, 9781864354515 |
Publication status | Published - 1 Jan 1999 |
Externally published | Yes |
Event | 5th International Symposium on Signal Processing and Its Applications, ISSPA 1999 - Brisbane, QLD, Australia; Duration: 22 Aug 1999 → 25 Aug 1999 |
Conference
Conference | 5th International Symposium on Signal Processing and Its Applications, ISSPA 1999 |
---|---|
Country/Territory | Australia |
City | Brisbane, QLD |
Period | 22/08/99 → 25/08/99 |
ASJC Scopus subject areas
- Signal Processing