Abstract
3D talking heads have developed rapidly, and both external and internal articulators have been demonstrated. In Mandarin pronunciation, aspiration airflow is crucial for discriminating confusable consonants. In this paper, we present a 3D talking head system for articulatory and aspiration animation that uses EMA (electromagnetic articulography) articulation data and airflow data simultaneously. Quantitative analyses of the airflow data indicated that confusable Mandarin consonants could be distinguished from each other by the mean airflow during voicing, peak expiratory airflow, and airflow duration. An airflow model was then incorporated into the 3D articulatory model to produce airflow in accordance with the articulator movements of Mandarin pronunciation. An audio-visual test was designed to evaluate the 3D articulation and aspiration system, in which minimal pairs were used to test whether the animation could be recognized. Identification accuracy improved significantly, from 43.9% without airflow to 84.8% with airflow information incorporated.
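The three airflow measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `airflow_features`, the per-sample voicing mask, the onset threshold used to measure duration, and the units are all assumptions introduced here for illustration.

```python
from typing import Dict, Sequence


def airflow_features(
    flow: Sequence[float],
    voiced: Sequence[bool],
    sample_rate_hz: float,
    threshold: float,
) -> Dict[str, float]:
    """Compute three summary measures from one sampled airflow trace.

    flow           -- airflow samples (units depend on the recording setup)
    voiced         -- hypothetical per-sample voicing mask, same length as flow
    sample_rate_hz -- sampling rate of the airflow signal
    threshold      -- assumed flow level above which airflow counts as present
    """
    if len(flow) != len(voiced):
        raise ValueError("flow and voiced must have the same length")
    if len(flow) == 0:
        raise ValueError("flow must not be empty")
    if sample_rate_hz <= 0:
        raise ValueError("sample_rate_hz must be positive")

    # Mean airflow over the samples marked as voiced (0.0 if none are voiced).
    voiced_samples = [f for f, v in zip(flow, voiced) if v]
    mean_voiced = sum(voiced_samples) / len(voiced_samples) if voiced_samples else 0.0

    # Peak expiratory airflow: the largest sample in the trace.
    peak = max(flow)

    # Airflow duration: total time the flow stays above the assumed threshold.
    duration_s = sum(1 for f in flow if f > threshold) / sample_rate_hz

    return {
        "mean_voiced_airflow": mean_voiced,
        "peak_expiratory_airflow": peak,
        "airflow_duration_s": duration_s,
    }


# Toy trace for illustration only; these are not measured data.
example = airflow_features(
    flow=[0.0, 0.2, 0.8, 0.4, 0.1, 0.0],
    voiced=[False, False, False, True, True, False],
    sample_rate_hz=10.0,
    threshold=0.15,
)
```

Comparing these three values across the two members of a minimal pair is one simple way to examine how aspirated and unaspirated consonants differ in their airflow.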
Original language | English |
---|---|
Title of host publication | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Proceedings |
Publisher | IEEE |
Pages | 6130-6134 |
Number of pages | 5 |
Volume | 2016-May |
ISBN (Electronic) | 9781479999880 |
Publication status | Published - 18 May 2016 |
Externally published | Yes |
Event | 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Shanghai International Convention Center, Shanghai, China; Duration: 20 Mar 2016 → 25 Mar 2016 |
Conference
Conference | 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 |
---|---|
Country/Territory | China |
City | Shanghai |
Period | 20/03/16 → 25/03/16 |
Keywords
- 3D articulatory dynamics
- Airflow
- confusable consonants
- intelligible enhancement
- PAS
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering