Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models

Ka Yee Leung, Man Wai Mak, Manhung Siu, Sun Yuan Kung

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Articulatory feature-based conditional pronunciation modeling (AFCPM) aims to capture the pronunciation characteristics of speakers by modeling the linkage between the states of articulation during speech production and the actual phones produced by a speaker. Previous AFCPM systems use one discrete density function for each phoneme to model the pronunciation characteristics of speakers. This paper proposes using a mixture of discrete density functions for AFCPM. In particular, the pronunciation characteristics of each phoneme is modeled by two density functions: one responsible for describing the articulatory features that are more relevant to vowels and the other for consonants. Verification scores are the weighted sum of the outputs of the two models. To enhance the resolution of the pronunciation models, four articulatory properties (front-back, liprounding, place of articulation, and manner of articulation) are used for pronunciation modeling. The proposed AFCPM is applied to a speaker verification task. Results show that using four articulatory features achieves a lower error rate as compared to using two features (manner and place of articulation) only. It was also found that dividing the articulatory properties into two groups is an effective means of solving the data-sparseness problem encountered in the training phase of AFCPM systems.
Original languageEnglish
Title of host publication9th European Conference on Speech Communication and Technology
Pages3089-3092
Number of pages4
Publication statusPublished - 1 Dec 2005
Event9th European Conference on Speech Communication and Technology - Lisbon, Portugal
Duration: 4 Sept 20058 Sept 2005

Conference

Conference9th European Conference on Speech Communication and Technology
Country/TerritoryPortugal
CityLisbon
Period4/09/058/09/05

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models'. Together they form a unique fingerprint.

Cite this