Comparing Multi-layer Perceptrons and Radial Basis Functions networks in speaker recognition

Man Wai Mak, W. G. Allen, G. G. Sexton

Research output: Journal article publicationJournal articleAcademic researchpeer-review

4 Citations (Scopus)

Abstract

We have compared the performance of Multi-layer Perceptrons networks (MLP) and Radial Basis Function networks (RBF) in the task of speaker identification. The experiments are carried out on 400 utterances (10 digits, in English) from 10 speakers. LPC-derived Cepstrum Coefficients are used as the speaker specific features. The results show that the MLP networks are superior in memory usage and classification time. Nevertheless, they suffer from long training time and the classification performance is poorer than that of the RBF networks. The function centres of the RBF networks are either selected randomly from the training data or located by a K-mean algorithm. We find that K-mean clusteirng is an effective method in locating the function centres. We also find that by guaranteeing every speaker has similar number of function centres, the recognition performance can be improved further.
Original languageEnglish
Pages (from-to)147-159
Number of pages13
JournalJournal of Microcomputer Applications
Volume16
Issue number2
DOIs
Publication statusPublished - 1 Apr 1993
Externally publishedYes

ASJC Scopus subject areas

  • Engineering(all)

Cite this