Jointly Modelling Transcriptions and Phonemes with Optimal Features to Detect Dementia from Spontaneous Cantonese

Xiaoquan Ke, Man Wai Mak, Helen M. Meng

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Dementia is a severe cognitive impairment that affects the health of older adults. This paper analyzes diverse speech-based features extracted from spoken languages and selects the most discriminative ones for dementia detection. We propose a two-step feature selection method to handle the circumstance where the feature dimension is far larger than the number of training samples. Recently, the performance of dementia detection has been significantly improved by utilizing Transformer-based models that automatically capture the linguistic properties of spoken languages. We combine features extracted from BERT with selected speech-based features to enhance dementia detection performance. We propose a novel strategy to model the transcriptions and their phonemes using BERT and phoneme-BERT. The proposed method is evaluated on a Cantonese dataset called CU-Marvel, which contains 185 healthy older adults, 98 older adults having minor neurocognitive disorders (minor NCD), and 26 older adults suffering from major NCD. Experimental results show that simultaneously fine-tuning the BERT and phoneme-BERT can leverage information from the recognized phonemes and make the detection performance robust to automatic speech recognition errors. Simultaneous fine-tuning of the BERT and phoneme-BERT models results in a 6% improvement in F1 scores, compared to fine-tuning the BERT model alone.

Original languageEnglish
Title of host publication2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2267-2273
Number of pages7
ISBN (Electronic)9798350300673
DOIs
Publication statusPublished - Nov 2023
Event2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 - Taipei, Taiwan
Duration: 31 Oct 20233 Nov 2023

Publication series

Name2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023

Conference

Conference2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023
Country/TerritoryTaiwan
CityTaipei
Period31/10/233/11/23

Keywords

  • Dementia detection
  • feature selection
  • phoneme-BERT

ASJC Scopus subject areas

  • Hardware and Architecture
  • Signal Processing
  • Artificial Intelligence
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Jointly Modelling Transcriptions and Phonemes with Optimal Features to Detect Dementia from Spontaneous Cantonese'. Together they form a unique fingerprint.

Cite this