Skip to main navigation Skip to search Skip to main content

A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children

Research output: Journal article publicationConference articleAcademic researchpeer-review

Abstract

Speech sound disorder (SSD) in children is manifested by persistent articulation and phonological errors on specific phonemes of a language. Automatic SSD detection can be done using features extracted from deep neural network models. Interpretability of such learned features is a major concern. Motivated by clinical knowledge, the use of duration and formant features for SSD detection is investigated in this research. Acoustical analysis is performed to identify the acoustic features that differentiate between the speech of typical and disordered children. On the task of SSD detection in Cantonese-speaking children, the duration features are found to outperform the formant features and surpass previous methods that use paralinguistic feature set and speaker embeddings. Specifically, the duration features achieve a mean unweighted average recall of 71.0%. The results enhance the understanding of SSD, and motivate further use of temporal information of child speech in SSD detection.
Original languageEnglish
Pages (from-to)4643-4647
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2023
DOIs
Publication statusPublished - Aug 2023
Event24th International Speech Communication Association, Interspeech 2023 - Dublin, Ireland
Duration: 20 Aug 202324 Aug 2023

Keywords

  • child speech
  • speech sound disorder
  • detection of pathological speech
  • vowel duration
  • formant frequency

Fingerprint

Dive into the research topics of 'A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children'. Together they form a unique fingerprint.

Cite this