Unveiling Subtle Cues: Backchannel Detection Using Temporal Multimodal Attention Networks

Kangzhong Wang, M. K.Michael Cheung, Youqian Zhang, Chunxi Yang, Peter Q. Chen, Eugene Yujun Fu, Grace Ngai

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Automatic detection of backchannel has great potential to enhance artificial mediators, which indicate listeners' attention and agreement in human communication. It is often expressed by subtle non-verbal cues that occur briefly and sparsely. Focusing on identifying and locating these subtle cues (i.e., their occurrence moment and the involved body parts), this paper proposes a novel approach for backchannel detection. In particular, our model utilizes temporal- and modality-attention modules to determine and lead the model to pay more attention to both the indicative moment and the accompanying body parts at that specific time. It achieves an accuracy of 68.6% on the testing set in MultiMediate'23 backchannel detection challenge, outperforming the counterparts. Furthermore, we conducted an ablation study to thoroughly understand the contributions of our model. This study underscores the effectiveness of our selection of modality inputs and the importance of the two attention modules in our model.

Original languageEnglish
Title of host publicationMM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery, Inc
Pages9586-9590
Number of pages5
ISBN (Electronic)9798400701085
DOIs
Publication statusPublished - 26 Oct 2023
Event31st ACM International Conference on Multimedia, MM 2023 - Ottawa, Canada
Duration: 29 Oct 20233 Nov 2023

Publication series

NameMM 2023 - Proceedings of the 31st ACM International Conference on Multimedia

Conference

Conference31st ACM International Conference on Multimedia, MM 2023
Country/TerritoryCanada
CityOttawa
Period29/10/233/11/23

Keywords

  • attention models
  • backchannel detection
  • visual cues

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Unveiling Subtle Cues: Backchannel Detection Using Temporal Multimodal Attention Networks'. Together they form a unique fingerprint.
  • 1st Place in the Backchannel Detection Challenge at ACM MM'23

    Wang, K. (Recipient), Cheung, M. K. M. (Recipient), Zhang, Y. (Recipient), Yang, C. (Recipient), Chen, Q. (Recipient), Fu, Y. (Recipient) & Ngai, G. (Recipient), 3 Nov 2023

    Prize: Prize (research)

    File

Cite this