Abstract
Automatic detection of backchannel has great potential to enhance artificial mediators, which indicate listeners' attention and agreement in human communication. It is often expressed by subtle non-verbal cues that occur briefly and sparsely. Focusing on identifying and locating these subtle cues (i.e., their occurrence moment and the involved body parts), this paper proposes a novel approach for backchannel detection. In particular, our model utilizes temporal- and modality-attention modules to determine and lead the model to pay more attention to both the indicative moment and the accompanying body parts at that specific time. It achieves an accuracy of 68.6% on the testing set in MultiMediate'23 backchannel detection challenge, outperforming the counterparts. Furthermore, we conducted an ablation study to thoroughly understand the contributions of our model. This study underscores the effectiveness of our selection of modality inputs and the importance of the two attention modules in our model.
Original language | English |
---|---|
Title of host publication | MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia |
Publisher | Association for Computing Machinery, Inc |
Pages | 9586-9590 |
Number of pages | 5 |
ISBN (Electronic) | 9798400701085 |
DOIs | |
Publication status | Published - 26 Oct 2023 |
Event | 31st ACM International Conference on Multimedia, MM 2023 - Ottawa, Canada Duration: 29 Oct 2023 → 3 Nov 2023 |
Publication series
Name | MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia |
---|
Conference
Conference | 31st ACM International Conference on Multimedia, MM 2023 |
---|---|
Country/Territory | Canada |
City | Ottawa |
Period | 29/10/23 → 3/11/23 |
Keywords
- attention models
- backchannel detection
- visual cues
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Graphics and Computer-Aided Design
- Human-Computer Interaction
- Software
Fingerprint
Dive into the research topics of 'Unveiling Subtle Cues: Backchannel Detection Using Temporal Multimodal Attention Networks'. Together they form a unique fingerprint.Prizes
-
1st Place in the Backchannel Detection Challenge at ACM MM'23
Wang, K. (Recipient), Cheung, M. K. M. (Recipient), Zhang, Y. (Recipient), Yang, C. (Recipient), Chen, Q. (Recipient), Fu, Y. (Recipient) & Ngai, G. (Recipient), 3 Nov 2023
Prize: Prize (research)
File