Deep Learning Meets Private Talk: Conversational AI Can Predict Speaker Traits by Eavesdropping for only 30 Seconds

Andreas Liesenfeld, Gábor Parti, Chu Ren Huang

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Conversational AI such as smart speakers placed in home environments can accidentally activate and record people's talk for a short time. What can such devices learn about people by listening in on ongoing conversations? Taking two commonly used speaker traits as an example, we present the results of an experiment that simulates Conversational AI eavesdropping on ongoing talk using transcriptions of naturalistic conversations in private settings. We show that a currently popular type of deep learning-based system can reliably predict if a speaker is "young", "old", "female"or "male"(age=99%, gender=82%) based on what they say in around 30 seconds. Our results exemplify how powerful current big data language models are when it comes to data-driven predictions of personal information based on how people talk, even when listening only for a short time. We conclude the experiment with a critical comment on the increasingly pervasive use of such user modeling technology to compute speaker traits, touching upon some potential ethical concerns, bias, and privacy issues.

Original languageEnglish
Title of host publicationTagungsband - Mensch und Computer 2021
EditorsBastian Pfleging, Dagmar Kern, Stefan Schneegass
PublisherAssociation for Computing Machinery
Pages547-551
Number of pages5
ISBN (Electronic)9781450386456
DOIs
Publication statusPublished - 13 Sept 2021
Event2021 Conference on Mensch und Computer, MuC 2021 - 2021 Conference on Humans and Computers, MuC 2021 - Virtual, Online, Germany
Duration: 5 Sept 20218 Sept 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2021 Conference on Mensch und Computer, MuC 2021 - 2021 Conference on Humans and Computers, MuC 2021
Country/TerritoryGermany
CityVirtual, Online
Period5/09/218/09/21

Keywords

  • Conversational AI
  • ethics and bias
  • smart speakers

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Deep Learning Meets Private Talk: Conversational AI Can Predict Speaker Traits by Eavesdropping for only 30 Seconds'. Together they form a unique fingerprint.

Cite this