Skip to main navigation Skip to search Skip to main content

Room-scale Voice Liveness Detection for Smart Devices

Research output: Journal article publicationJournal articleAcademic researchpeer-review

Abstract

Voice assistants are widely integrated into a variety of mobile devices, enabling users to easily complete daily tasks and even critical operations like online transactions with voice commands. Thus, once attackers replay a secretly-recorded voice command by loudspeakers to compromise users' voice assistants, this operation will cause serious consequences, such as information leakage and property loss. Unfortunately, most voice liveness detection approaches against replay attacks mainly rely on detecting lip motions or subtle physiological features in speech, which are limited within a very short range. In this paper, we propose VoShield to check whether a voice command is from a genuine user or a loudspeaker imposter. VoShield measures sound field dynamics, a feature that changes fast as the human mouths dynamically open and close. In contrast, it would remain rather stable for loudspeakers due to the fixed size. This feature enables VoShield to largely extend the working distance and remain resilient to user locations. Besides, sound field dynamics are extracted from the difference between multiple microphone channels, making this feature robust to voice volume. To evaluate VoShield, we conducted comprehensive experiments with various settings in different working scenarios. The results show that VoShield can achieve a detection accuracy of 98.2% and an Equal Error Rate of 2.0%, which serves as a promising complement to current voice authentication systems for smart mobile devices.

Original languageEnglish
Pages (from-to)1-14
Number of pages14
JournalIEEE Transactions on Dependable and Secure Computing
DOIs
Publication statusAccepted/In press - 2024

Keywords

  • Apertures
  • Feature extraction
  • Liveness Detection
  • Loudspeakers
  • Microphone Array
  • Microphone arrays
  • Mouth
  • Personal voice assistants
  • Replay Attack
  • Smart devices
  • Voice Assistant

ASJC Scopus subject areas

  • General Computer Science
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Room-scale Voice Liveness Detection for Smart Devices'. Together they form a unique fingerprint.

Cite this