VoShield: Voice Liveness Detection with Sound Field Dynamics

Qiang Yang, Kaiyan Cui, Yuanqing Zheng

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

7 Citations (Scopus)

Abstract

Voice assistants are widely integrated into a variety of smart devices, enabling users to easily complete daily tasks and even critical operations like online transactions with voice commands. Thus, once attackers replay a secretly-recorded voice command by loudspeakers to compromise users' voice assistants, this operation will cause serious consequences, such as information leakage and property loss. Unfortunately, most voice liveness detection approaches against replay attacks mainly rely on detecting lip motions or subtle physiological features in speech, which are limited within a very short range. In this paper, we propose VoShield to check whether a voice command is from a genuine user or a loudspeaker imposter. VoShield measures sound field dynamics, a feature that changes fast as the human mouths dynamically open and close. In contrast, it would remain rather stable for loudspeakers due to the fixed size. This feature enables VoShield to largely extend the working distance and remain resilient to user locations. Besides, sound field dynamics are extracted from the difference between multiple microphone channels, making this feature robust to voice volume. To evaluate VoShield, we conducted comprehensive experiments with various settings in different working scenarios. The results show that VoShield can achieve a detection accuracy of 98.2% and an Equal Error Rate of 2.0%, which serves as a promising complement to current voice authentication systems for smart devices.

Original languageEnglish
Title of host publicationINFOCOM 2023 - IEEE Conference on Computer Communications
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-10
ISBN (Electronic)9798350334142
DOIs
Publication statusPublished - Aug 2023
Event42nd IEEE International Conference on Computer Communications, INFOCOM 2023 - Hybrid, New York City, United States
Duration: 17 May 202320 May 2023

Publication series

NameProceedings - IEEE INFOCOM
Volume2023-May
ISSN (Print)0743-166X

Conference

Conference42nd IEEE International Conference on Computer Communications, INFOCOM 2023
Country/TerritoryUnited States
CityHybrid, New York City
Period17/05/2320/05/23

Keywords

  • Liveness Detection
  • Microphone Array
  • Replay Attack
  • Voice Assistant

ASJC Scopus subject areas

  • General Computer Science
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'VoShield: Voice Liveness Detection with Sound Field Dynamics'. Together they form a unique fingerprint.

Cite this