A Survey on Text-Dependent and Text-Independent Speaker Verification

Research output: Journal article publicationReview articleAcademic researchpeer-review

13 Citations (Scopus)

Abstract

Speaker verification (SV) aims to detect an individual's identity from his/her voice. SV has been successfully applied in various areas such as access control, remote service customization, financial transactions, etc. Depending on whether the text content is pre-defined or not, SV can be text-dependent or text-independent. This paper reviews recent research on text-dependent SV (TD-SV) and text-independent SV (TI-SV). Because most modern SV systems apply deep learning methods to boost performance, we focus on the studies that use deep speaker embedding, a technique representing a person's identity via a fixed-dimensional vector encoded from a variable-length utterance. Rather than detailing every existing SV system, we make an overview of the representative SV systems that have attracted wide attention. Furthermore, an increasing number of SV systems have been devoted to addressing real-world challenges such as reverberation and noise, and this has driven a large number of studies on practical SV. Therefore, the survey compares the existing SV systems in the Far-Field Speaker Verification Challenge 2020 (FFSVC 2020) to illustrate the most effective techniques for both TD-SV and TI-SV.

Original languageEnglish
Pages (from-to)99038-99049
Number of pages12
JournalIEEE Access
Volume10
DOIs
Publication statusPublished - Sept 2022

Keywords

  • deep speaker embedding
  • far-field speaker verification
  • Text-dependent speaker verification
  • text-independent speaker verification

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'A Survey on Text-Dependent and Text-Independent Speaker Verification'. Together they form a unique fingerprint.

Cite this