ASVtorch toolkit: Speaker verification with deep neural networks

Kong Aik Lee, Ville Vestman, Tomi Kinnunen

Research output: Journal article publicationJournal articleAcademic researchpeer-review

8 Citations (Scopus)

Abstract

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.

Original languageEnglish
Article number100697
JournalSoftwareX
Volume14
DOIs
Publication statusPublished - Jun 2021
Externally publishedYes

Keywords

  • Deep learning
  • PyTorch
  • Speaker recognition

ASJC Scopus subject areas

  • Software
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'ASVtorch toolkit: Speaker verification with deep neural networks'. Together they form a unique fingerprint.

Cite this