Toward Ultralow-Power Neuromorphic Speech Enhancement With Spiking-FullSubNet

Xiang Hao, Chenxiang Ma, Qu Yang, Jibin Wu (Corresponding Author), Kay Chen Tan

Research output: Journal article publicationJournal articleAcademic researchpeer-review

Abstract

Speech enhancement (SE) is critical for improving speech intelligibility and quality in various audio devices. In recent years, deep learning-based methods have significantly improved SE performance, but they often come with a high computational cost, which is prohibitive for a large number of edge devices, such as headsets and hearing aids. This work proposes an ultralow-power SE system based on the brain-inspired spiking neural network (SNN) called Spiking-FullSubNet. Spiking-FullSubNet follows a full-band and subband fusioned approach to effectively capture both global and local spectral information. To enhance the efficiency of computationally expensive subband modeling, we introduce a frequency partitioning method inspired by the sensitivity profile of the human peripheral auditory system. Furthermore, we introduce a novel spiking neuron model that can dynamically control the input information integration and forgetting, enhancing the multiscale temporal processing capability of SNN, which is critical for speech denoising. Experiments conducted on the recent Intel Neuromorphic Deep Noise Suppression (N-DNS) Challenge dataset show that the Spiking-FullSubNet surpasses state-of-the-art (SOTA) methods by large margins in terms of both speech quality and energy efficiency metrics. Notably, our system won the championship of the Intel N-DNS Challenge (algorithmic track), opening up a myriad of opportunities for ultralow-power SE at the edge.

Original languageEnglish
Pages (from-to)1-15
JournalIEEE Transactions on Neural Networks and Learning Systems
DOIs
Publication statusPublished - May 2025

Keywords

  • Neuromorphic computing
  • neuromorphic speech processing
  • speech enhancement (SE)
  • spiking neural network (SNN)

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Toward Ultralow-Power Neuromorphic Speech Enhancement With Spiking-FullSubNet'. Together they form a unique fingerprint.

Cite this