Abstract
Binaural microphones, referring to two microphones with artificial human-shaped ears, are pervasively used in humanoid robots and hearing aids improving sound quality. In many applications, it is crucial for such robots to interact with humans by finding the voice direction. However, sound source localization with binaural microphones remains challenging, especially in multi-source scenarios. Prior works utilize microphone arrays to deal with the multi-source localization problem. Extra arrays yet incur higher deployment costs and take up more space. However, human brains have evolved to locate multiple sound sources with only two ears. Inspired by this fact, we propose DeepEar, a binaural microphone-based localization system that can locate multiple sounds. To this end, we develop a neural network to mimic the acoustic signal processing pipeline of the human auditory system. Different from hand-crafted features used in prior works, DeepEar can automatically extract useful features for localization. More importantly, the trained neural networks can be extended and adapted to new environments with a minimum amount of extra training data. Experiment results show that DeepEar can substantially outperform the state-of-the-art deep learning approach, with a sound detection accuracy of 93.3% and an azimuth estimation error of 7.4 degrees in multisource scenarios.
| Original language | English |
|---|---|
| Title of host publication | INFOCOM 2022 - IEEE Conference on Computer Communications |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 960-969 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781665458221 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 41st IEEE Conference on Computer Communications, INFOCOM 2022 - Virtual, Online, United Kingdom Duration: 2 May 2022 → 5 May 2022 |
Publication series
| Name | Proceedings - IEEE INFOCOM |
|---|---|
| Volume | 2022-May |
| ISSN (Print) | 0743-166X |
Conference
| Conference | 41st IEEE Conference on Computer Communications, INFOCOM 2022 |
|---|---|
| Country/Territory | United Kingdom |
| City | Virtual, Online |
| Period | 2/05/22 → 5/05/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Binaural localization
- Earable computing
- Multi-source localization
ASJC Scopus subject areas
- General Computer Science
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'DeepEar: Sound Localization with Binaural Microphones'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver