Abstract
With the high processing power of today's smartphones, it is now possible to turn a smartphone into a personal audio surveillance and monitoring system. Ideally, such a system should detect and classify a variety of sound events 24 hours a day and trigger an emergency phone call or message once a specified sound event (e.g., screaming) occurs. To prolong battery life, it is important to trade off detection accuracy against power consumption. This paper investigates the power consumption of the different stages of a sound-event classification system, including segmentation, feature extraction, and SVM scoring. The performance and power consumption of various acoustic features and SVM kernels are compared. This paper advocates the notion of intrinsic complexity, through which the scoring function of polynomial SVMs can be written in a matrix-vector-multiplication form so that the resulting complexity becomes independent of the number of support vectors. Results show that exploiting this intrinsic complexity can reduce the CPU utilization of polynomial SVMs by a factor of 28 without reducing classification accuracy.
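
The matrix-vector form mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes a degree-2 polynomial kernel K(x, s) = (1 + x·s)^2, uses NumPy, and all function names, array sizes, and the random data are hypothetical. Under that assumption, the sum over support vectors collapses into a single (d+1) x (d+1) matrix, so the scoring cost depends on the feature dimension d rather than on the number of support vectors.

```python
import numpy as np


def score_kernel_form(x, support_vectors, alphas, bias):
    """Conventional scoring: one kernel evaluation per support vector."""
    # Assumed kernel: K(x, s) = (1 + x.s)^2
    k = (1.0 + support_vectors @ x) ** 2
    return float(alphas @ k + bias)


def collapse_to_matrix(support_vectors, alphas):
    """Precompute W = sum_i alpha_i * z_i z_i^T, where z_i = [1, s_i]."""
    n = support_vectors.shape[0]
    z = np.hstack([np.ones((n, 1)), support_vectors])
    return (z * alphas[:, None]).T @ z


def score_matrix_form(x, w, bias):
    """Scoring via one matrix-vector product; no support vectors needed."""
    z = np.concatenate(([1.0], x))
    return float(z @ (w @ z) + bias)


# Hypothetical model: 500 support vectors with 12-dimensional features.
# `alphas` stands for the signed dual coefficients (alpha_i * y_i).
rng = np.random.default_rng(0)
support_vectors = rng.normal(size=(500, 12))
alphas = rng.normal(size=500)
bias = 0.3
x = rng.normal(size=12)

w = collapse_to_matrix(support_vectors, alphas)
slow = score_kernel_form(x, support_vectors, alphas, bias)
fast = score_matrix_form(x, w, bias)
```

Both functions return the same score up to floating-point rounding, because (1 + x·s_i)^2 equals (z·z_i)^2 for the augmented vectors z = [1, x] and z_i = [1, s_i]. The matrix `w` is computed once offline, so only `score_matrix_form` would need to run on the device.
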
Original language | English |
---|---|
Title of host publication | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings |
Pages | 1985-1988 |
Number of pages | 4 |
Publication status | Published - 23 Oct 2012 |
Event | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Kyoto, Japan; Duration: 25 Mar 2012 → 30 Mar 2012 |
Conference
Conference | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 |
---|---|
Country/Territory | Japan |
City | Kyoto |
Period | 25/03/12 → 30/03/12 |
Keywords
- audio surveillance
- kernel-energy tradeoff
- low-power SVM
- smartphones
- sound event classification
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering