TY - GEN
T1 - Deep Segment-Attentive Network for Altered-Engine Recognition
AU - Gan, Chong Xin
AU - Mak, Man Wai
AU - Ho, Ivan Wang Hei
AU - Lee, Steven Wing Chi
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2024/2
Y1 - 2024/2
N2 - Altered engine recognition is of significant research and application prospects since it is a sub-task of environmental sound classification and helps detect altered vehicles for law enforcement. In this work, we propose a simple but effective framework based on a sound embedding network with an attention mechanism to identify noise emitted from vehicles with altered engines. Real-world vehicle sound data were collected at a number of streets with different amounts of traffic in Hong Kong. In particular, we developed a proprietary dataset consisting of the environmental and engine sounds through manual segmentation and annotation, and we found that the attention mechanism can help emphasize the informative segments in the frame-level embeddings outputted by the embedding network. We also demonstrate the effectiveness of the proposed attention mechanism on the ESC-50 and UrbanSound8K datasets. The experimental results show that our method achieves remarkable performance on UrbanSound8K and outperforms the baseline on ESC-50 by more than 10%.
AB - Altered engine recognition is of significant research and application prospects since it is a sub-task of environmental sound classification and helps detect altered vehicles for law enforcement. In this work, we propose a simple but effective framework based on a sound embedding network with an attention mechanism to identify noise emitted from vehicles with altered engines. Real-world vehicle sound data were collected at a number of streets with different amounts of traffic in Hong Kong. In particular, we developed a proprietary dataset consisting of the environmental and engine sounds through manual segmentation and annotation, and we found that the attention mechanism can help emphasize the informative segments in the frame-level embeddings outputted by the embedding network. We also demonstrate the effectiveness of the proposed attention mechanism on the ESC-50 and UrbanSound8K datasets. The experimental results show that our method achieves remarkable performance on UrbanSound8K and outperforms the baseline on ESC-50 by more than 10%.
UR - http://www.scopus.com/inward/record.url?scp=85186534362&partnerID=8YFLogxK
U2 - 10.1109/ITSC57777.2023.10422665
DO - 10.1109/ITSC57777.2023.10422665
M3 - Conference article published in proceeding or book
AN - SCOPUS:85186534362
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 669
EP - 674
BT - 2023 IEEE 26th International Conference on Intelligent Transportation Systems, ITSC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 26th IEEE International Conference on Intelligent Transportation Systems, ITSC 2023
Y2 - 24 September 2023 through 28 September 2023
ER -