TY - JOUR
T1 - MHA-DGCLN: multi-head attention-driven dynamic graph convolutional lightweight network for multi-label image classification of kitchen waste
AU - Liang, Qiaokang
AU - Li, Jintao
AU - Qin, Hai
AU - Liu, Mingfeng
AU - Xiao, Xiao
AU - Zhang, Dongbo
AU - Wang, Yaonan
AU - Zhang, Dan
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024/12
Y1 - 2024/12
N2 - Kitchen waste images encompass a wide range of garbage categories, posing a typical multi-label classification challenge. However, due to the complex background and significant variations in garbage morphology, there is currently limited research on kitchen waste classification. In this paper, we propose a multi-head attention-driven dynamic graph convolution lightweight network for multi-label classification of kitchen waste images. Firstly, we address the issue of large model parameterization in traditional GCN methods by optimizing the backbone network for lightweight model design. Secondly, to overcome performance losses resulting from reduced model parameters, we introduce a multi-head attention mechanism to mitigate feature information loss, enhancing the feature extraction capability of the backbone network in complex scenarios and improving the correlation between graph nodes. Finally, the dynamic graph convolution module is employed to adaptively capture semantic-aware regions, further boosting recognition capabilities. Experiments conducted on our self-constructed multi-label kitchen waste classification dataset MLKW demonstrate that our proposed algorithm achieves a 8.6% and 4.8% improvement in mAP compared to the benchmark GCN-based methods ML-GCN and ADD-GCN, respectively, establishing state-of-the-art performance. Additionally, extensive experiments on two public datasets, MS-COCO and VOC2007, showcase excellent classification results, highlighting the strong generalization ability of our algorithm.
AB - Kitchen waste images encompass a wide range of garbage categories, posing a typical multi-label classification challenge. However, due to the complex background and significant variations in garbage morphology, there is currently limited research on kitchen waste classification. In this paper, we propose a multi-head attention-driven dynamic graph convolution lightweight network for multi-label classification of kitchen waste images. Firstly, we address the issue of large model parameterization in traditional GCN methods by optimizing the backbone network for lightweight model design. Secondly, to overcome performance losses resulting from reduced model parameters, we introduce a multi-head attention mechanism to mitigate feature information loss, enhancing the feature extraction capability of the backbone network in complex scenarios and improving the correlation between graph nodes. Finally, the dynamic graph convolution module is employed to adaptively capture semantic-aware regions, further boosting recognition capabilities. Experiments conducted on our self-constructed multi-label kitchen waste classification dataset MLKW demonstrate that our proposed algorithm achieves a 8.6% and 4.8% improvement in mAP compared to the benchmark GCN-based methods ML-GCN and ADD-GCN, respectively, establishing state-of-the-art performance. Additionally, extensive experiments on two public datasets, MS-COCO and VOC2007, showcase excellent classification results, highlighting the strong generalization ability of our algorithm.
KW - Attention mechanism
KW - Graph convolution network
KW - Kitchen waste
KW - Multi-label classification
UR - http://www.scopus.com/inward/record.url?scp=85207007566&partnerID=8YFLogxK
U2 - 10.1007/s10489-024-05819-x
DO - 10.1007/s10489-024-05819-x
M3 - Journal article
AN - SCOPUS:85207007566
SN - 0924-669X
VL - 54
SP - 13057
EP - 13074
JO - Applied Intelligence
JF - Applied Intelligence
IS - 24
ER -