TY - GEN
T1 - Dense Point Prediction: A Simple Baseline for Crowd Counting and Localization
AU - Wang, Yi
AU - Hou, Xinyu
AU - Chau, Lap Pui
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/7
Y1 - 2021/7
N2 - In this paper, we propose a simple yet effective crowd counting and localization network named SCALNet. Unlike most existing works that separate the counting and localization tasks, we consider those tasks as a pixel-wise dense prediction problem and integrate them into an end-To-end framework. Specifically, for crowd counting, we adopt a counting head supervised by the Mean Square Error (MSE) loss. For crowd localization, the key insight is to recognize the keypoint of people, i.e., the center point of heads. We propose a localization head to distinguish dense crowds trained by two loss functions, i.e., Negative-Suppressed Focal (NSF) loss and False-Positive (FP) loss, which balances the positive/negative examples and handles the false-positive predictions. Experiments on the recent and large-scale benchmark, NWPU-Crowd, show that our approach outperforms the state-of-The-Art methods by more than 5% and 10% improvement in crowd localization and counting tasks, respectively. The code is publicly available at https://github.com/WangyiNTU/SCALNet.
AB - In this paper, we propose a simple yet effective crowd counting and localization network named SCALNet. Unlike most existing works that separate the counting and localization tasks, we consider those tasks as a pixel-wise dense prediction problem and integrate them into an end-To-end framework. Specifically, for crowd counting, we adopt a counting head supervised by the Mean Square Error (MSE) loss. For crowd localization, the key insight is to recognize the keypoint of people, i.e., the center point of heads. We propose a localization head to distinguish dense crowds trained by two loss functions, i.e., Negative-Suppressed Focal (NSF) loss and False-Positive (FP) loss, which balances the positive/negative examples and handles the false-positive predictions. Experiments on the recent and large-scale benchmark, NWPU-Crowd, show that our approach outperforms the state-of-The-Art methods by more than 5% and 10% improvement in crowd localization and counting tasks, respectively. The code is publicly available at https://github.com/WangyiNTU/SCALNet.
KW - convolutional neural network (CNN)
KW - Crowd counting
KW - crowd localization
KW - dense prediction
KW - keypoint estimation
UR - http://www.scopus.com/inward/record.url?scp=85113665992&partnerID=8YFLogxK
U2 - 10.1109/ICMEW53276.2021.9455954
DO - 10.1109/ICMEW53276.2021.9455954
M3 - Conference article published in proceeding or book
AN - SCOPUS:85113665992
T3 - 2021 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2021
SP - 1
EP - 6
BT - 2021 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2021
Y2 - 5 July 2021 through 9 July 2021
ER -