TY - JOUR
T1 - DWANet: Focus on Foreground Features for More Accurate Location
AU - Hu, Jiwei
AU - Zheng, Yuxing
AU - Lam, Kin Man
AU - Lou, Ping
N1 - Funding Information:
This work was supported in part by the National Natural Science Foundation of China under Grant 52075404, and in part by the Application Foundation Frontier Special Project of Wuhan Science and Technology Bureau under Grant 2020010601012176.
Publisher Copyright:
© 2013 IEEE.
PY - 2022/3
Y1 - 2022/3
N2 - Object detection can locate objects in an image using bounding boxes, which can facilitate classification and image understanding, resulting in a wide range of applications. Knowing how to mine useful features from images and detect objects of different scales have become the focus for object-detection research. In this paper, considering the importance of foreground features in the process of object detection, a foreground feature extraction module, based on deformable convolution, is proposed, and the attention mechanism is integrated to suppress the interference from the background. To learn effective features, considering that different layers in a convolutional neural network have different contributions, we propose methods to learn the weights for feature fusion. Experiments on the VOC datasets and COCO datasets show that the proposed algorithm can effectively improve the object detection accuracy, which is 12.1% higher than Faster R-CNN, 1.5% higher than RefineDet, and 2.3% higher than the Hierarchical Shot Detector (HSD).
AB - Object detection can locate objects in an image using bounding boxes, which can facilitate classification and image understanding, resulting in a wide range of applications. Knowing how to mine useful features from images and detect objects of different scales have become the focus for object-detection research. In this paper, considering the importance of foreground features in the process of object detection, a foreground feature extraction module, based on deformable convolution, is proposed, and the attention mechanism is integrated to suppress the interference from the background. To learn effective features, considering that different layers in a convolutional neural network have different contributions, we propose methods to learn the weights for feature fusion. Experiments on the VOC datasets and COCO datasets show that the proposed algorithm can effectively improve the object detection accuracy, which is 12.1% higher than Faster R-CNN, 1.5% higher than RefineDet, and 2.3% higher than the Hierarchical Shot Detector (HSD).
KW - feature fusion
KW - foreground features
KW - multi-scale
KW - Object detection
UR - http://www.scopus.com/inward/record.url?scp=85126708658&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2022.3158681
DO - 10.1109/ACCESS.2022.3158681
M3 - Journal article
AN - SCOPUS:85126708658
SN - 2169-3536
VL - 10
SP - 30716
EP - 30729
JO - IEEE Access
JF - IEEE Access
ER -