TY - GEN
T1 - Deep Discrete Wavelet Transform Network for Photometric Stereo
AU - Ju, Yakun
AU - Jian, Muwei
AU - Zhang, Cong
AU - Hu, Yeqi
AU - Lam, Kin Man
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/7
Y1 - 2023/7
N2 - Photometric stereo aims to estimate the per-pixel surface normal map of 3D objects via changing the illuminated light directions. Prevalent methods adopt deep neural networks to extract the shading cue features and reconstruct the surface normals. However, previous methods do not consider the frequency of the surface structure, i.e., the complexity of the shape. Simply applying a trained network to all kinds of objects often leads to inter-frequency conflicts and blur in surface normal estimation. This paper presents a discrete wavelet transform-based photometric stereo network (DWTPS-Net) to handle the input photometric stereo images in both the spatial and frequency domains. In DWTPS-Net, we extract shading features from images and also decompose the images using discrete wavelet transform (DWT), which can preserve spatial information naturally, to better extract high-frequency information. We design separate CNN-based feature-extraction modules for the input images and for the different frequency information of the input images via DWT. Ablation studies and experiments on a widely used benchmark dataset show that DWTPS-Net achieves superior performance in surface normal estimation, in terms of mean angular error metric.
AB - Photometric stereo aims to estimate the per-pixel surface normal map of 3D objects via changing the illuminated light directions. Prevalent methods adopt deep neural networks to extract the shading cue features and reconstruct the surface normals. However, previous methods do not consider the frequency of the surface structure, i.e., the complexity of the shape. Simply applying a trained network to all kinds of objects often leads to inter-frequency conflicts and blur in surface normal estimation. This paper presents a discrete wavelet transform-based photometric stereo network (DWTPS-Net) to handle the input photometric stereo images in both the spatial and frequency domains. In DWTPS-Net, we extract shading features from images and also decompose the images using discrete wavelet transform (DWT), which can preserve spatial information naturally, to better extract high-frequency information. We design separate CNN-based feature-extraction modules for the input images and for the different frequency information of the input images via DWT. Ablation studies and experiments on a widely used benchmark dataset show that DWTPS-Net achieves superior performance in surface normal estimation, in terms of mean angular error metric.
UR - http://www.scopus.com/inward/record.url?scp=85165495489&partnerID=8YFLogxK
U2 - 10.1109/DSP58604.2023.10167967
DO - 10.1109/DSP58604.2023.10167967
M3 - Conference article published in proceeding or book
AN - SCOPUS:85165495489
T3 - International Conference on Digital Signal Processing, DSP
BT - 2023 24th International Conference on Digital Signal Processing, DSP 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th International Conference on Digital Signal Processing, DSP 2023
Y2 - 11 June 2023 through 13 June 2023
ER -