Underwater Surface Normal Reconstruction via Cross-Grained Photometric Stereo Transformer

被引:7
作者
Ju, Yakun [1 ]
Li, Ling [1 ]
Zhong, Xian [2 ]
Rao, Yuan [3 ]
Liu, Yanru [3 ]
Dong, Junyu [3 ]
Kot, Alex C. [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[3] Ocean Univ China, Sch Comp Sci & Technol, Qingdao 266100, Peoples R China
基金
国家重点研发计划;
关键词
3-D reconstruction; cross-grained transformer (CGT); misalignment photometric images; underwater photometric stereo; SHAPE;
D O I
10.1109/JOE.2024.3458110
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Modern ocean research necessitates high-precision 3-D underwater data acquisition. Photometric stereo is a critical technique for recovering high-resolution, dense surface normals of textureless objects, such as the seabed and underwater pipelines. This technique is fundamental for underwater robots engaged in ocean exploration and operational tasks. Traditional underwater photometric stereo methods account for distributed underwater media, such as light scattering. However, the deployment of devices in complex underwater environments (e.g., ocean currents) often results in misalignment and jitter among photometric stereo images. These challenges lead to inaccuracies in matching-based methods, particularly due to the lack of texture and varying illumination conditions. To address these issues, we propose the Cross-Grained Transformer Photometric Stereo (CGT-PS) Network. CGT-PS is designed to directly manage misaligned pixels caused by underwater jitter in an end-to-end manner. The proposed method consists of two main components: the local-grained and global-grained modules. The local-grained module utilizes a Shift operation to adjust pixels within a single-pixel span, effectively mitigating misalignment caused by motion without increasing computational cost. In contrast, the global-grained module performs nonlocal fusion learning, leveraging distant features to enhance the extraction of intricate structural details, cast shadows, and interreflection regions. Ablation studies confirm the efficacy of the proposed modules. Extensive experiments on photometric stereo benchmark data sets and real underwater photometric stereo samples demonstrate that our method achieves superior performance.
引用
收藏
页码:192 / 203
页数:12
相关论文
共 53 条
[1]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[2]   Learning inter- and intraframe representations for non-Lambertian photometric stereo [J].
Cao, Yanlong ;
Ding, Binjie ;
He, Zewei ;
Yang, Jiangxin ;
Chen, Jingxi ;
Cao, Yanpeng ;
Li, Xin .
OPTICS AND LASERS IN ENGINEERING, 2022, 150
[3]   Deep Photometric Stereo for Non-Lambertian Surfaces [J].
Chen, Guanying ;
Han, Kai ;
Shi, Boxin ;
Matsushita, Yasuyuki ;
Wong, Kwan-Yee K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (01) :129-142
[4]   PS-FCN: A Flexible Learning Framework for Photometric Stereo [J].
Chen, Guanying ;
Han, Kai ;
Wong, Kwan-Yee K. .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :3-19
[5]  
Dosovitskiy A., 2021, 9 INT C LEARN REPR I
[6]   Underwater Optical 3-D Reconstruction of Photometric Stereo Considering Light Refraction and Attenuation [J].
Fan, Hao ;
Qi, Lin ;
Chen, Changhao ;
Rao, Yuan ;
Kong, Linghui ;
Dong, Junyu ;
Yu, Hui .
IEEE JOURNAL OF OCEANIC ENGINEERING, 2022, 47 (01) :46-58
[7]   Refractive laser triangulation and photometric stereo in underwater environment [J].
Fan, Hao ;
Qi, Lin ;
Ju, Yakun ;
Dong, Junyu ;
Yu, Hui .
OPTICAL ENGINEERING, 2017, 56 (11)
[8]   Shape and Spatially-Varying BRDFs from Photometric Stereo [J].
Goldman, Dan B. ;
Curless, Brian ;
Hertzmann, Aaron ;
Seitz, Steven M. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (06) :1060-1071
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Example-based photometric stereo: Shape reconstruction with general, varying BRDFs [J].
Hertzmann, A ;
Seitz, SM .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) :1254-1264