Lightweight multi-scale feature fusion with attention guidance for passive non-line-of-sight imaging

被引:0
作者
Chen, Pengyun [1 ]
Cui, Shuang [1 ]
Cao, Ning [1 ]
Zhang, Wenhao [1 ]
Wang, Pengfei [1 ]
Jin, Shaohui [1 ]
Xu, Mingliang [1 ,2 ]
机构
[1] Zhengzhou Univ, Sch Comp Sci & Artificial Intelligence, 100 Sci Ave, Zhengzhou 450001, Henan, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Swarm Syst, 100 Sci Ave, Zhengzhou 450001, Henan, Peoples R China
关键词
Non-line-of-sight imaging; Lightweighting; Multi-scale feature fusion; Attention-based network;
D O I
10.1007/s00371-025-03837-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Passive non-line-of-sight (NLOS) imaging is an emerging technology to enhance the perception of hidden objects. Existing methods often suffer from issues of imaging quality and model complexity. To address these challenges, we propose a lightweight multi-scale NLOS imaging network (LMS-NLOS). Our network combines a multi-scale encoder-decoder structure with a secondary detail-enhanced transformer module to capture fine-grained details. An asymmetric cross-fusion module is introduced to fuse shallow and deep features, reducing noise and redundancy. Additionally, a multi-scale loss function (MSLoss) is designed to enhance contour features and guide model training. To ensure model efficiency, the LMS-NLOS employs a perception-enhanced feed-forward network with a spatial shift operation (PEFNs). Experimental results on the VIS dataset collected by our laboratory and the public NLOS-Passive dataset demonstrate the effectiveness of our proposed methods. Compared with existing approaches, our multi-scale NLOS imaging network (MS-NLOS) achieves higher imaging quality, with a PSNR of 24.80 dB and an SSIM of 0.9234 on the VIS dataset. Meanwhile, LMS-NLOS offers a more compact model, reducing the model size by nearly half while maintaining good imaging performance. It achieves a PSNR of 24.26 dB and an SSIM of 0.8697 on the VIS dataset. Our work highlights the potential of attention-guided multi-scale feature fusion for lightweight passive NLOS imaging. The code is available at https://github.com/CS-wpf/LMS-NLOS.
引用
收藏
页数:14
相关论文
共 46 条
[1]   Hyper-NLOS: hyperspectral passive non-line-of-sight imaging [J].
Chen, Mingyang ;
Liu, Hao ;
Jin, Shaohui ;
Liu, Mengge ;
Xu, Ziqin ;
Jiang, Xiaoheng ;
Xu, Ming Liang .
OPTICS EXPRESS, 2024, 32 (20) :34807-34824
[2]   Steady-state Non-Line-of-Sight Imaging [J].
Chen, Wenzheng ;
Daneau, Simon ;
Mannan, Fahim ;
Heide, Felix .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3783-6792
[3]   DEA-Net: Single Image Dehazing Based on Detail-Enhanced Convolution and Content-Guided Attention [J].
Chen, Zixuan ;
He, Zewei ;
Lu, Zhe-Ming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1002-1015
[4]   Rethinking Coarse-to-Fine Approach in Single Image Deblurring [J].
Cho, Sung-Jin ;
Ji, Seo-Won ;
Hong, Jun-Pyo ;
Jung, Seung-Won ;
Ko, Sung-Jea .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :4621-4630
[5]  
Dey A., 2024, ECTI Transact Comput Inform Technol (ECTI-CIT), V18, P329
[6]   Passive Non-Line-of-Sight Imaging Using Optimal Transport [J].
Geng, Ruixu ;
Hu, Yang ;
Lu, Zhi ;
Yu, Cong ;
Li, Houqiang ;
Zhang, Hengyu ;
Chen, Yan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :110-124
[7]  
Hendrycks D, 2020, Arxiv, DOI arXiv:1606.08415
[8]   Non-Line-of-Sight Imaging and Vibrometry Using a Comb-Calibrated Coherent Sensor [J].
Huang, Xin ;
Ye, Ruilin ;
Li, Wenwen ;
Zeng, Jian-Wei ;
Lu, Yi-Chen ;
Hu, Huiqin ;
Zhou, Yijun ;
Hou, Lei ;
Li, Zheng-Ping ;
Jiang, Hai-Feng ;
Xue, Xianghui ;
Xu, Feihu ;
Dou, Xiankang ;
Pan, Jian-Wei .
PHYSICAL REVIEW LETTERS, 2024, 132 (23)
[9]   HiFuse: Hierarchical multi-scale feature fusion network for medical image classification [J].
Huo, Xiangzuo ;
Sun, Gang ;
Tian, Shengwei ;
Wang, Yan ;
Yu, Long ;
Long, Jun ;
Zhang, Wendong ;
Li, Aolun .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
[10]   Looking Around the Corner using Transient Imaging [J].
Kirmani, Ahmed ;
Hutchison, Tyler ;
Davis, James ;
Raskar, Ramesh .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :159-166