CFEINet: Cross-fusion and feature enhancement interaction network for RGB-D semantic segmentation

被引:0
作者
Ge, Bin [1 ]
Lu, Yiming [1 ]
Xia, Chenxing [1 ,2 ]
Zhu, Xu [1 ]
Zhang, Mengge [1 ]
Gao, Mengya [1 ]
Chen, Ningjie [1 ]
机构
[1] Anhui Univ Sci & Technol, Coll Comp Sci & Engn, Huainan 232001, Peoples R China
[2] Anhui Univ Sci & Technol, Affiliated Hosp 1, Huainan Peoples Hosp 1, Huainan 232001, Peoples R China
关键词
Cross fusion; Cross-modal; Deep enhancement; Feature interaction; Semantic segmentation;
D O I
10.1016/j.dsp.2025.105043
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Currently, significant progress has been made in RGB-D semantic segmentation research. However, low- quality depth images and challenges in cross-modal feature interactions persist as two significant issues. Therefore, this paper proposes a CFEINet: cross-fusion and feature enhancement interaction network for RGBD semantic segmentation. CFEINet comprises three main components: the Two-Branch Asymmetric Feature Enhancement Module (TAEM), the Cross-Modal Feature Interaction Refinement Module (CFIM), and the Information Interaction Fusion Extraction Module (IIFM). TAEM employs a two-branch asymmetric enhancement technique to mitigate the impact of low-quality depth images by enhancing both depth and RGB features through boundary adaptation and new channel focus, respectively. CFIM emphasizes the consistency of features across different modalities, facilitating interaction between RGB and depth features to improve their quality. IIFM takes advantage of synchronising different modalities and using global-local information to compensate for inter- modal differences, thus enhancing target feature capture and improving segmentation performance. Extensive experiments conducted on the NYU Depth V2 and SUN-RGBD datasets demonstrate the superior performance of the proposed model compared to state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 55 条
[1]  
Bai LZ, 2022, Arxiv, DOI arXiv:2210.06747
[2]   ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation [J].
Cao, Jinming ;
Leng, Hanchao ;
Lischinski, Dani ;
Cohen-Or, Danny ;
Tu, Changhe ;
Li, Yangyan .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :7068-7077
[3]   RGBxD: Learning depth-weighted RGB patches for RGB-D indoor semantic segmentation [J].
Cao, Jinming ;
Leng, Hanchao ;
Cohen-Or, Daniel ;
Lischinski, Dani ;
Chen, Ying ;
Tu, Changhe ;
Li, Yangyan .
NEUROCOMPUTING, 2021, 462 :568-580
[4]   Depth-Quality-Aware Salient Object Detection [J].
Chen, Chenglizhao ;
Wei, Jipeng ;
Peng, Chong ;
Qin, Hong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2350-2363
[5]   EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation [J].
Chen, Jianlin ;
Li, Gongyang ;
Zhang, Zhijiang ;
Zeng, Dan .
IMAGE AND VISION COMPUTING, 2024, 142
[6]   FAFNet: Fully aligned fusion network for RGBD semantic segmentation based on hierarchical semantic flows [J].
Chen, Jiazhou ;
Zhan, Yangfan ;
Xu, Yanghui ;
Pan, Xiang .
IET IMAGE PROCESSING, 2023, 17 (01) :32-41
[7]   CaMap: Camera-based Map Manipulation on Mobile Devices [J].
Chen, Liang ;
Chen, Dongyi .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[8]   Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation [J].
Chen, Lin-Zhuo ;
Lin, Zheng ;
Wang, Ziqin ;
Yang, Yong-Liang ;
Cheng, Ming-Ming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2313-2324
[9]   Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation [J].
Chen, Xiaokang ;
Lin, Kwan-Yee ;
Wang, Jingbo ;
Wu, Wayne ;
Qian, Chen ;
Li, Hongsheng ;
Zeng, Gang .
COMPUTER VISION - ECCV 2020, PT XI, 2020, 12356 :561-577
[10]  
Doan G., 2024, A New cnn-Based Semantic Object Segmentation for Au- tonomous Vehicles in Urban Traffic Scenes