Real-time semantic segmentation with dual interaction fusion network

被引:0
|
作者
Qu, Shenming [1 ]
Duan, Jiale [1 ]
Lu, Yongyong [1 ]
Cui, Can [1 ]
Xie, Yuan [1 ]
机构
[1] Henan Univ, Software Coll, Kaifeng, Peoples R China
基金
中国国家自然科学基金;
关键词
real-time semantic segmentation; deep learning; feature fusion; dilated convolution;
D O I
10.1117/1.JEI.33.2.023055
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time semantic segmentation is critical in industries, such as autonomous driving and robotics, requiring both accuracy and speed. However, existing real-time segmentation algorithms often sacrifice low-level details to improve inference speed, leading to decreased segmentation accuracy. Therefore, we propose a new real-time semantic segmentation model dual interaction fusion network (DIFNet) to alleviate this problem. First, we propose a lightweight dual decoding fusion structure, which increases the focus on the low-level feature information and can extract richer edge details, while the structure reduces the computational overhead by decreasing the number of channels of the feature map during fusion. In addition, we construct a cross attention module to cross-weight fusion of high-level and low-level features through attention mechanism, which increases the interaction between features and effectively extracts features at different levels. Finally, we design a comprehensive perception module that introduces dilated convolution to expand the model's receptive field, enabling it to better capture global features. Our network was validated on the Cityscapes and CamVid datasets. Specifically, on a single Nvidia GTX 2080 Ti, DIFNet achieves 77.6% mIoU at 83.9 frames per second (FPS) for 1536x768 inputs on Cityscapes test set and 77.0% mIoU at 135.8 FPS for 960x720 inputs on CamVid. (c) 2024 SPIE and IS&T
引用
收藏
页数:14
相关论文
共 50 条
  • [31] STDBNet: Shared Trunk and Dual-Branch Network for Real-Time Semantic Segmentation
    Ren, Fenglei
    Zhou, Haibo
    Yang, Lu
    Bai, Yiwen
    Xu, Wenxue
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 770 - 774
  • [32] DMANet: Dual-branch multiscale attention network for real-time semantic segmentation
    Dong, Yongsheng
    Mao, Chongchong
    Zheng, Lintao
    Wu, Qingtao
    NEUROCOMPUTING, 2025, 617
  • [33] Bilateral network with rich semantic extractor for real-time semantic segmentation
    Shan Zhao
    Xuan Wu
    Kaiwen Tian
    Yang Yuan
    Complex & Intelligent Systems, 2024, 10 : 1899 - 1916
  • [34] Bilateral network with rich semantic extractor for real-time semantic segmentation
    Zhao, Shan
    Wu, Xuan
    Tian, Kaiwen
    Yuan, Yang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 1899 - 1916
  • [35] PBSNet: pseudo bilateral segmentation network for real-time semantic segmentation
    Luo, Hui-Lan
    Liu, Chun-Yan
    Mahmoodi, Soroosh
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [36] Detail Guided Multilateral Segmentation Network for Real-Time Semantic Segmentation
    Jiang, Qunyan
    Dai, Juying
    Rui, Ting
    Shao, Faming
    Hu, Ruizhe
    Du, Yinan
    Zhang, Heng
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [37] ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation
    Li, Ya
    Li, Ziming
    Liu, Huiwang
    Wang, Qing
    VISUAL COMPUTER, 2025, 41 (03): : 1543 - 1554
  • [38] Dual-inferences mechanism for real-time semantic segmentation
    Toan, Quyen Van
    Kim, Min Young
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 12 - 17
  • [39] Dual-resolution transformer combined with multi-layer separable convolution fusion network for real-time semantic segmentation
    Hu, Kaidi
    Xie, Zongxia
    Hu, Qinghua
    COMPUTERS & GRAPHICS-UK, 2024, 118 : 220 - 232
  • [40] MLFNet: Multi-Level Fusion Network for Real-Time Semantic Segmentation of Autonomous Driving
    Fan, Jiaqi
    Wang, Fei
    Chu, Hongqing
    Hu, Xiao
    Cheng, Yifan
    Gao, Bingzhao
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 756 - 767