Real-time semantic segmentation with dual interaction fusion network

Cited by: 0
Authors
Qu, Shenming [1]
Duan, Jiale [1]
Lu, Yongyong [1]
Cui, Can [1]
Xie, Yuan [1]
Affiliations
[1] Henan Univ, Software Coll, Kaifeng, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
real-time semantic segmentation; deep learning; feature fusion; dilated convolution
DOI
10.1117/1.JEI.33.2.023055
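Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract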
Real-time semantic segmentation is critical in fields such as autonomous driving and robotics, which require both accuracy and speed. However, existing real-time segmentation algorithms often sacrifice low-level details to improve inference speed, which lowers segmentation accuracy. To alleviate this problem, we propose a new real-time semantic segmentation model, the dual interaction fusion network (DIFNet). First, we propose a lightweight dual decoding fusion structure that increases the focus on low-level feature information and extracts richer edge details, while reducing computational overhead by decreasing the number of feature-map channels during fusion. In addition, we construct a cross attention module that fuses high-level and low-level features by cross-weighting them through an attention mechanism, which increases the interaction between features and effectively extracts features at different levels. Finally, we design a comprehensive perception module that introduces dilated convolution to expand the model's receptive field, enabling it to better capture global features. Our network was validated on the Cityscapes and CamVid datasets. Specifically, on a single NVIDIA GTX 2080 Ti, DIFNet achieves 77.6% mIoU at 83.9 frames per second (FPS) for 1536×768 inputs on the Cityscapes test set and 77.0% mIoU at 135.8 FPS for 960×720 inputs on CamVid. © 2024 SPIE and IS&T
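The abstract states that dilated convolution expands the receptive field. As a rough illustration of why, the sketch below computes the receptive field of a stack of convolution layers using the standard recurrence, in which each layer adds (kernel size − 1) × dilation × accumulated stride input pixels. The function name `receptive_field` and the layer configurations are hypothetical and chosen only for illustration; the abstract does not give the kernel sizes or dilation rates used in DIFNet's comprehensive perception module.

```python
def receptive_field(layers):
    """Return the receptive field (in input pixels) of stacked conv layers.

    layers: iterable of (kernel_size, dilation, stride) tuples, in order.
    """
    rf = 1    # before any layer, one output pixel sees one input pixel
    jump = 1  # spacing, in input pixels, between adjacent feature-map pixels
    for kernel_size, dilation, stride in layers:
        # A dilated kernel spans (kernel_size - 1) * dilation + 1 positions
        # on its own input, so it widens the field by (k - 1) * d * jump.
        rf += (kernel_size - 1) * dilation * jump
        jump *= stride
    return rf


# Hypothetical configurations (NOT taken from the paper).
plain = [(3, 1, 1), (3, 1, 1), (3, 1, 1)]    # three ordinary 3x3 convs
dilated = [(3, 1, 1), (3, 2, 1), (3, 4, 1)]  # same kernels, dilation 1, 2, 4

print(receptive_field(plain))    # 7
print(receptive_field(dilated))  # 15
```

Under these assumed settings, the dilated stack covers a 15-pixel-wide field with the same number of weights as the plain stack, which covers only 7 pixels. That is the general trade-off the abstract appeals to when it says dilation helps capture more global context.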
Pages: 14