Real-time semantic segmentation with dual interaction fusion network

被引：0

作者：

Qu, Shenming ^{[1
]}

Duan, Jiale ^{[1
]}

Lu, Yongyong ^{[1
]}

Cui, Can ^{[1
]}

Xie, Yuan ^{[1
]}

机构：

[1] Henan Univ, Software Coll, Kaifeng, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2024年 / 33卷 / 02期

基金：

中国国家自然科学基金;

关键词：

real-time semantic segmentation; deep learning; feature fusion; dilated convolution;

D O I：

10.1117/1.JEI.33.2.023055

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Real-time semantic segmentation is critical in industries, such as autonomous driving and robotics, requiring both accuracy and speed. However, existing real-time segmentation algorithms often sacrifice low-level details to improve inference speed, leading to decreased segmentation accuracy. Therefore, we propose a new real-time semantic segmentation model dual interaction fusion network (DIFNet) to alleviate this problem. First, we propose a lightweight dual decoding fusion structure, which increases the focus on the low-level feature information and can extract richer edge details, while the structure reduces the computational overhead by decreasing the number of channels of the feature map during fusion. In addition, we construct a cross attention module to cross-weight fusion of high-level and low-level features through attention mechanism, which increases the interaction between features and effectively extracts features at different levels. Finally, we design a comprehensive perception module that introduces dilated convolution to expand the model's receptive field, enabling it to better capture global features. Our network was validated on the Cityscapes and CamVid datasets. Specifically, on a single Nvidia GTX 2080 Ti, DIFNet achieves 77.6% mIoU at 83.9 frames per second (FPS) for 1536x768 inputs on Cityscapes test set and 77.0% mIoU at 135.8 FPS for 960x720 inputs on CamVid. (c) 2024 SPIE and IS&T

引用

页数：14

共 50 条

[41] ResLMFFNet: a real-time semantic segmentation network for precision agriculture
Ulku, Irem
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (04)
[42] Real-Time Semantic Segmentation Network Based on Octave Convolution
Wang Xin
Wu Kaijun
LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (08)
[43] Lightweight Asymmetric Dilation Network for Real-Time Semantic Segmentation
Hu, Xuegang
Gong, Yu
IEEE ACCESS, 2021, 9 : 55630 - 55643
[44] Contextual Attention Refinement Network for Real-Time Semantic Segmentation
Hao, Shijie
Zhou, Yuan
Zhang, Youming
Guo, Yanrong
IEEE ACCESS, 2020, 8 (08): : 55230 - 55240
[45] A lightweight network with attention decoder for real-time semantic segmentation
Wang, Kang
Yang, Jinfu
Yuan, Shuai
Li, Mingai
VISUAL COMPUTER, 2022, 38 (07) : 2329 - 2339
[46] A lightweight network with attention decoder for real-time semantic segmentation
Kang Wang
Jinfu Yang
Shuai Yuan
Mingai Li
The Visual Computer, 2022, 38 : 2329 - 2339
[47] Multiple Resolutions Detail Enhancement Network for Real-Time Image Semantic Segmentation
Gu J.
Sun X.
Feng J.
Yang S.
Liu F.
Jiao L.
IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 3393 - 3407
[48] Real-Time Semantic Segmentation Algorithm Based on Feature Fusion Technology
Cai Yu
Huang Xuegong
Zhian, Zhang
Zhu Xinnian
Ma Xiang
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
[49] A Lightweight Network with Multi-Scale Information Interaction Attention for Real-Time Semantic Segmentation
Hu, Xuegang
Xu, Shuhan
FIFTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2022, 2023, 12701
[50] LBARNet: Lightweight bilateral asymmetric residual network for real-time semantic segmentation
Hu, Xuegang
Zhou, Baoman
COMPUTERS & GRAPHICS-UK, 2023, 116 : 1 - 12

← 1 2 3 4 5 →