Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes

被引：0

作者：

Ye, Baofeng

Xue, Renzheng ^{[1
]}

机构：

[1] Qiqihar Univ, Sch Comp & Control Engn, Qiqihar 161006, Peoples R China

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Semantics; Attention mechanisms; Feature extraction; Semantic segmentation; Real-time systems; Image edge detection; Convolution; Laplace equations; Computational efficiency; Complexity theory; attention; real-time; deep learning;

D O I：

10.1109/ACCESS.2024.3521958

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Semantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However, existing models either improve accuracy at the cost of high computational expense and long inference times or enhance inference speed by sacrificing resolution and multi-level detailed information, resulting in a significant drop in accuracy. In this paper, we propose a new architecture based on a bilateral segmentation network, called DADNet. We have designed a new attention mechanism to optimize feature maps and a feature fusion module with an attention mechanism to efficiently merge different features, effectively expanding the receptive field. Our method demonstrates an excellent balance between segmentation accuracy and speed on the Cityscapes and CamVid datasets. Specifically, DADNet achieves a mIoU of 78.2% at 90.5 FPS on the Cityscapes validation set using a single 2080Ti GPU. On the CamVid test set, it achieves a mIoU of 75.8% at 136.7 FPS. Our approach outperforms most state-of-the-art models while requiring less computational power.

引用

页码：588 / 595

页数：8

共 41 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[2] Segmentation and Recognition Using Structure from Motion Point Clouds
Brostow, Gabriel J.
Shotton, Jamie
Fauqueur, Julien
Cipolla, Roberto
[J]. COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 44 - +
[3] GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Cao, Yue
Xu, Jiarui
Lin, Stephen
Wei, Fangyun
Hu, Han
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1971 - 1980
[4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[6] Cheng B, 2021, ADV NEUR IN, V34
[7] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[8] Rethinking BiSeNet For Real-time Semantic Segmentation
Fan, Mingyuan
Lai, Shenqi
Huang, Junshi
Wei, Xiaoming
Chai, Zhenhua
Luo, Junfeng
Wei, Xiaolin
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9711 - 9720
[9] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[10] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778

← 1 2 3 4 5 →