DuFNet: Dual Flow Network of Real-Time Semantic Segmentation for Unmanned Driving Application of Internet of Things

被引：3

作者：

Duan, Tao ^{[1
]}

Liu, Yue ^{[1
]}

Li, Jingze ^{[1
]}

Lian, Zhichao ^{[2
]}

Li, Qianmu ^{[2
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Cyberspace Secur, Wuxi 320200, Peoples R China

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2023年 / 136卷 / 01期

关键词：

Real-time semantic segmentation; convolutional neural network; feature fusion; unmanned driving; fringe information flow; AGGREGATION;

D O I：

10.32604/cmes.2023.024742

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The application of unmanned driving in the Internet of Things is one of the concrete manifestations of the application of artificial intelligence technology. Image semantic segmentation can help the unmanned driving system by achieving road accessibility analysis. Semantic segmentation is also a challenging technology for image understanding and scene parsing. We focused on the challenging task of real-time semantic segmentation in this paper. In this paper, we proposed a novel fast architecture for real-time semantic segmentation named DuFNet. Starting from the existing work of Bilateral Segmentation Network (BiSeNet), DuFNet proposes a novel Semantic Information Flow (SIF) structure for context information and a novel Fringe Information Flow (FIF) structure for spatial information. We also proposed two kinds of SIF with cascaded and paralleled structures, respectively. The SIF encodes the input stage by stage in the ResNet18 backbone and provides context information for the feature fusion module. Features from previous stages usually contain rich low-level details but high-level semantics for later stages. The multiple convolutions embed in Parallel SIF aggregate the corresponding features among different stages and generate a powerful global context representation with less computational cost. The FIF consists of a pooling layer and an upsampling operator followed by projection convolution layer. The concise component provides more spatial details for the network. Compared with BiSeNet, our work achieved faster speed and comparable performance with 72.34% mIoU accuracy and 78 FPS on Cityscapes Dataset based on the ResNet18 backbone.

引用

页码：223 / 239

页数：17

共 56 条

[1]

[Anonymous], 2016, NIPS WORKSH BARC SPA

[2]

Arani Elahe, 2021, P IEEE CVF WINT C AP, V3009

[3] New Enhanced Authentication Protocol for Internet of Things [J].

Azrour, Mourade ;

Mabrouki, Jamal ;

Guezzaz, Azedine ;

Farhaoui, Yousef .

BIG DATA MINING AND ANALYTICS, 2021, 4 (01) :1-9

[4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[5] Automatic scene parsing for generic object descriptions using shape primitives [J].

Buettner, Stefan ;

Marton, Zoltan-Csaba ;

Hertkorn, Katharina .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 76 :93-112

[6]

Catlett Charlie, 2020, Journal of Social Computing, V1, P14, DOI 10.23919/JSC.2020.0003

[7]

Chen LC, 2017, Arxiv, DOI arXiv:1706.05587

[8] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[9] Attention to Scale: Scale-aware Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Yang, Yi ;

Wang, Jiang ;

Xu, Wei ;

Yuille, Alan L. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649

[10]

Chen X, 2019, Arxiv, DOI [arXiv:1901.03814, 10.48550/arXiv.1901.03814]

← 1 2 3 4 5 6 →