PDBNet: Parallel Dual Branch Network for Real-time Semantic Segmentation

被引:12
作者
Dai, Yingpeng [1 ]
Wang, Junzheng [1 ]
Li, Jiehao [1 ]
Li, Jing [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, State Key Lab Intelligent Control & Decis Complex, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Lightweight network; neural network; real-time semantic segmentation; street scene; WHEEL-LEGGED ROBOT;
D O I
10.1007/s12555-021-0430-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To make a trade-off between accuracy and inference speed in real-time applications on the unmanned mobile platform, a novel neural network, named Parallel Dual Branch Network (PDBNet), is proposed. Firstly, a multi-scale module, namely Parallel Dual Branch (PDB), is designed to extract complete information. PDB module consists of two parallel branches to remove detailed low-level information and high-level semantic information while maintaining few parameters. Then, based on the PDB module, PDBNet, a small-scale and shallow structure, is designed for semantic segmentation. A multi-scale module tends to extract abundant information and segment the object out from the image well. The small-scale and shallow structure tends to accelerate the inference speed. So PDBNet architecture is designed to be effective both in terms of accuracy and inference speed. PDBNet adopts three downsamplings to obtain feature maps with high spatial resolution and uses PDB modules with different dilation rates to extract multi-scale features and enlarge the receptive field in the last several layers. Finally, experiments on Camvid dataset and Cityscapes dataset, we respectively get 67.7% and 69.5% Mean Intersection over Union (MIoU) with only 1.82 million parameters and quicker speed on a single GTX 1070Ti card.
引用
收藏
页码:2702 / 2711
页数:10
相关论文
共 45 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Segmentation and Recognition Using Structure from Motion Point Clouds [J].
Brostow, Gabriel J. ;
Shotton, Jamie ;
Fauqueur, Julien ;
Cipolla, Roberto .
COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 :44-+
[4]   Semantic object classes in video: A high-definition ground truth database [J].
Brostow, Gabriel J. ;
Fauqueur, Julien ;
Cipolla, Roberto .
PATTERN RECOGNITION LETTERS, 2009, 30 (02) :88-97
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]   Socially Trusted Collaborative Edge Computing in Ultra Dense Networks [J].
Chen, Lixing ;
Xu, Jie .
SEC 2017: 2017 THE SECOND ACM/IEEE SYMPOSIUM ON EDGE COMPUTING (SEC'17), 2017,
[7]   Towards Hybrid Gait Obstacle Avoidance for a Six Wheel-Legged Robot with Payload Transportation [J].
Chen, Zhihua ;
Li, Jiehao ;
Wang, Junzheng ;
Wang, Shoukun ;
Zhao, Jiangbo ;
Li, Jing .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)
[8]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[9]   MDRNet: a lightweight network for real-time semantic segmentation in street scenes [J].
Dai, Yingpeng ;
Wang, Junzheng ;
Li, Jiehao ;
Li, Jing .
ASSEMBLY AUTOMATION, 2021, 41 (06) :725-733
[10]   Semantic Correlation Promoted Shape-Variant Context for Segmentation [J].
Ding, Henghui ;
Jiang, Xudong ;
Shuai, Bing ;
Liu, Ai Qun ;
Wang, Gang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8877-8886