A fast instance segmentation with one-stage multi-task deep neural network for autonomous driving

被引:18
作者
Tseng, Kuo-Kun [1 ]
Lin, Jiangrui [1 ]
Chen, Chien-Ming [2 ]
Hassan, Mohammad Mehedi [3 ]
机构
[1] Harbin Inst Technol Shenzhen, Shenzhen, Peoples R China
[2] Shandong Univ Sci & Technol, Qingdao, Shandong, Peoples R China
[3] King Saud Univ, Coll Comp & Informat Sci, Informat Syst Dept, Res Chair Pervas & Mobile Comp, Riyadh 11543, Saudi Arabia
关键词
Instance segmentation; Autonomous driving; Multi-task neural network; Object detection; Image segmentation;
D O I
10.1016/j.compeleceng.2021.107194
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An accurate real-time instance segmentation, which can perform both object detection and semantic segmentation at the same time with a multi-task neural network, is important for autonomous driving. This paper proposes a fast one-stage multi-task neural network for instance segmentation, which can meet the requirements of real-time processing with sufficient accuracy and that is more desirable for self-driving applications. With a one-stage strategy, it can perform object detection and segmentation concurrently. This paper conducts the related experiments with two public datasets. The cross-validation was carried out with set of variables to determine the optimal combination of each model and compared with the mainstream instance segmentation algorithms. According to our experiment, the proposed algorithm has five times the performance compared to the previous algorithms, which can meet the real-time requirement for autonomous driving applications.
引用
收藏
页数:10
相关论文
共 33 条
[1]  
[Anonymous], 2017, INT C COMP VIS
[2]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[6]  
Dai JF, 2016, ADV NEUR IN, V29
[7]  
Fu C-Y, 2019, Retinamask: Learning to predict masks improves state-of-the-art single-shot detection for free, P190103353
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778