A Fast Panoptic Segmentation Network for Self-Driving Scene Understanding

被引:2
作者
Majid, Abdul [1 ]
Kausar, Sumaira [1 ]
Tehsin, Samabia [1 ]
Jameel, Amina [2 ]
机构
[1] Bahiria Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Bahiria Univ, Dept Software Engn, Karachi, Pakistan
来源
COMPUTER SYSTEMS SCIENCE AND ENGINEERING | 2022年 / 43卷 / 01期
关键词
Panoptic segmentation; instance segmentation; semantic segmentation; deep learning; computer vision; scene understanding; autonomous applications; atrous convolution;
D O I
10.32604/csse.2022.022590
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, a gain in popularity and significance of science understanding has been observed due to the high paced progress in computer vision techniques and technologies. The primary focus of computer vision based scene understanding is to label each and every pixel in an image as the category of the object it belongs to. So it is required to combine segmentation and detection in a single framework. Recently many successful computer vision methods has been developed to aid scene understanding for a variety of real world application. Scene understanding systems typically involves detection and segmentation of different natural and manmade things. A lot of research has been performed in recent years, mostly with a focus on things (a well-defined objects that has shape, orientations and size) with a less focus on stuff classes (amorphous regions that are unclear and lack a shape, size or other characteristics Stuff region describes many aspects of scene, like type, situation, environment of scene etc. and hence can be very helpful in scene understanding. Existing methods for scene understanding still have to cover a challenging path to cope up with the challenges of computational time, accuracy and robustness for varying level of scene complexity. A robust scene understanding method has to effectively deal with imbalanced distribution of classes, overlapping objects, fuzzy object boundaries and poorly localized objects. The proposed method presents Panoptic Segmentation on Cityscapes Dataset. Mobilenet-V2 is used as a backbone for feature extraction that is pre-trained on ImageNet. MobileNet-V2 with state-of-art encoder-decoder architecture of DeepLabV3+ with some customization and optimization is employed Atrous convolution along with Spatial Pyramid Pooling are also utilized in the proposed method to make it more accurate and robust. Very promising and encouraging results have been achieved that indicates the potential of the proposed method for robust scene understanding in a fast and reliable way.
引用
收藏
页码:27 / 43
页数:17
相关论文
共 23 条
[1]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[2]   Multilevel Model for Video Object Segmentation Based on Supervision Optimization [J].
Chen, Yadang ;
Hao, Chuanyan ;
Liu, Alex X. ;
Wu, Enhua .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) :1934-1945
[3]   Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation [J].
Cheng, Bowen ;
Collins, Maxwell D. ;
Zhu, Yukun ;
Liu, Ting ;
Huang, Thomas S. ;
Adam, Hartwig ;
Chen, Liang-Chieh .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12472-12482
[4]   Fast Panoptic Segmentation Network [J].
de Geus, Daan ;
Meletis, Panagiotis ;
Dubbelman, Gijs .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1742-1749
[5]  
de Geus D, 2019, IEEE INT VEH SYM, P709, DOI [10.1109/IVS.2019.8813788, 10.1109/ivs.2019.8813788]
[6]  
Dundar A., 2020, PROC IEEECVF C COMPU, P8070
[7]   Real-Time Panoptic Segmentation from Dense Detections [J].
Hou, Rui ;
Li, Jie ;
Bhargava, Arjun ;
Raventos, Allan ;
Guizilini, Vitor ;
Fang, Chao ;
Lynch, Jerome ;
Gaidon, Adrien .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8520-8529
[8]   Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video [J].
Jain, Samvit ;
Wang, Xin ;
Gonzalez, Joseph E. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8858-8867
[9]   Panoptic Feature Pyramid Networks [J].
Kirillov, Alexander ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6392-6401
[10]   Learning Instance Occlusion for Panoptic Segmentation [J].
Lazarow, Justin ;
Lee, Kwonjoon ;
Shi, Kunyu ;
Tu, Zhuowen .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10717-10726