AuxNet: Auxiliary Tasks Enhanced Semantic Segmentation for Automated Driving

Cited by: 22
Authors
Chennupati, Sumanth [1, 3]
Sistu, Ganesh [2 ]
Yogamani, Senthil [2 ]
Rawashdeh, Samir [3 ]
Affiliations
[1] Valeo Troy, Troy, MI 48083 USA
[2] Valeo Vis Syst, Dublin, Ireland
[3] Univ Michigan, Dearborn, MI 48128 USA
Source
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5 | 2019
Keywords
Semantic Segmentation; Multitask Learning; Auxiliary Tasks; Automated Driving;
DOI
10.5220/0007684106450652
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Decision making in automated driving is highly specific to the environment, and thus semantic segmentation plays a key role in recognizing the objects in the environment around the car. Pixel-level classification, once considered a challenging task, is now becoming mature enough to be productized in a car. However, semantic annotation is time-consuming and quite expensive. Synthetic datasets with domain adaptation techniques have been used to alleviate the lack of large annotated datasets. In this work, we explore an alternative approach of leveraging the annotations of other tasks to improve semantic segmentation. Recently, multi-task learning has become a popular paradigm in automated driving, demonstrating that joint learning of multiple tasks improves the overall performance of each task. Motivated by this, we use auxiliary tasks such as depth estimation to improve the performance of the semantic segmentation task. We propose adaptive task loss weighting techniques to address scale issues in multi-task loss functions, which become more crucial with auxiliary tasks. We experimented on automotive datasets including SYNTHIA and KITTI and obtained 3% and 5% improvement in accuracy, respectively.
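The scale issue mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not describe the weighting scheme itself, so the inverse-magnitude weighting below, the function name `combine_losses`, and the parameter `eps` are all assumptions made for illustration and are not the method proposed in the paper. The sketch only shows how a depth loss that is numerically much larger than a segmentation loss can be rebalanced so that it does not dominate the total.

```python
def combine_losses(seg_loss, depth_loss, eps=1e-8):
    """Scale-balanced sum of two task losses (hypothetical scheme, not the paper's)."""
    losses = [seg_loss, depth_loss]
    # Weight each task by the inverse of its current loss magnitude, so a task
    # whose loss is numerically large does not dominate the combined loss.
    inverse = [1.0 / (abs(loss) + eps) for loss in losses]
    inverse_sum = sum(inverse)
    weights = [value / inverse_sum for value in inverse]  # weights sum to 1
    total = sum(w * loss for w, loss in zip(weights, losses))
    return total, weights


# Example: a small segmentation loss next to a depth loss 100 times larger.
total, weights = combine_losses(0.5, 50.0)
print(weights)  # the segmentation task receives the larger weight
print(total)
```

In a real training loop the weights would typically be treated as constants when gradients are computed, and other schemes (for example learned weights) could be substituted for the assumed one here.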
Pages: 645-652
Number of pages: 8