MultiNet plus plus : Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning

被引:39
作者
Chennupati, Sumanth [1 ,3 ]
Sistu, Ganesh [2 ]
Yogamani, Senthil [2 ]
Rawashdeh, Samir A. [3 ]
机构
[1] Valeo North Amer, Seymour, IN 47274 USA
[2] Valeo Vis Syst, Seymour, IN USA
[3] Univ Michigan, Dearborn, MI 48128 USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019) | 2019年
关键词
NEURAL-NETWORKS;
D O I
10.1109/CVPRW.2019.00159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-task learning is commonly used in autonomous driving for solving various visual perception tasks. It offers significant benefits in terms of both performance and computational complexity. Current work on multi-task learning networks focus on processing a single input image and there is no known implementation of multi-task learning handling a sequence of images. In this work, we propose a multi-stream multi-task network to take advantage of using feature representations from preceding frames in a video sequence for joint learning of segmentation, depth, and motion. The weights of the current and previous encoder are shared so that features computed in the previous frame can be leveraged without additional computation. In addition, we propose to use the geometric mean of task losses as a better alternative to the weighted average of task losses. The proposed loss function facilitates better handling of the difference in convergence rates of different tasks. Experimental results on KITTI, Cityscapes and SYNTHIA datasets demonstrate that the proposed strategies outperform various existing multi-task learning solutions.
引用
收藏
页码:1200 / 1210
页数:11
相关论文
共 60 条
[1]  
[Anonymous], 2019, ARXIV190203589
[2]  
[Anonymous], 2017, P IEEE 20 INT C INT
[3]  
Bilen H., 2017, ARXIV170107275
[4]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[5]   Numerical Simulation on Solidification Structure of 30Cr2Ni4MoV Steel Under Different Temperature Gradient Using Procast Software [J].
Chen, Zheng ;
Zhang, Jieyu .
CFD MODELING AND SIMULATION IN MATERIALS PROCESSING 2018, 2018, :3-8
[6]   AuxNet: Auxiliary Tasks Enhanced Semantic Segmentation for Automated Driving [J].
Chennupati, Sumanth ;
Sistu, Ganesh ;
Yogamani, Senthil ;
Rawashdeh, Samir .
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, :645-652
[7]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[8]   Multiple-gradient descent algorithm (MGDA) for multiobjective optimization [J].
Desideri, Jean-Antoine .
COMPTES RENDUS MATHEMATIQUE, 2012, 350 (5-6) :313-318
[9]  
Devin Coline, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P2169, DOI 10.1109/ICRA.2017.7989250
[10]  
Dewangan P., 2018, CORR