OmniDet: Surround View Cameras Based Multi-Task Visual Perception Network for Autonomous Driving

被引:51
|
作者
Kumar, Varun Ravi [1 ,2 ]
Yogamani, Senthil [3 ]
Rashed, Hazem [4 ]
Sitsu, Ganesh [3 ]
Witt, Christian [1 ]
Leang, Isabelle [5 ]
Milz, Stefan [2 ]
Maeder, Patrick [2 ]
机构
[1] Valeo, D-96317 Kronach, Germany
[2] TU Ilmenau, D-98693 Ilmenau, Germany
[3] Valeo, Galway, Ireland
[4] Valeo, Giza, Egypt
[5] Valeo, Chatellerault, France
关键词
Autonomous systems; autonomous vehicles; computer vision; image reconstruction and distance learning;
D O I
10.1109/LRA.2021.3062324
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Surround View fisheye cameras are commonly deployed in automated driving for 360 degrees near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentation, motion segmentation, object detection, and lens soiling detection. We demonstrate that the jointly trained model performs better than the respective single task versions. Our multi-task model has a shared encoder providing a significant computational advantage and has synergized decoders where tasks support each other. We propose a novel camera geometry based adaptation mechanism to encode the fisheye distortion model both at training and inference. This was crucial to enable training on the WoodScape dataset, comprised of data from different parts of the world collected by 12 different cameras mounted on three different cars with different intrinsics and viewpoints. Given that bounding boxes is not a good representation for distorted fisheye images, we also extend object detection to use a polygon with non-uniformly sampled vertices. We additionally evaluate our model on standard automotive datasets, namely KITTI and Cityscapes. We obtain the state-of-the-art results on KITTI for depth estimation and pose estimation tasks and competitive performance on the other tasks. We perform extensive ablation studies on various architecture choices and task weighting methodologies. A short video at https://youtu.be/xbSjZ5OfPes provides qualitative results.
引用
收藏
页码:2830 / 2837
页数:8
相关论文
共 50 条
  • [21] Multi-Task Assisted Driving Policy Learning Method for Autonomous Driving
    Luo, Yutao
    Xue, Zhicheng
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (10): : 31 - 40
  • [22] A Decision Control Method for Autonomous Driving Based on Multi-Task Reinforcement Learning
    Cai, Yingfeng
    Yang, Shaoqing
    Wang, Hai
    Teng, Chenglong
    Chen, Long
    IEEE ACCESS, 2021, 9 (09): : 154553 - 154562
  • [23] MultiNet: Multi-Modal Multi-Task Learning for Autonomous Driving
    Chowdhuri, Sauhaarda
    Pankaj, Tushar
    Zipser, Karl
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1496 - 1504
  • [24] Multi-task Visual Perception Method in Dragon Orchards Based on OrchardYOLOP
    Zhao, Wenfeng
    Huang, Yuanjue
    Zhong, Minyue
    Li, Zhenyuan
    Luo, Zitao
    Huang, Jiajun
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (11): : 160 - 170
  • [25] YOLOPX: Anchor-free multi-task learning network for panoptic driving perception
    Zhan, Jiao
    Luo, Yarong
    Guo, Chi
    Wu, Yejun
    Meng, Jiawei
    Liu, Jingnan
    PATTERN RECOGNITION, 2024, 148
  • [26] Multi-task learning for dangerous object detection in autonomous driving
    Chen, Yaran
    Zhao, Dongbin
    Lv, Le
    Zhang, Qichao
    INFORMATION SCIENCES, 2018, 432 : 559 - 571
  • [27] TriLiteNet: Lightweight Model for Multi-Task Visual Perception
    Che, Quang-Huy
    Lam, Duc-Khai
    IEEE ACCESS, 2025, 13 : 50152 - 50166
  • [28] Dynamic Task Weighting Methods for Multi-task Networks in Autonomous Driving Systems
    Leang, Isabelle
    Sistu, Ganesh
    Buerger, Fabian
    Bursuc, Andrei
    Yogamani, Senthil
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [29] Multi-task Network for Panoptic Segmentation in Automated Driving
    Petrovai, Andra
    Nedevschi, Sergiu
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2394 - 2401
  • [30] DRMNet: A Multi-Task Detection Model Based on Image Processing for Autonomous Driving Scenarios
    Zhao, Jiandong
    Wu, Di
    Yu, Zhixin
    Gao, Ziyou
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) : 15341 - 15355