OmniDet: Surround View Cameras Based Multi-Task Visual Perception Network for Autonomous Driving

被引:51
|
作者
Kumar, Varun Ravi [1 ,2 ]
Yogamani, Senthil [3 ]
Rashed, Hazem [4 ]
Sitsu, Ganesh [3 ]
Witt, Christian [1 ]
Leang, Isabelle [5 ]
Milz, Stefan [2 ]
Maeder, Patrick [2 ]
机构
[1] Valeo, D-96317 Kronach, Germany
[2] TU Ilmenau, D-98693 Ilmenau, Germany
[3] Valeo, Galway, Ireland
[4] Valeo, Giza, Egypt
[5] Valeo, Chatellerault, France
关键词
Autonomous systems; autonomous vehicles; computer vision; image reconstruction and distance learning;
D O I
10.1109/LRA.2021.3062324
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Surround View fisheye cameras are commonly deployed in automated driving for 360 degrees near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentation, motion segmentation, object detection, and lens soiling detection. We demonstrate that the jointly trained model performs better than the respective single task versions. Our multi-task model has a shared encoder providing a significant computational advantage and has synergized decoders where tasks support each other. We propose a novel camera geometry based adaptation mechanism to encode the fisheye distortion model both at training and inference. This was crucial to enable training on the WoodScape dataset, comprised of data from different parts of the world collected by 12 different cameras mounted on three different cars with different intrinsics and viewpoints. Given that bounding boxes is not a good representation for distorted fisheye images, we also extend object detection to use a polygon with non-uniformly sampled vertices. We additionally evaluate our model on standard automotive datasets, namely KITTI and Cityscapes. We obtain the state-of-the-art results on KITTI for depth estimation and pose estimation tasks and competitive performance on the other tasks. We perform extensive ablation studies on various architecture choices and task weighting methodologies. A short video at https://youtu.be/xbSjZ5OfPes provides qualitative results.
引用
收藏
页码:2830 / 2837
页数:8
相关论文
共 17 条
  • [1] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
    Yan, Fuwu
    Wang, Kewei
    Zou, Bin
    Tang, Luqi
    Li, Wenbo
    Lv, Chen
    IEEE ACCESS, 2020, 8 : 86753 - 86764
  • [2] TriLiteNet: Lightweight Model for Multi-Task Visual Perception
    Che, Quang-Huy
    Lam, Duc-Khai
    IEEE ACCESS, 2025, 13 : 50152 - 50166
  • [3] Surround-View Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-Insensitive Multi-Task Framework
    Wu, Zizhang
    Gan, Yuanzhu
    Li, Xianzhi
    Wu, Yunzhe
    Wang, Xiaoquan
    Xu, Tianhao
    Wang, Fan
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2037 - 2048
  • [4] Scalable Parallel Task Scheduling for Autonomous Driving Using Multi-Task Deep Reinforcement Learning
    Qi, Qi
    Zhang, Lingxin
    Wang, Jingyu
    Sun, Haifeng
    Zhuang, Zirui
    Liao, Jianxin
    Yu, F. Richard
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13861 - 13874
  • [5] Multi-Task Deep Relative Attribute Learning for Visual Urban Perception
    Min, Weiqing
    Mei, Shuhuan
    Liu, Linhu
    Wang, Yi
    Jiang, Shuqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 657 - 669
  • [6] Unsupervised Reinforcement Learning for Multi-Task Autonomous Driving: Expanding Skills and Cultivating Curiosity
    Ma, Zhenyu
    Liu, Xinyi
    Huang, Yanjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14209 - 14219
  • [7] A Camera-Based End-to-End Autonomous Driving Framework Combined With Meta-Based Multi-Task Optimization
    Rao, Zhongyu
    Cai, Yingfeng
    Wang, Hai
    Chen, Long
    Li, Yicheng
    Liu, Qingchao
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2025, 11 (01): : 4443 - 4455
  • [8] Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning
    Guo, Jinghua
    Wang, Jingyao
    Wang, Huinian
    Xiao, Baoping
    He, Zhifei
    Li, Lubin
    SENSORS, 2023, 23 (13)
  • [9] Strawberry Verticillium Wilt Detection Network Based on Multi-Task Learning and Attention
    Nie, Xuan
    Wang, Luyao
    Ding, Haoxuan
    Xu, Min
    IEEE ACCESS, 2019, 7 : 170003 - 170011
  • [10] WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces
    Yao, Shanliang
    Guan, Runwei
    Wu, Zhaodong
    Ni, Yi
    Huang, Zile
    Liu, Ryan Wen
    Yue, Yong
    Ding, Weiping
    Lim, Eng Gee
    Seo, Hyungjoon
    Man, Ka Lok
    Ma, Jieming
    Zhu, Xiaohui
    Yue, Yutao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16584 - 16598