OmniDet: Surround View Cameras Based Multi-Task Visual Perception Network for Autonomous Driving

被引：51

作者：

Kumar, Varun Ravi ^{[1
,2
]}

Yogamani, Senthil ^{[3
]}

Rashed, Hazem ^{[4
]}

Sitsu, Ganesh ^{[3
]}

Witt, Christian ^{[1
]}

Leang, Isabelle ^{[5
]}

Milz, Stefan ^{[2
]}

Maeder, Patrick ^{[2
]}

机构：

[1] Valeo, D-96317 Kronach, Germany

[2] TU Ilmenau, D-98693 Ilmenau, Germany

[3] Valeo, Galway, Ireland

[4] Valeo, Giza, Egypt

[5] Valeo, Chatellerault, France

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期

关键词：

Autonomous systems; autonomous vehicles; computer vision; image reconstruction and distance learning;

D O I：

10.1109/LRA.2021.3062324

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Surround View fisheye cameras are commonly deployed in automated driving for 360 degrees near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentation, motion segmentation, object detection, and lens soiling detection. We demonstrate that the jointly trained model performs better than the respective single task versions. Our multi-task model has a shared encoder providing a significant computational advantage and has synergized decoders where tasks support each other. We propose a novel camera geometry based adaptation mechanism to encode the fisheye distortion model both at training and inference. This was crucial to enable training on the WoodScape dataset, comprised of data from different parts of the world collected by 12 different cameras mounted on three different cars with different intrinsics and viewpoints. Given that bounding boxes is not a good representation for distorted fisheye images, we also extend object detection to use a polygon with non-uniformly sampled vertices. We additionally evaluate our model on standard automotive datasets, namely KITTI and Cityscapes. We obtain the state-of-the-art results on KITTI for depth estimation and pose estimation tasks and competitive performance on the other tasks. We perform extensive ablation studies on various architecture choices and task weighting methodologies. A short video at https://youtu.be/xbSjZ5OfPes provides qualitative results.

引用

页码：2830 / 2837

页数：8

共 17 条

[1] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
Yan, Fuwu
Wang, Kewei
Zou, Bin
Tang, Luqi
Li, Wenbo
Lv, Chen
IEEE ACCESS, 2020, 8 : 86753 - 86764
[2] TriLiteNet: Lightweight Model for Multi-Task Visual Perception
Che, Quang-Huy
Lam, Duc-Khai
IEEE ACCESS, 2025, 13 : 50152 - 50166
[3] Surround-View Fisheye BEV-Perception for Valet Parking: Dataset, Baseline and Distortion-Insensitive Multi-Task Framework
Wu, Zizhang
Gan, Yuanzhu
Li, Xianzhi
Wu, Yunzhe
Wang, Xiaoquan
Xu, Tianhao
Wang, Fan
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2037 - 2048
[4] Scalable Parallel Task Scheduling for Autonomous Driving Using Multi-Task Deep Reinforcement Learning
Qi, Qi
Zhang, Lingxin
Wang, Jingyu
Sun, Haifeng
Zhuang, Zirui
Liao, Jianxin
Yu, F. Richard
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13861 - 13874
[5] Multi-Task Deep Relative Attribute Learning for Visual Urban Perception
Min, Weiqing
Mei, Shuhuan
Liu, Linhu
Wang, Yi
Jiang, Shuqiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 657 - 669
[6] Unsupervised Reinforcement Learning for Multi-Task Autonomous Driving: Expanding Skills and Cultivating Curiosity
Ma, Zhenyu
Liu, Xinyi
Huang, Yanjun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14209 - 14219
[7] A Camera-Based End-to-End Autonomous Driving Framework Combined With Meta-Based Multi-Task Optimization
Rao, Zhongyu
Cai, Yingfeng
Wang, Hai
Chen, Long
Li, Yicheng
Liu, Qingchao
IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2025, 11 (01): : 4443 - 4455
[8] Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning
Guo, Jinghua
Wang, Jingyao
Wang, Huinian
Xiao, Baoping
He, Zhifei
Li, Lubin
SENSORS, 2023, 23 (13)
[9] Strawberry Verticillium Wilt Detection Network Based on Multi-Task Learning and Attention
Nie, Xuan
Wang, Luyao
Ding, Haoxuan
Xu, Min
IEEE ACCESS, 2019, 7 : 170003 - 170011
[10] WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces
Yao, Shanliang
Guan, Runwei
Wu, Zhaodong
Ni, Yi
Huang, Zile
Liu, Ryan Wen
Yue, Yong
Ding, Weiping
Lim, Eng Gee
Seo, Hyungjoon
Man, Ka Lok
Ma, Jieming
Zhu, Xiaohui
Yue, Yutao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16584 - 16598

← 1 2 →