Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Cited by: 35
Authors
Gosala, Nikhil [1]
Valada, Abhinav [1]
Affiliations
[1] Univ Freiburg, Dept Comp Sci, Freiburg, Germany
Keywords
Semantic scene understanding; object detection; segmentation and categorization; deep learning for visual perception;
DOI
10.1109/LRA.2022.3142418
Chinese Library Classification (CLC) number
TP24 [Robotics];
Discipline classification code
080202; 1405;
Abstract
Bird's-Eye-View (BEV) maps have emerged as one of the most powerful representations for scene understanding due to their ability to provide rich spatial context while being easy to interpret and process. Such maps have found use in many real-world tasks that extensively rely on accurate scene segmentation as well as object instance identification in the BEV space for their operation. However, existing segmentation algorithms only predict the semantics in the BEV space, which limits their use in applications where the notion of object instances is also critical. In this work, we present the first BEV panoptic segmentation approach for directly predicting dense panoptic segmentation maps in the BEV, given a single monocular image in the frontal view (FV). Our architecture follows the top-down paradigm and incorporates a novel dense transformer module consisting of two distinct transformers that learn to independently map vertical and flat regions in the input image from the FV to the BEV. Additionally, we derive a mathematical formulation for the sensitivity of the FV-BEV transformation which allows us to intelligently weight pixels in the BEV space to account for the varying descriptiveness across the FV image. Extensive evaluations on the KITTI-360 and nuScenes datasets demonstrate that our approach exceeds the state-of-the-art in the PQ metric by 3.61 pp and 4.93 pp respectively.
Pages: 1968 - 1975
Page count: 8
Related papers
50 in total
  • [21] Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe
    Li, Hongyang
    Sima, Chonghao
    Dai, Jifeng
    Wang, Wenhai
    Lu, Lewei
    Wang, Huijie
    Zeng, Jia
    Li, Zhiqi
    Yang, Jiazhi
    Deng, Hanming
    Tian, Hao
    Xie, Enze
    Xie, Jiangwei
    Chen, Li
    Li, Tianyu
    Li, Yang
    Gao, Yulu
    Jia, Xiaosong
    Liu, Si
    Shi, Jianping
    Lin, Dahua
    Qiao, Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2151 - 2170
  • [22] Progressive Temporal Transformer for Bird's-Eye-View Camera Pose Estimation
    Wu, Zhuoyuan
    Cai, Jiancheng
    Huang, Ranran
    Liu, Xinmin
    Chai, Zhenhua
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452 : 133 - 147
  • [23] BirdSLAM: Monocular Multibody SLAM in Bird's-eye View
    Daga, Swapnil
    Nair, Gokul B.
    Ramesh, Anirudha
    Sajnani, Rahul
    Ansari, Junaid Ahmed
    Krishna, K. Madhava
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 711 - 721
  • [24] RSBEV: Multiview Collaborative Segmentation of 3-D Remote Sensing Scenes With Bird's-Eye-View Representation
    Lin, Baihong
    Zou, Zhengxia
    Shi, Zhenwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [25] Appetite control: worm's-eye-view
    You, Young-Jai
    Avery, Leon
    ANIMAL CELLS AND SYSTEMS, 2012, 16 (05) : 351 - 356
  • [26] Towards Viewpoint Robustness in Bird's Eye View Segmentation
    Klinghoffer, Tzofi
    Philion, Jonah
    Chen, Wenzheng
    Litany, Or
    Gojcic, Zan
    Joo, Jungseock
    Raskar, Ramesh
    Fidler, Sanja
    Alvarez, Jose M.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8481 - 8490
  • [27] BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers
    Li, Zhiqi
    Wang, Wenhai
    Li, Hongyang
    Xie, Enze
    Sima, Chonghao
    Lu, Tong
    Qiao, Yu
    Dai, Jifeng
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 1 - 18
  • [28] UniFusion: Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View
    Qin, Zequn
    Chen, Jingyu
    Chen, Chao
    Chen, Xiaozhi
    Li, Xi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8656 - 8665
  • [29] A BIRD'S EYE VIEW
    Hanks, Robert
    SIGHT AND SOUND, 2018, 28 (01) : 102 - 102
  • [30] Bird's eye view
    Andreas Trabesinger
    Nature Physics, 2011, 7 (8) : 595 - 595