Predicting Bird's-Eye-View Semantic Representations Using Correlated Context Learning

被引:0
|
作者
Chen, Yongquan [1 ]
Fan, Weiming [2 ]
Zheng, Wenli [3 ]
Huang, Rui [1 ]
Yu, Jiahui [1 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[2] Shenyang Ligong Univ, Sch Automat & Elect Engn, Shenyang 110159, Peoples R China
[3] Dapeng Customs Peoples Republ China, Shenzhen 518083, Peoples R China
[4] Zhejiang Univ, Dept Biomed Engn, Hangzhou 310027, Peoples R China
来源
关键词
BEV; machine cognition; attention; transformers;
D O I
10.1109/LRA.2024.3384078
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We redefine the concept of bird's-eye-view (BEV) imaging for machine cognition tasks, emphasizing its power as an image interpretation tool. Humans intuitively translate two-dimensional (2D) images into BEV representations by discerning and integrating spatial information, such as position and morphological aspects. Existing techniques focus primarily on improving accuracy in whole-to-whole mapping. However, this often results in a loss of global-local correlation, posing a significant challenge in predicting complex elements, such as multiscale dynamic objects and small-scale static objects in the distance. To address this issue, we propose correlated global-local spatial context learning (CGLSCL), one of the first attempts to amalgamate positional and morphological cues in translation for machine cognition tasks. Augmented by correlated learning, CGLSCL ensures more comprehensive BEV output, particularly for minor and fast-moving elements, which need to be captured more effectively than they are by existing methods. An evaluation of CGLSCL using the NuScenes and Argoverse 3D datasets demonstrated its superior performance compared to current state-of-the-art methods, particularly in predicting complex elements.
引用
收藏
页码:4718 / 4725
页数:8
相关论文
共 50 条
  • [1] Efficient Learning of Urban Driving Policies Using Bird's-Eye-View State Representations
    Trumpp, Raphael
    Buechner, Martin
    Valada, Abhinav
    Caccamo, Marco
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4181 - 4186
  • [2] Camera-view supervision for bird's-eye-view semantic segmentation
    Yang, Bowen
    Yu, Linlin
    Chen, Feng
    FRONTIERS IN BIG DATA, 2024, 7
  • [3] SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images
    Gosala, Nikhil
    Petek, Kuersat
    Drews-, Paulo L. J., Jr.
    Burgard, Wolfram
    Valada, Abhinav
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14901 - 14910
  • [4] Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images
    Gosala, Nikhil
    Valada, Abhinav
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1968 - 1975
  • [5] LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
    Bartoccioni, Florent
    Zablocki, Eloi
    Bursuc, Andrei
    Perez, Patrick
    Cord, Matthieu
    Alahari, Karteek
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1663 - 1672
  • [6] Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation
    Jiang, Feng
    Gao, Heng
    Qiu, Shoumeng
    Zhang, Haiqiang
    Wan, Ru
    Pu, Jian
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 402 - 407
  • [7] A mammal and bird's-eye-view of the pupil during sleep and wakefulness
    Ungurean, Gianina
    Rattenborg, Niels C. C.
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2024, 59 (04) : 584 - 594
  • [8] Application of Dynamic Deformable Attention in Bird's-Eye-View Detection
    Gu, Weihao
    Ai, Rui
    Liu, Jinlong
    Fan, Lili
    Cao, Dongpu
    Zhang, Kai
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 886 - 890
  • [9] 3D Bird's-Eye-View Instance Segmentation
    Elich, Cathrin
    Engelmann, Francis
    Kontogianni, Theodora
    Leibe, Bastian
    PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 48 - 61
  • [10] Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving
    Zhu, Zijian
    Zhang, Yichi
    Chen, Hai
    Dong, Yinpeng
    Zhao, Shu
    Ding, Wenbo
    Zhong, Jiachen
    Zheng, Shibao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21600 - 21610