Predicting Bird's-Eye-View Semantic Representations Using Correlated Context Learning

被引:0
|
作者
Chen, Yongquan [1 ]
Fan, Weiming [2 ]
Zheng, Wenli [3 ]
Huang, Rui [1 ]
Yu, Jiahui [1 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[2] Shenyang Ligong Univ, Sch Automat & Elect Engn, Shenyang 110159, Peoples R China
[3] Dapeng Customs Peoples Republ China, Shenzhen 518083, Peoples R China
[4] Zhejiang Univ, Dept Biomed Engn, Hangzhou 310027, Peoples R China
来源
关键词
BEV; machine cognition; attention; transformers;
D O I
10.1109/LRA.2024.3384078
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We redefine the concept of bird's-eye-view (BEV) imaging for machine cognition tasks, emphasizing its power as an image interpretation tool. Humans intuitively translate two-dimensional (2D) images into BEV representations by discerning and integrating spatial information, such as position and morphological aspects. Existing techniques focus primarily on improving accuracy in whole-to-whole mapping. However, this often results in a loss of global-local correlation, posing a significant challenge in predicting complex elements, such as multiscale dynamic objects and small-scale static objects in the distance. To address this issue, we propose correlated global-local spatial context learning (CGLSCL), one of the first attempts to amalgamate positional and morphological cues in translation for machine cognition tasks. Augmented by correlated learning, CGLSCL ensures more comprehensive BEV output, particularly for minor and fast-moving elements, which need to be captured more effectively than they are by existing methods. An evaluation of CGLSCL using the NuScenes and Argoverse 3D datasets demonstrated its superior performance compared to current state-of-the-art methods, particularly in predicting complex elements.
引用
收藏
页码:4718 / 4725
页数:8
相关论文
共 50 条
  • [31] X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation
    Borse, Shubhankar
    Klingner, Marvin
    Kumar, Varun Ravi
    Cai, Hong
    Almuzairee, Abdulaziz
    Yogamani, Senthil
    Porikli, Fatih
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3286 - 3296
  • [32] Semantic Bird's-Eye View Road Line Mapping
    Bellusci, Matteo
    Cudrano, Paolo
    Mentasti, Simone
    Cortelazzo, Riccardo Erminio Filippo
    Matteucci, Matteo
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4388 - 4395
  • [33] Improving Bird's Eye View Semantic Segmentation by Task Decomposition
    Zhao, Tianhao
    Chen, Yongcan
    Wu, Yu
    Liu, Tianyang
    Du, Bo
    Xiao, Peilun
    Qiu, Shi
    Yang, Hongda
    Li, Guozhen
    Yang, Yi
    Lin, Yutian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15512 - 15521
  • [34] ViPro-BEV: Few-Shot Visual Prompting for Bird's-Eye-View Perception
    Yuan, Guorong
    Huang, Huaibo
    Fan, Qihang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 18 - 32
  • [35] BEVRefiner: Improving 3D Object Detection in Bird's-Eye-View via Dual Refinement
    Wang, Binglu
    Zheng, Haowen
    Zhang, Lei
    Liu, Nian
    Anwer, Rao Muhammad
    Cholakkal, Hisham
    Zhao, Yongqiang
    Li, Zhijun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 15094 - 15105
  • [36] A Bird’s Eye View
    Elizabeth Adkins-Regan
    Archives of Sexual Behavior, 2017, 46 : 1593 - 1594
  • [37] A BIRD'S EYE VIEW
    Hanks, Robert
    SIGHT AND SOUND, 2018, 28 (01): : 102 - 102
  • [38] Bird's eye view
    不详
    PHYSICS WORLD, 2021, 34 (07) : 3 - 3
  • [39] Bird's eye view
    Andreas Trabesinger
    Nature Physics, 2011, 7 (8) : 595 - 595
  • [40] A Bird's Eye View
    Lowe, Nancy K.
    JOGNN-JOURNAL OF OBSTETRIC GYNECOLOGIC AND NEONATAL NURSING, 2008, 37 (06): : 617 - 618