Predicting Bird's-Eye-View Semantic Representations Using Correlated Context Learning

被引:0
|
作者
Chen, Yongquan [1 ]
Fan, Weiming [2 ]
Zheng, Wenli [3 ]
Huang, Rui [1 ]
Yu, Jiahui [1 ,4 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[2] Shenyang Ligong Univ, Sch Automat & Elect Engn, Shenyang 110159, Peoples R China
[3] Dapeng Customs Peoples Republ China, Shenzhen 518083, Peoples R China
[4] Zhejiang Univ, Dept Biomed Engn, Hangzhou 310027, Peoples R China
来源
关键词
BEV; machine cognition; attention; transformers;
D O I
10.1109/LRA.2024.3384078
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We redefine the concept of bird's-eye-view (BEV) imaging for machine cognition tasks, emphasizing its power as an image interpretation tool. Humans intuitively translate two-dimensional (2D) images into BEV representations by discerning and integrating spatial information, such as position and morphological aspects. Existing techniques focus primarily on improving accuracy in whole-to-whole mapping. However, this often results in a loss of global-local correlation, posing a significant challenge in predicting complex elements, such as multiscale dynamic objects and small-scale static objects in the distance. To address this issue, we propose correlated global-local spatial context learning (CGLSCL), one of the first attempts to amalgamate positional and morphological cues in translation for machine cognition tasks. Augmented by correlated learning, CGLSCL ensures more comprehensive BEV output, particularly for minor and fast-moving elements, which need to be captured more effectively than they are by existing methods. An evaluation of CGLSCL using the NuScenes and Argoverse 3D datasets demonstrated its superior performance compared to current state-of-the-art methods, particularly in predicting complex elements.
引用
收藏
页码:4718 / 4725
页数:8
相关论文
共 50 条
  • [42] A Bird's Eye View
    Adkins-Regan, Elizabeth
    ARCHIVES OF SEXUAL BEHAVIOR, 2017, 46 (06) : 1593 - 1594
  • [43] Bird's eye view
    Harrison, J
    NEW SCIENTIST, 1998, 157 (2122) : 52 - 52
  • [44] Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
    Ding, Xinpeng
    Han, Jianhua
    Xu, Hang
    Liang, Xiaodan
    Zhang, Wei
    Li, Xiaomeng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13668 - 13677
  • [45] A Bird's Eye View on Requirements Engineering and Machine learning
    Iqbal, Tahira
    Elahidoost, Parisa
    Lucio, Levi
    2018 25TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2018), 2018, : 11 - 20
  • [46] The Yale scientific expedition of 1871: A student's-eye-view
    Pankin, MF
    OREGON HISTORICAL QUARTERLY, 1998, 99 (04) : 374 - 435
  • [47] Front-to-Bird's-Eye-View Transformation for Autonomous Vehicles: A Class Imbalance-Based Approach
    Martinez-Barba, Daniel A.
    Valentin-Coronado, Luis M.
    Becerra, Israel
    Salzar-Colored, Sebastian
    Paredes-Orta, Carlos
    PATTERN RECOGNITION, MCPR 2024, 2024, 14755 : 166 - 176
  • [48] Efficient Semantic Segmentation for Visual Bird's-Eye View Interpretation
    Saemann, Timo
    Amende, Karl
    Milz, Stefan
    Witt, Christian
    Simon, Martin
    Petzold, Johannes
    INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15, 2019, 867 : 679 - 688
  • [49] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
    Xu, Runsheng
    Tu, Zhengzhong
    Xiang, Hao
    Shao, Wei
    Zhou, Bolei
    Ma, Jiaqi
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 989 - 1000
  • [50] BEVMOT: A multi-object detection and tracking method in Bird's-Eye-View via Spatiotemporal Transformers
    Cui, Chenglin
    Song, Ruiqi
    Li, XinQing
    Ai, Yunfeng
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5446 - 5451