Three-dimensional displays;
History;
Task analysis;
Feature extraction;
Fuses;
Pipelines;
Detectors;
Multi-view 3D object detection;
recurrent network and long-term temporal fusion;
D O I:
10.1109/LRA.2024.3401172
中图分类号:
TP24 [机器人技术];
学科分类号:
080202 ;
1405 ;
摘要:
Long-term temporal fusion is a crucial but often overlooked technique in camera-based Bird's-Eye-View (BEV) 3D perception. Existing methods are mostly in a parallel manner. While parallel fusion can benefit from long-term information, it suffers from increasing computational and memory overheads as the fusion window size grows. Alternatively, BEVFormer adopts a recurrent fusion pipeline so that history information can be efficiently integrated, yet it fails to benefit from longer temporal frames. In this letter, we explore an embarrassingly simple long-term recurrent fusion strategy built upon the LSS-based methods and find it already able to enjoy the merits from both sides, i.e., rich long-term information and efficient fusion pipeline. A temporal embedding module is further proposed to improve the model's robustness against occasionally missed frames in practical scenarios. We name this simple but effective fusing pipeline VideoBEV. Experimental results on the nuScenes benchmark show that VideoBEV obtains strong performance on various camera-based 3D perception tasks, including object detection (<bold>55.4%</bold> mAP and <bold>62.9%</bold> NDS), segmentation (<bold>48.6%</bold> vehicle mIoU), tracking (<bold>54.8%</bold> AMOTA), and motion prediction (<bold>0.80 m</bold> minADE and <bold>0.463</bold> EPA).
机构:
Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Xu, Lixiang
Cui, Qingzhe
论文数: 0引用数: 0
h-index: 0
机构:
Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Cui, Qingzhe
Hong, Richang
论文数: 0引用数: 0
h-index: 0
机构:
Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Hong, Richang
Xu, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Xu, Wei
Chen, Enhong
论文数: 0引用数: 0
h-index: 0
机构:
Univ Sci & Technol China, Sch Comp Sci & Technol, Anhui Prov Key Lab Big Data Anal & Applicat, Hefei 230000, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Chen, Enhong
Yuan, Xin
论文数: 0引用数: 0
h-index: 0
机构:
Univ Adelaide, Sch Elect & Mech Engn, Adelaide, SA 5005, AustraliaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Yuan, Xin
Li, Chenglong
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ, Sch Artificial Intelligence, Hefei 230601, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
Li, Chenglong
Tang, Yuanyan
论文数: 0引用数: 0
h-index: 0
机构:
FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Peoples R ChinaHefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Peoples R China
机构:
Shenyang Ligong Univ, Coll Equipment Engn, Shenyang 110159, Peoples R ChinaShenyang Ligong Univ, Coll Equipment Engn, Shenyang 110159, Peoples R China
Bai, Fan
Li, Lun
论文数: 0引用数: 0
h-index: 0
机构:
Weifang Univ, Inst Machinery & Automat, Weifang 261061, Peoples R ChinaShenyang Ligong Univ, Coll Equipment Engn, Shenyang 110159, Peoples R China
Li, Lun
Wang, Wencheng
论文数: 0引用数: 0
h-index: 0
机构:
Univ Engn Res Ctr Robot Vis Percept & Control, Weifang 261061, Peoples R ChinaShenyang Ligong Univ, Coll Equipment Engn, Shenyang 110159, Peoples R China
Wang, Wencheng
Wu, Xiaojin
论文数: 0引用数: 0
h-index: 0
机构:
Weifang Univ, Inst Machinery & Automat, Weifang 261061, Peoples R ChinaShenyang Ligong Univ, Coll Equipment Engn, Shenyang 110159, Peoples R China