Center-based 3D Object Detection and Tracking

被引：1106

作者：

Yin, Tianwei ^{[1
]}

Zhou, Xingyi ^{[1
]}

Krahenbuhl, Philipp ^{[1
]}

机构：

[1] UT Austin, Austin, TX 78712 USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/CVPR46437.2021.01161

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges. Objects in a 3D world do not follow any particular orientation, and box-based detectors have difficulties enumerating all orientations or fitting an axis-aligned bounding box to rotated objects. In this paper, we instead propose to represent, detect, and track 3D objects as points. Our framework, CenterPoint, first detects centers of objects using a keypoint detector and regresses to other attributes, including 3D size, 3D orientation, and velocity. In a second stage, it refines these estimates using additional point features on the object. In CenterPoint, 3D object tracking simplifies to greedy closest-point matching. The resulting detection and tracking algorithm is simple, efficient, and effective. CenterPoint achieved state-of-the-art performance on the nuScenes benchmark for both 3D detection and tracking, with 65.5 NDS and 63.8 AMOTA for a single model. On the Waymo Open Dataset, Center-Point outperforms all previous single model methods by a large margin and ranks first among all Lidar-only submissions. The code and pretrained models are available at https://github. com/tianweiy/CenterPoint.

引用

页码：11779 / 11788

页数：10

共 66 条

[1]

[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00466

[2]

[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00190

[3]

[Anonymous], 2022, ternational

[4]

Bansal M, 2019, ROBOTICS: SCIENCE AND SYSTEMS XV

[5] Tracking without bells and whistles [J].

Bergmann, Philipp ;

Meinhardt, Tim ;

Leal-Taixe, Laura .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :941-951

[6]

BERNARDIN K., 2006, Multiple Object Tracking Performance Metrics and Evaluation in a Smart Room Environment

[7]

Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003

[8]

Bewley Alex, 2020, C ROB LEARN

[9] nuScenes: A multimodal dataset for autonomous driving [J].

Caesar, Holger ;

Bankiti, Varun ;

Lang, Alex H. ;

Vora, Sourabh ;

Liong, Venice Erin ;

Xu, Qiang ;

Krishnan, Anush ;

Pan, Yu ;

Baldan, Giancarlo ;

Beijbom, Oscar .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628

[10]

Chen Q., 2020, NeurIPS

← 1 2 3 4 5 6 7 →