Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations

Cited by: 77
Authors
Ahmadyan, Adel [1]
Zhang, Liangkai [1]
Ablavatski, Artsiom [1]
Wei, Jianing [1]
Grundmann, Matthias [1]
Affiliations
[1] Google Res, Mountain View, CA 94043 USA
Source
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021 | 2021
DOI
10.1109/CVPR46437.2021.00773
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D object detection has recently become popular due to many applications in robotics, augmented reality, autonomy, and image retrieval. We introduce the Objectron dataset to advance the state of the art in 3D object detection and foster new research and applications, such as 3D object tracking, view synthesis, and improved 3D shape representation. The dataset contains object-centric short videos with pose annotations for nine categories and includes 4 million annotated images in 14,819 annotated videos. We also propose a new evaluation metric, 3D Intersection over Union, for 3D object detection. We demonstrate the usefulness of our dataset in 3D object detection and novel view synthesis tasks by providing baseline models trained on this dataset. Our dataset and evaluation source code are available online at github.com/google-research-datasets/Objectron.
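
The abstract names 3D Intersection over Union as the evaluation metric but does not describe how it is computed. As a rough illustration only, the sketch below computes IoU for two axis-aligned boxes. This is a simplifying assumption: it is not the paper's method and does not handle arbitrarily oriented boxes. The function name `axis_aligned_iou_3d` and the `(xmin, ymin, zmin, xmax, ymax, zmax)` box layout are hypothetical choices made for this example.

```python
from typing import Sequence


def axis_aligned_iou_3d(box_a: Sequence[float], box_b: Sequence[float]) -> float:
    """IoU of two axis-aligned 3D boxes.

    Each box is (xmin, ymin, zmin, xmax, ymax, zmax). This layout is an
    assumption for illustration, not the Objectron annotation format.
    """
    intersection = 1.0
    volume_a = 1.0
    volume_b = 1.0
    for axis in range(3):
        a_min, a_max = box_a[axis], box_a[axis + 3]
        b_min, b_max = box_b[axis], box_b[axis + 3]
        volume_a *= a_max - a_min
        volume_b *= b_max - b_min
        # Overlap along this axis; no overlap on any axis means IoU is zero.
        overlap = min(a_max, b_max) - max(a_min, b_min)
        if overlap <= 0.0:
            return 0.0
        intersection *= overlap
    union = volume_a + volume_b - intersection
    return intersection / union


if __name__ == "__main__":
    unit_cube = (0.0, 0.0, 0.0, 1.0, 1.0, 1.0)
    shifted_cube = (0.5, 0.0, 0.0, 1.5, 1.0, 1.0)
    print(axis_aligned_iou_3d(unit_cube, shifted_cube))
```

For oriented boxes the intersection is a general convex polyhedron rather than a box, so its volume cannot be obtained from per-axis overlaps as done here.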
Pages: 7818-7827
Page count: 10