Adversarial Training on Point Clouds for Sim-to-Real 3D Object Detection

被引:13
作者
DeBortoli, Robert [1 ]
Li Fuxin [1 ]
Kapoor, Ashish [2 ]
Hollinger, Geoffrey A. [1 ]
机构
[1] Oregon State Univ, Collaborat Robot & Intelligent Syst CoRIS Inst, Corvallis, OR 97331 USA
[2] Microsoft, One Microsoft Way, Redmond, WA 98052 USA
基金
美国食品与农业研究所; 美国国家科学基金会;
关键词
Deep learning methods; field robots; object detection; segmentation and categorization;
D O I
10.1109/LRA.2021.3093869
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this work we address the problem of 3D object detection from point clouds in data-limited environments. Training with simulated data is a common approach in such scenarios; however a sim-to-real gap exists between clean and crisp simulated clouds and noisy real clouds. Previous sim-to-real approaches for processing point cloud scenes have compressed clouds into 2D and used 2D transfer techniques. However, this may compress useful 3D information and does not effectively reason about the unstructured nature of point cloud data. We thus propose a 3D adversarial training architecture that leverages an adaptive sampling module to reason about the unstructured nature of point cloud data. Our approach encourages the 3D feature encoder to extract features that are invariant across simulated and real scenes. We validate our approach in the context of the DARPA Subterranean Challenge and demonstrate that our 3D adversarial training architecture improves 3D object detection performance by up to 15% depending on the data representation.
引用
收藏
页码:6662 / 6669
页数:8
相关论文
共 28 条
[1]  
Achlioptas P., 2018, INT C MACHINE LEARNI, P40
[2]  
[Anonymous], 2018, DARPA SUBTERRANEAN C
[3]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[4]  
Barth Ruud., 2017, Proceedings of the IEEE IROS workshop on Agricultural Robotics, P18
[5]   Generating Highly Realistic Images of Skin Lesions with GANs [J].
Baur, Christoph ;
Albarqouni, Shadi ;
Navab, Nassir .
OR 2.0 CONTEXT-AWARE OPERATING THEATERS, COMPUTER ASSISTED ROBOTIC ENDOSCOPY, CLINICAL IMAGE-BASED PROCEDURES, AND SKIN IMAGE ANALYSIS, OR 2.0 2018, 2018, 11041 :260-267
[6]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443
[7]  
Ganin Y, 2017, ADV COMPUT VIS PATT, P189, DOI 10.1007/978-3-319-58347-1_10
[8]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[9]  
Gulrajani I, 2017, ADV NEUR IN, V30
[10]  
Mo K., 2020, P EUR C COMP VIS, P683