3DCapsule: Extending the Capsule Architecture to Classify 3D Point Clouds

被引:37
作者
Cheraghian, Ali [1 ]
Petersson, Lars [2 ]
机构
[1] Australian Natl Univ, CSIRO, Data61, Canberra, ACT, Australia
[2] CSIRO, Data61, Canberra, ACT, Australia
来源
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2019年
关键词
D O I
10.1109/WACV.2019.00132
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper introduces the 3DCapsule, which is a 3D extension of the recently introduced Capsule concept that makes it applicable to unordered point sets. The original Capsule relies on the existence of a spatial relationship between the elements in the feature map it is presented with, whereas in point permutation invariant formulations of 3D point set classification methods, such relationships are typically lost. Here, a new layer called ComposeCaps is introduced that, in lieu of a spatially relevant feature mapping, learns a new mapping that can be exploited by the 3DCapsule. Previous works in the 3D point set classification domain have focused on other parts of the architecture, whereas instead, the 3DCapsule is a drop-in replacement of the commonly used fully connected classifier. It is demonstrated via an ablation study, that when the 3DCapsule is applied to recent 3D point set classification architectures, it consistently shows an improvement, in particular when subjected to noisy data. Similarly, the ComposeCaps layer is evaluated and demonstrates an improvement over the baseline. In an apples-to-apples comparison against state-of-the-art methods, again, better performance is demonstrated by the 3DCapsule.
引用
收藏
页码:1194 / 1202
页数:9
相关论文
共 25 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 1961, PRINCIPLES NEURODYNA, DOI DOI 10.21236/AD0256582
[3]  
[Anonymous], 2017, IEEE P COMPUT VIS PA, DOI DOI 10.1109/CVPR.2017.16
[4]  
[Anonymous], 2015, P IEEE C COMPUTER VI, DOI 10.1109/CVPR.2015.7298801
[5]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/CVPR.2016.572, 10.1109/TPAMI.2017.2711011]
[6]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[7]  
Hinton GE, 2011, LECT NOTES COMPUT SC, V6791, P44, DOI 10.1007/978-3-642-21735-7_6
[8]  
Ioffe S, 2015, 32 INT C MACH LEARN
[9]  
Jaderberg M., 2015, Neural Inf. Process. Syst., V28, P2017, DOI DOI 10.48550/ARXIV.1506.02025
[10]  
Jaiswal A., 2018, P EUR C COMP VIS ECC