Composite convolution: A flexible operator for deep learning on 3D point clouds

被引：0

作者：

Floris, Alberto ^{[1
]}

Frittoli, Luca ^{[1
]}

Carrera, Diego ^{[2
]}

Boracchi, Giacomo ^{[1
]}

机构：

[1] Politecn Milan, DEIB, Via Ponzio 34-5, Milan, Italy

[2] STMicroelectronics, Via Camillo Olivetti 2, Agrate Brianza, Italy

来源：

PATTERN RECOGNITION | 2024年 / 153卷

关键词：

3D point clouds; Deep learning; Convolution; Anomaly detection;

D O I：

10.1016/j.patcog.2024.110557

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks require specific layers to process point clouds, as the scattered and irregular location of 3D points prevents the use of conventional convolutional filters. We introduce the composite layer, a flexible and general alternative to the existing convolutional operators that process 3D point clouds. We design our composite layer to extract and compress the spatial information from the 3D coordinates of points and then combine this with the feature vectors. Compared to mainstream point-convolutional layers such as ConvPoint and KPConv, our composite layer guarantees greater flexibility in network design and provides an additional form of regularization. To demonstrate the generality of our composite layers, we define both a convolutional composite layer and an aggregate version that combines spatial information and features in a nonlinear manner, and we use these layers to implement CompositeNets. Our experiments on synthetic and real-world datasets show that, in both classification, segmentation, and anomaly detection, our CompositeNets outperform ConvPoint, which uses the same sequential architecture, and achieve similar results as KPConv, which has a deeper, residual architecture. Moreover, our CompositeNets achieve state-of-the-art performance in anomaly detection on point clouds. Our code is publicly available at https://github.com/sirolf-otrebla/CompositeNet.

引用

页数：11

共 48 条

[11] Deep open-set recognition for silicon wafer production monitoring [J].

Frittoli, Luca ;

Carrera, Diego ;

Rossi, Beatrice ;

Fragneto, Pasqualina ;

Boracchi, Giacomo .

PATTERN RECOGNITION, 2022, 124

[12]

Gencer Zeki, 2022, 2022 7th International Conference on Computer Science and Engineering (UBMK), P388, DOI 10.1109/UBMK55850.2022.9919505

[13]

Golan Izhak, 2018, Advances in Neural Information Processing Systems, V31

[14] 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks [J].

Graham, Benjamin ;

Engelcke, Martin ;

van der Maaten, Laurens .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9224-9232

[15] Deep Learning for 3D Point Clouds: A Survey [J].

Guo, Yulan ;

Wang, Hanyun ;

Hu, Qingyong ;

Liu, Hao ;

Liu, Li ;

Bennamoun, Mohammed .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) :4338-4364

[16]

Han XF, 2020, Arxiv, DOI arXiv:1802.02297

[17] GOOD: A global orthographic object descriptor for 3D object recognition and manipulation [J].

Kasaei, S. Hamidreza ;

Tome, Ana Maria ;

Lopes, Luis Seabra ;

Oliveira, Miguel .

PATTERN RECOGNITION LETTERS, 2016, 83 :312-320

[18]

Kingma Diederik P, 2015, PROC INT C LEARN REP

[19] DeepSIR: Deep semantic iterative registration for LiDAR point clouds [J].

Li, Qing ;

Wang, Cheng ;

Wen, Chenglu ;

Li, Xin .

PATTERN RECOGNITION, 2023, 137

[20] FPConv: Learning Local Flattening for Point Convolution [J].

Lin, Yiqun ;

Yan, Zizheng ;

Huang, Haibin ;

Du, Dong ;

Liu, Ligang ;

Cui, Shuguang ;

Han, Xiaoguang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4292-4301

← 1 2 3 4 5 →