Composite convolution: A flexible operator for deep learning on 3D point clouds

被引：0

作者：

Floris, Alberto ^{[1
]}

Frittoli, Luca ^{[1
]}

Carrera, Diego ^{[2
]}

Boracchi, Giacomo ^{[1
]}

机构：

[1] Politecn Milan, DEIB, Via Ponzio 34-5, Milan, Italy

[2] STMicroelectronics, Via Camillo Olivetti 2, Agrate Brianza, Italy

来源：

PATTERN RECOGNITION | 2024年 / 153卷

关键词：

3D point clouds; Deep learning; Convolution; Anomaly detection;

D O I：

10.1016/j.patcog.2024.110557

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks require specific layers to process point clouds, as the scattered and irregular location of 3D points prevents the use of conventional convolutional filters. We introduce the composite layer, a flexible and general alternative to the existing convolutional operators that process 3D point clouds. We design our composite layer to extract and compress the spatial information from the 3D coordinates of points and then combine this with the feature vectors. Compared to mainstream point-convolutional layers such as ConvPoint and KPConv, our composite layer guarantees greater flexibility in network design and provides an additional form of regularization. To demonstrate the generality of our composite layers, we define both a convolutional composite layer and an aggregate version that combines spatial information and features in a nonlinear manner, and we use these layers to implement CompositeNets. Our experiments on synthetic and real-world datasets show that, in both classification, segmentation, and anomaly detection, our CompositeNets outperform ConvPoint, which uses the same sequential architecture, and achieve similar results as KPConv, which has a deeper, residual architecture. Moreover, our CompositeNets achieve state-of-the-art performance in anomaly detection on point clouds. Our code is publicly available at https://github.com/sirolf-otrebla/CompositeNet.

引用

页数：11

共 48 条

[1] One-Class Classification of Airborne LiDAR Data in Urban Areas Using a Presence and Background Learning Algorithm [J].

Ao, Zurui ;

Su, Yanjun ;

Li, Wenkai ;

Guo, Qinghua ;

Zhang, Jing .

REMOTE SENSING, 2017, 9 (10)

[2] 3D Semantic Parsing of Large-Scale Indoor Spaces [J].

Armeni, Iro ;

Sener, Ozan ;

Zamir, Amir R. ;

Jiang, Helen ;

Brilakis, Ioannis ;

Fischer, Martin ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543

[3] Point Convolutional Neural Networks by Extension Operators [J].

Atzmon, Matan ;

Maron, Haggai ;

Lipman, Yaron .

ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)

[4] The MVTec 3D-AD Dataset for Unsupervised 3D Anomaly Detection and Localization [J].

Bergmann, Paul ;

Jin, Xin ;

Sattlegger, David ;

Steger, Carsten .

PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2022, :202-213

[5] ConvPoint: Continuous convolutions for point cloud processing [J].

Boulch, Alexandre .

COMPUTERS & GRAPHICS-UK, 2020, 88 :24-34

[6]

Chang Angel X., 2015, arXiv

[7] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks [J].

Choy, Christopher ;

Gwak, JunYoung ;

Savarese, Silvio .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3070-3079

[8] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].

Dai, Angela ;

Chang, Angel X. ;

Savva, Manolis ;

Halber, Maciej ;

Funkhouser, Thomas ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443

[9]

Demsar J, 2006, J MACH LEARN RES, V7, P1

[10] DcTr: Noise-robust point cloud completion by dual-channel transformer with cross-attention [J].

Fei, Ben ;

Yang, Weidong ;

Ma, Lipeng ;

Chen, Wen-Ming .

PATTERN RECOGNITION, 2023, 133

← 1 2 3 4 5 →