Why Discard if You can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis

被引：11

作者：

Chen, Jiajing ^{[1
]}

Kakillioglu, Burak ^{[1
]}

Ren, Huantao ^{[1
]}

Velipasalar, Senem ^{[1
]}

机构：

[1] Syracuse Univ, Elect Engn & Comp Sci Dept, Syracuse, NY 13244 USA

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

基金：

美国国家科学基金会;

关键词：

NORMALITY;

D O I：

10.1109/CVPR52688.2022.00064

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, most 3D point cloud analysis models have focused on developing either new network architectures or more efficient modules for aggregating point features from a local neighborhood. Regardless of the network architecture or the methodology used for improved feature learning, these models share one thing, which is the use of max-pooling in the end to obtain permutation invariant features. We first show that this traditional approach causes only a fraction of 3D points contribute to the permutation-invariant features, and discards the rest of the points. In order to address this issue and improve the performance of any baseline 3D point classification or segmentation model, we propose a new module, referred to as the Recycling Max-Pooling (RMP) module, to recycle and utilize the features of some of the discarded points. We incorporate a refinement loss that uses the recycled features to refine the prediction loss obtained from the features kept by traditional max-pooling. To the best of our knowledge, this is the first work that explores recycling of still useful points that are traditionally discarded by max-pooling. We demonstrate the effectiveness of the proposed RMP module by incorporating it into several milestone baselines and state-of-the-art networks for point cloud classification and indoor semantic segmentation tasks. We show that RPM, without any bells and whistles, consistently improves the performance of all the tested networks by using the same base network implementation and hyper-parameters. The code is provided in the supplementary material.

引用

页码：549 / 557

页数：9

共 28 条

[1] 3D Semantic Parsing of Large-Scale Indoor Spaces [J].

Armeni, Iro ;

Sener, Ozan ;

Zamir, Amir R. ;

Jiang, Helen ;

Brilakis, Ioannis ;

Fischer, Martin ;

Savarese, Silvio .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543

[2]

Boulch A., 2017, 3DOR@ Eurographics, V2, P7

[3]

Chen JJ, 2021, Arxiv, DOI arXiv:2111.07248

[4] TESTS FOR DEPARTURE FROM NORMALITY - EMPIRICAL RESULTS FOR DISTRIBUTIONS OF B2 AND SQUARE ROOT B1 [J].

DAGOSTIN.R ;

PEARSON, ES .

BIOMETRIKA, 1973, 60 (03) :613-622

[5]

DAGOSTINO RB, 1971, BIOMETRIKA, V58, P341, DOI 10.1093/biomet/58.2.341

[6] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[7]

Goyal Ankit, 2021, ICML

[8] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[9] PointGrid: A Deep Network for 3D Shape Understanding [J].

Le, Truc ;

Duan, Ye .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9204-9214

[10] PointAugment: an Auto-Augmentation Framework for Point Cloud Classification [J].

Li, Ruihui ;

Li, Xianzhi ;

Heng, Pheng-Ann ;

Fu, Chi-Wing .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6377-6386

← 1 2 3 →