A deep architecture for log-Euclidean Fisher vector end-to-end learning with application to 3D point cloud classification

被引：1

作者：

Chekir, Amira ^{[1
]}

机构：

[1] Univ Sci & Technol Houari Boumed USTHB, Fac Genie Elect FGE, Lab & Robot Parallelisme & Syst Embarques LRPE, Algiers, Algeria

来源：

GRAPHICAL MODELS | 2022年 / 123卷

关键词：

3D point clouds; Log-Euclidean metric; Fisher vectors; Deep neural network; Covariance matrices; RGB-D data; RECOGNITION; HISTOGRAMS;

D O I：

10.1016/j.gmod.2022.101164

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Point clouds are a widely used form of 3D data, which can be produced by depth sensors, such as RGB-D cameras. The classification of common elements of 3D point clouds remains an open research problem. We propose a new deep network approach for the end-to-end training of log-Euclidean Fisher vectors (LE-FVs), applied to the classification of 3D point clouds. Our method uses a log-Euclidean (LE) metric in order to extend the concept of Fisher vectors (FVs) to LE-FV encoding. The LE-FV was computed on covariance matrices of local 3D point cloud descriptors, representing multiple features. Our architecture is composed of two blocks. The first one aims to map the covariance matrices representing the 3D point cloud descriptors to the Euclidean space. The second block allows for joint and simultaneous learning of LE-FV Gaussian Mixture Model (GMM) parameters, LE-FV dimensionality reduction, and multi-label classification. Our LE-FV deep learning model is more accurate than the FV deep learning architecture. Additionally, the introduction of joint learning of 3D point cloud features in the log-Euclidean space, including LE-FV GMM parameters, LE-FV dimensionality reduction, and multi-label classification greatly improves the accuracy of classification. Our method has also been compared with the most popular methods in the literature for 3D point cloud classification, and it achieved good performance. The quantitative evidence will be shown through different experiments.

引用

页数：10

共 63 条

[1]

[Anonymous], 2012, NIPS

[2]

[Anonymous], 2013, Advances in Neural Information Processing Systems

[3] Geometric means in a novel vector space structure on symmetric positive-definite matrices [J].

Arsigny, Vincent ;

Fillard, Pierre ;

Pennec, Xavier ;

Ayache, Nicholas .

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2007, 29 (01) :328-347

[4]

Beksi WJ, 2015, IEEE INT CONF ROBOT, P1880, DOI 10.1109/ICRA.2015.7139443

[5] Nesti-Net: Normal Estimation for Unstructured 3D Point Clouds using Convolutional Neural Networks [J].

Ben-Shabat, Yizhak ;

Lindenbaum, Michael ;

Fischer, Anath .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10104-10112

[6] 3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks [J].

Ben-Shabat, Yizhak ;

Lindenbaum, Michael ;

Fischer, Anath .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3145-3152

[7]

Blender Online Community, 2018, Blender-A 3D modelling and rendering package, P6

[8] A practical guide to CNNs and Fisher Vectors for image instance retrieval [J].

Chandrasekhar, Vijay ;

Lin, Jie ;

Morere, Olivier ;

Goh, Hanlin ;

Veillard, Antoine .

SIGNAL PROCESSING, 2016, 128 :426-439

[9] PPFNet: Global Context Aware Local Features for Robust 3D Point Matching [J].

Deng, Haowen ;

Birdal, Tolga ;

Ilie, Slobodan .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :195-205

[10]

Diba A, 2017, PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, P186, DOI 10.23919/MVA.2017.7986832

← 1 2 3 4 5 6 7 →