A deep architecture for log-Euclidean Fisher vector end-to-end learning with application to 3D point cloud classification

被引:1
作者
Chekir, Amira [1 ]
机构
[1] Univ Sci & Technol Houari Boumed USTHB, Fac Genie Elect FGE, Lab & Robot Parallelisme & Syst Embarques LRPE, Algiers, Algeria
关键词
3D point clouds; Log-Euclidean metric; Fisher vectors; Deep neural network; Covariance matrices; RGB-D data; RECOGNITION; HISTOGRAMS;
D O I
10.1016/j.gmod.2022.101164
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Point clouds are a widely used form of 3D data, which can be produced by depth sensors, such as RGB-D cameras. The classification of common elements of 3D point clouds remains an open research problem. We propose a new deep network approach for the end-to-end training of log-Euclidean Fisher vectors (LE-FVs), applied to the classification of 3D point clouds. Our method uses a log-Euclidean (LE) metric in order to extend the concept of Fisher vectors (FVs) to LE-FV encoding. The LE-FV was computed on covariance matrices of local 3D point cloud descriptors, representing multiple features. Our architecture is composed of two blocks. The first one aims to map the covariance matrices representing the 3D point cloud descriptors to the Euclidean space. The second block allows for joint and simultaneous learning of LE-FV Gaussian Mixture Model (GMM) parameters, LE-FV dimensionality reduction, and multi-label classification. Our LE-FV deep learning model is more accurate than the FV deep learning architecture. Additionally, the introduction of joint learning of 3D point cloud features in the log-Euclidean space, including LE-FV GMM parameters, LE-FV dimensionality reduction, and multi-label classification greatly improves the accuracy of classification. Our method has also been compared with the most popular methods in the literature for 3D point cloud classification, and it achieved good performance. The quantitative evidence will be shown through different experiments.
引用
收藏
页数:10
相关论文
共 63 条
[11]   White matter tractography for neurosurgical planning: A topography-based review of the current state of the art [J].
Essayed, Walid I. ;
Zhang, Fan ;
Unadkat, Prashin ;
Cosgrove, G. Rees ;
Golby, Alexandra J. ;
O'Donnell, Lauren J. .
NEUROIMAGE-CLINICAL, 2017, 15 :659-672
[12]  
Faraki M, 2015, PROC CVPR IEEE, P4951, DOI 10.1109/CVPR.2015.7299129
[13]   Log-Euclidean bag of words for human action recognition [J].
Faraki, Masoud ;
Palhang, Maziar ;
Sanderson, Conrad .
IET COMPUTER VISION, 2015, 9 (03) :331-339
[14]   Fisher tensors for classifying human epithelial cells [J].
Faraki, Masoud ;
Harandi, Mehrtash T. ;
Wiliem, Arnold ;
Lovell, Brian C. .
PATTERN RECOGNITION, 2014, 47 (07) :2348-2359
[15]  
Fehr D, 2014, IEEE INT CONF ROBOT, P5467, DOI 10.1109/ICRA.2014.6907663
[16]  
Fehr D, 2012, IEEE INT CONF ROBOT, P1793, DOI 10.1109/ICRA.2012.6224740
[17]   GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition [J].
Feng, Yifan ;
Zhang, Zizhao ;
Zhao, Xibin ;
Ji, Rongrong ;
Gao, Yue .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :264-272
[18]  
Funkhouser A., 2015, FISHER SHAPENET INFO
[19]  
Huang ZW, 2017, AAAI CONF ARTIF INTE, P2036
[20]   Fisher Vector Coding for Covariance Matrix Descriptors Based on the Log-Euclidean and Affine Invariant Riemannian Metrics [J].
Ilea, Ioana ;
Bombrun, Lionel ;
Said, Salem ;
Berthoumieu, Yannick .
JOURNAL OF IMAGING, 2018, 4 (07)