Deep hybrid manifold for image set classification

被引:0
作者
Zeng, Xianhua [1 ,2 ]
Guo, Jueqiu [1 ]
Wei, Yifan [1 ]
Zhuo, Yang [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Sch Artificial Intelligence, Chongqing 400065, Peoples R China
[2] U Posts & Telecommun, Sch Comp Sci & Technol, Sch Artificial Intelligence, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
SPD manifold; Grassmann manifold; Visual classification; Hybrid manifold; Neural network; GEOMETRY;
D O I
10.1016/j.imavis.2024.104935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The exponential growth of the data volume of image sets, which contain more information than a single image, has attracted increasing attention from researchers. Image set data are often described as covariance matrices or linear subspaces, and the unique geometries they span are symmetric positive definite (SPD) manifolds and Grassmann manifolds, respectively. Image set data are often described as covariance matrices or linear subspaces, and the distinctive geometries they span are symmetric positive definite (SPD) manifold and Grassmann manifold, respectively. However, most studies focus on a single manifold and ignore the useful information of the another manifold. Based on this, we propose a new Deep Hybrid Manifold Network (DHMNet). The DHMNet consists of backbone network, stackable Hybrid Manifold AutoEncoder (HMAE) and,Maximum Fusion Module (MFM). The image set data is modeled through SPD manifold and Grassmann manifold. The modeled data is input into the backbone network composed of SPDNet and GrNet for initial feature extraction, and the output manifold data are input into HMAEs. The HMAE effectively extracts and hybridizes complementary information from different manifolds and has the ability to generate deep representations with rich structural semantic information. For the three image datasets used, DHMNet with two HMAEs improves the classification accuracy by 3.83-5.76% over the classical SPDNet, and even reaches the best when compared to other models, with the best performance on the First Person Hand Action (FPHA) dataset for skeleton -based hand action recognition.
引用
收藏
页数:15
相关论文
共 39 条
[1]  
Absil PA, 2008, OPTIMIZATION ALGORITHMS ON MATRIX MANIFOLDS, P1
[2]   Geometric means in a novel vector space structure on symmetric positive-definite matrices [J].
Arsigny, Vincent ;
Fillard, Pierre ;
Pennec, Xavier ;
Ayache, Nicholas .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2007, 29 (01) :328-347
[3]   Generalized discriminant analysis using a kernel approach [J].
Baudat, G ;
Anouar, FE .
NEURAL COMPUTATION, 2000, 12 (10) :2385-2404
[4]   A Higher Order Manifold-Valued Convolutional Neural Network with Applications to Diffusion MRI Processing [J].
Bouza, Jose J. ;
Yang, Chun-Hao ;
Vaillancourt, David ;
Vemuri, Baba C. .
INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 :304-317
[5]  
Brooks D, 2019, ADV NEUR IN, V32
[6]   ManifoldNet: A Deep Neural Network for Manifold-Valued Data With Applications [J].
Chakraborty, Rudrasis ;
Bouza, Jose ;
Manton, Jonathan ;
Vemuri, Baba C. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) :799-810
[7]   Hybrid Riemannian Graph-Embedding Metric Learning for Image Set Classification [J].
Chen, Ziheng ;
Xu, Tianyang ;
Wu, Xiao-Jun ;
Wang, Rui ;
Kittler, Josef .
IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (01) :75-92
[8]  
Dhall A., 2014, P INT C MULT INT, P461
[9]   The geometry of algorithms with orthogonality constraints [J].
Edelman, A ;
Arias, TA ;
Smith, ST .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1998, 20 (02) :303-353
[10]   A Robust Distance Measure for Similarity-Based Classification on the SPD Manifold [J].
Gao, Zhi ;
Wu, Yuwei ;
Harandi, Mehrtash ;
Jia, Yunde .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) :3230-3244