Multiple Riemannian Kernel Hashing for Large-Scale Image Set Classification and Retrieval

被引:0
作者
Shen, Xiaobo [1 ]
Wu, Wei [1 ]
Wang, Xiaxin [1 ]
Zheng, Yuhui [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp & Engn, Nanjing 210094, Peoples R China
[2] Qinghai Normal Univ, Coll Comp, Xining 810016, Peoples R China
基金
中国国家自然科学基金;
关键词
Kernel; Manifolds; Codes; Measurement; Covariance matrices; Task analysis; Correlation; Hashing; image set classification; retrieval; Riemannian manifold; FACE RECOGNITION; DISTRIBUTIONS; APPEARANCE; MANIFOLD;
D O I
10.1109/TIP.2024.3419414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional image set methods typically learn from small to medium-sized image set datasets. However, when applied to large-scale image set applications such as classification and retrieval, they face two primary challenges: 1) effectively modeling complex image sets; and 2) efficiently performing tasks. To address the above issues, we propose a novel Multiple Riemannian Kernel Hashing (MRKH) method that leverages the powerful capabilities of Riemannian manifold and Hashing on effective and efficient image set representation. MRKH considers multiple heterogeneous Riemannian manifolds to represent each image set. It introduces a multiple kernel learning framework designed to effectively combine statistics from multiple manifolds, and constructs kernels by selecting a small set of anchor points, enabling efficient scalability for large-scale applications. In addition, MRKH further exploits inter- and intra-modal semantic structure to enhance discrimination. Instead of employing continuous feature to represent each image set, MRKH suggests learning hash code for each image set, thereby achieving efficient computation and storage. We present an iterative algorithm with theoretical convergence guarantee to optimize MRKH, and the computational complexity is linear with the size of dataset. Extensive experiments on five image set benchmark datasets including three large-scale ones demonstrate the proposed method outperforms state-of-the-arts in accuracy and efficiency particularly in large-scale image set classification and retrieval.
引用
收藏
页码:4261 / 4273
页数:13
相关论文
共 49 条
[31]   Supervised Adaptive Similarity Matrix Hashing [J].
Shi, Yang ;
Nie, Xiushan ;
Liu, Xingbo ;
Zou, Li ;
Yin, Yilong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :2755-2766
[32]   Hierarchical Hashing Learning for Image Set Classification [J].
Sun, Yuan ;
Wang, Xu ;
Peng, Dezhong ;
Ren, Zhenwen ;
Shen, Xiaobo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :1732-1744
[33]   Feature and Semantic Views Consensus Hashing for Image Set Classification [J].
Sun, Yuan ;
Peng, Dezhong ;
Huang, Haixiao ;
Ren, Zhenwen .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :2097-2105
[34]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[35]   Rethinking Feature Distribution for Loss Functions in Image Classification [J].
Wan, Weitao ;
Zhong, Yuanyi ;
Li, Tianpeng ;
Chen, Jiansheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9117-9126
[36]   A Survey on Learning to Hash [J].
Wang, Jingdong ;
Zhang, Ting ;
Song, Jingkuan ;
Sebe, Nicu ;
Shen, Heng Tao .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :769-790
[37]   Deep Metric Learning on the SPD Manifold for Image Set Classification [J].
Wang, Rui ;
Wu, Xiao-Jun ;
Xu, Tianyang ;
Hu, Cong ;
Kittler, Josef .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (02) :663-680
[38]   SymNet: A Simple Symmetric Positive Definite Manifold Deep Learning Method for Image Set Classification [J].
Wang, Rui ;
Wu, Xiao-Jun ;
Kittler, Josef .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) :2208-2222
[39]   Graph Embedding Multi-Kernel Metric Learning for Image Set Classification With Grassmannian Manifold-Valued Features [J].
Wang, Rui ;
Wu, Xiao-Jun ;
Kittler, Josef .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :228-242
[40]  
Wang R, 2018, INT C PATT RECOG, P627, DOI 10.1109/ICPR.2018.8546030