Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition

被引:8
作者
Sun, Bin [1 ]
Kong, Dehui [1 ]
Wang, Shaofan [1 ]
Wang, Lichun [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Action recognition; multi-view; sparse representation; transfer learning; REPRESENTATION; SURVEILLANCE;
D O I
10.1145/3434746
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-view human action recognition remains a challenging problem due to large view changes. In this article, we propose a transfer learning-based framework called transferable dictionary learning and view adaptation (TDVA) model for multi-view human action recognition. In the transferable dictionary learning phase, TDVA learns a set of view-specific transferable dictionaries enabling the same actions from different views to share the same sparse representations, which can transfer features of actions from different views to an intermediate domain. In the view adaptation phase, TDVA comprehensively analyzes global, local, and individual characteristics of samples, and jointly learns balanced distribution adaptation, locality preservation, and discrimination preservation, aiming at transferring sparse features of actions of different views from the intermediate domain to a common domain. In other words, TDVA progressively bridges the distribution gap among actions from various views by these two phases. Experimental results on IXMAS, ACT4(2), and NUCLA action datasets demonstrate that TDVA outperforms state-of-the-art methods.
引用
收藏
页数:23
相关论文
共 72 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] [Anonymous], 2019, IEEE T PATTERN ANAL
  • [3] Ben-David S., 2007, NEURIPS, P137
  • [4] TRANSITIVE INFERENCES AND MEMORY IN YOUNG CHILDREN
    BRYANT, PE
    TRABASSO, T
    [J]. NATURE, 1971, 232 (5311) : 456 - &
  • [5] GameFlow: Narrative Visualization of NBA Basketball Games
    Chen, Wei
    Lao, Tianyi
    Xia, Jing
    Huang, Xinxin
    Zhu, Biao
    Hu, Wanqi
    Guan, Huihua
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (11) : 2247 - 2256
  • [6] Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6
  • [7] View-Invariant Deep Architecture for Human Action Recognition Using Two-Stream Motion and Shape Temporal Dynamics
    Dhiman, Chhavi
    Vishwakarma, Dinesh Kumar
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3835 - 3844
  • [8] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
  • [9] Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization
    Ghifary, Muhammad
    Balduzzi, David
    Kleijn, W. Bastiaan
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (07) : 1414 - 1430
  • [10] Gkioxari G, 2015, PROC CVPR IEEE, P759, DOI 10.1109/CVPR.2015.7298676