Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition

被引：10

作者：

Sun, Bin ^{[1
]}

Kong, Dehui ^{[1
]}

Wang, Shaofan ^{[1
]}

Wang, Lichun ^{[1
]}

Yin, Baocai ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China

来源：

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA | 2021年 / 15卷 / 02期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Action recognition; multi-view; sparse representation; transfer learning; REPRESENTATION; SURVEILLANCE;

D O I：

10.1145/3434746

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-view human action recognition remains a challenging problem due to large view changes. In this article, we propose a transfer learning-based framework called transferable dictionary learning and view adaptation (TDVA) model for multi-view human action recognition. In the transferable dictionary learning phase, TDVA learns a set of view-specific transferable dictionaries enabling the same actions from different views to share the same sparse representations, which can transfer features of actions from different views to an intermediate domain. In the view adaptation phase, TDVA comprehensively analyzes global, local, and individual characteristics of samples, and jointly learns balanced distribution adaptation, locality preservation, and discrimination preservation, aiming at transferring sparse features of actions of different views from the intermediate domain to a common domain. In other words, TDVA progressively bridges the distribution gap among actions from various views by these two phases. Experimental results on IXMAS, ACT4(2), and NUCLA action datasets demonstrate that TDVA outperforms state-of-the-art methods.

引用

页数：23

共 72 条

[1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].

Aharon, Michal ;

Elad, Michael ;

Bruckstein, Alfred .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322

[2]

[Anonymous], 2019, IEEE T PATTERN ANAL

[3]

[Anonymous], 2016, LECT NOTES COMPUT SC

[4]

Ben-David Shai, 2007, ADV NEURAL INF PROCE

[5] TRANSITIVE INFERENCES AND MEMORY IN YOUNG CHILDREN [J].

BRYANT, PE ;

TRABASSO, T .

NATURE, 1971, 232 (5311) :456-&

[6] GameFlow: Narrative Visualization of NBA Basketball Games [J].

Chen, Wei ;

Lao, Tianyi ;

Xia, Jing ;

Huang, Xinxin ;

Zhu, Biao ;

Hu, Wanqi ;

Guan, Huihua .

IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (11) :2247-2256

[7]

Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6

[8] View-Invariant Deep Architecture for Human Action Recognition Using Two-Stream Motion and Shape Temporal Dynamics [J].

Dhiman, Chhavi ;

Vishwakarma, Dinesh Kumar .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3835-3844

[9]

Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878

[10] Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization [J].

Ghifary, Muhammad ;

Balduzzi, David ;

Kleijn, W. Bastiaan ;

Zhang, Mengjie .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (07) :1414-1430

← 1 2 3 4 5 6 7 8 →