Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition

被引：10

作者：

Sun, Bin ^{[1
]}

Kong, Dehui ^{[1
]}

Wang, Shaofan ^{[1
]}

Wang, Lichun ^{[1
]}

Yin, Baocai ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China

来源：

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA | 2021年 / 15卷 / 02期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Action recognition; multi-view; sparse representation; transfer learning; REPRESENTATION; SURVEILLANCE;

D O I：

10.1145/3434746

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-view human action recognition remains a challenging problem due to large view changes. In this article, we propose a transfer learning-based framework called transferable dictionary learning and view adaptation (TDVA) model for multi-view human action recognition. In the transferable dictionary learning phase, TDVA learns a set of view-specific transferable dictionaries enabling the same actions from different views to share the same sparse representations, which can transfer features of actions from different views to an intermediate domain. In the view adaptation phase, TDVA comprehensively analyzes global, local, and individual characteristics of samples, and jointly learns balanced distribution adaptation, locality preservation, and discrimination preservation, aiming at transferring sparse features of actions of different views from the intermediate domain to a common domain. In other words, TDVA progressively bridges the distribution gap among actions from various views by these two phases. Experimental results on IXMAS, ACT4(2), and NUCLA action datasets demonstrate that TDVA outperforms state-of-the-art methods.

引用

页数：23

共 72 条

[41] Heterogeneous discriminant analysis for cross-view action recognition [J].

Sui, Wanchen ;

Wu, Xinxiao ;

Feng, Yang ;

Jia, Yunde .

NEUROCOMPUTING, 2016, 191 :286-295

[42] Multilayer Convolutional Sparse Modeling: Pursuit and Dictionary Learning [J].

Sulam, Jeremias ;

Papyan, Vardan ;

Romano, Yaniv ;

Elad, Michael .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (15) :4090-4104

[43] Effective human action recognition using global and local offsets of skeleton joints [J].

Sun, Bin ;

Kong, Dehui ;

Wang, Shaofan ;

Wang, Lichun ;

Wang, Yuping ;

Yin, Baocai .

MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (05) :6329-6353

[44]

Tan B, 2017, AAAI CONF ARTIF INTE, P2604

[45] Signal recovery from random measurements via orthogonal matching pursuit [J].

Tropp, Joel A. ;

Gilbert, Anna C. .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2007, 53 (12) :4655-4666

[46]

van der Maaten L, 2008, J MACH LEARN RES, V9, P2579

[47] Action Recognition with Improved Trajectories [J].

Wang, Heng ;

Schmid, Cordelia .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3551-3558

[48] Cross-view Action Modeling, Learning and Recognition [J].

Wang, Jiang ;

Nie, Xiaohan ;

Xia, Yin ;

Wu, Ying ;

Zhu, Song-Chun .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2649-2656

[49] Visual Domain Adaptation with Manifold Embedded Distribution Alignment [J].

Wang, Jindong ;

Feng, Wenjie ;

Chen, Yiqiang ;

Yu, Han ;

Huang, Meiyu ;

Yu, Philip S. .

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :402-410

[50] Cross-View Action Recognition Based on a Statistical Translation Framework [J].

Wang, Jing ;

Zheng, Huicheng ;

Gao, Jinyu ;

Cen, Jiepeng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (08) :1461-1475

← 1 2 3 4 5 6 7 8 →