Joint Deep Learning for RGB-D Action Recognition

被引：0

作者：

Qin, Xiaolei ^{[1
]}

Ge, Yongxin ^{[1
]}

Zhan, Liuwei ^{[2
]}

Li, Guangrui ^{[1
]}

Huang, Sheng ^{[1
]}

Wang, Hongxing ^{[1
]}

Chen, Feiyu ^{[1
]}

Wang, Hongxing ^{[1
]}

机构：

[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing, Peoples R China

[2] Chongqing Univ, Coll Optpelect Engn, Chongqing, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP) | 2018年

基金：

中国国家自然科学基金;

关键词：

action recognition; multimodal; common-specific features; joint;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent approaches in RGB-based and depth-based human action recognition achieved outstanding performance respectively, which demonstrate the effectiveness of RGB and depth modalities for action classification, however it is infrequent to consider them both. Currently, available multimodal-based methods of action recognition suffer from some limitations, including non-end-to-end training, violent fusion and inefficiency. In this paper, we propose a novel joint deep learning (JDL) model which is capable of. 1) jointly optimizing the object of classification and feature extraction through a novel end-to-end two-stream deep learning model, 2) refining common-specific features via introducing the constraint of similarity loss in high-level, and 3) using 2D convolution kernel instead of 3D convolution kernel during feature extraction for gaining the efficiency. The experiments on two challenging datasets show the promising performance of our architecture.

引用

页数：6

共 32 条

[1]

[Anonymous], 2016, ARXIV160402808

[2]

[Anonymous], 2016, P CVPR

[3]

[Anonymous], 2017, arXiv

[4]

[Anonymous], 2013, PROCEEDINGS OF THE I, DOI DOI 10.1109/ICCV.2013.227

[5]

[Anonymous], 2017, ARXIV170803958

[6]

[Anonymous], 2013, IJCAI

[7] Productivity effects of IT-outsourcing: Semiparametric evidence for German companies [J].

Bertschek, Irene ;

Mueller, Marlene .

ART OF SEMIPARAMETRICS, 2006, :130-+

[8] Learning Spatiotemporal Features with 3D Convolutional Networks [J].

Du Tran ;

Bourdev, Lubomir ;

Fergus, Rob ;

Torresani, Lorenzo ;

Paluri, Manohar .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497

[9]

Heng Wang, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3169, DOI 10.1109/CVPR.2011.5995407

[10]

Hu JF, 2015, PROC CVPR IEEE, P5344, DOI 10.1109/CVPR.2015.7299172

← 1 2 3 4 →