Deep appearance and motion learning for egocentric activity recognition

被引:35
|
作者
Wang, Xuanhan [1 ]
Gao, Lianli [1 ]
Song, Jingkuan [2 ]
Zhen, Xiantong [3 ]
Sebe, Nicu [4 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Columbia Univ, Sch Engn & Appl Sci, New York, NY 10027 USA
[3] Univ Western Ontario, Digital Imaging Grp, London, ON N6A 4V2, Canada
[4] Univ Trento, Dept Informat Engn & Comp Sci, I-38100 Trento, Italy
基金
中国国家自然科学基金;
关键词
Multiple feature learning; Deep learning; Autoencoder; Egocentric video; Activity recognition;
D O I
10.1016/j.neucom.2017.08.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric activity recognition has recently generated great popularity in computer vision due to its widespread applications in egocentric video analysis. However, it poses new challenges comparing to the conventional third-person activity recognition tasks, which are caused by significant body shaking, varied lengths, and poor recoding quality, etc. To handle these challenges, in this paper, we propose deep appearance and motion learning (DAML) for egocentric activity recognition, which leverages the great strength of deep learning networks in feature learning. In contrast to hand- crafted visual features or pre-trained convolutional neural network (CNN) features with limited generality to new egocentric videos, the proposed DAML is built on the deep autoencoder (DAE), and directly extracts appearance and motion feature, the main cue of activities, from egocentric videos. The DAML takes advantages of the great effectiveness and efficiency of the DAE in unsupervised feature learning, which provides a new representation learning framework of egocentric videos. The learned appearance and motion features by the DAML are seamlessly fused to accomplish a rich informative egocentric activity representation which can be readily fed into any supervised learning models for activity recognition. Experimental results on two challenging benchmark datasets show that the DAML achieves high performance on both short- and long-term egocentric activity recognition tasks, which is comparable to or even better than the state-of-the-art counterparts. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:438 / 447
页数:10
相关论文
共 50 条
  • [31] A Hierarchical Deep Fusion Framework for Egocentric Activity Recognition using a Wearable Hybrid Sensor System
    Yu, Haibin
    Pan, Guoxiong
    Pan, Mian
    Li, Chong
    Jia, Wenyan
    Zhang, Li
    Sun, Mingui
    SENSORS, 2019, 19 (03)
  • [32] Deep motion estimation through adversarial learning for gait recognition
    Yue, Yuanhao
    Shi, Laixiang
    Zheng, Zheng
    Chen, Long
    Wang, Zhongyuan
    Zou, Qin
    PATTERN RECOGNITION LETTERS, 2024, 184 : 232 - 237
  • [33] Motion Recognition Based on Deep Learning and Human Joint Points
    Wang, Junping
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [34] Deep Learning with a Spatiotemporal Descriptor of Appearance and Motion Estimation for Video Anomaly Detection
    Gunale, Kishanprasad G.
    Mukherji, Prachi
    JOURNAL OF IMAGING, 2018, 4 (06):
  • [35] An Attention-based Activity Recognition for Egocentric Video
    Matsuo, Kenji
    Yamada, Kentaro
    Ueno, Satoshi
    Naito, Sei
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 565 - +
  • [36] Activity Recognition in Egocentric Life-Logging Videos
    Song, Sibo
    Chandrasekhar, Vijay
    Cheung, Ngai-Man
    Narayan, Sanath
    Li, Liyuan
    Lim, Joo-Hwee
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT III, 2015, 9010 : 445 - 458
  • [37] Activity recognition using an egocentric perspective of everyday objects
    Surie, Dipak
    Pederson, Thomas
    Lagriffoul, Fabien
    Janlert, Lars-Erik
    Sjolie, Daniel
    UBIQUITOUS INTELLIGENCE AND COMPUTING, PROCEEDINGS, 2007, 4611 : 246 - +
  • [38] Integrating Human Gaze into Attention for Egocentric Activity Recognition
    Min, Kyle
    Corso, Jason J.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1068 - 1077
  • [39] Analysis of SVM and kNN Classifiers For Egocentric Activity Recognition
    Kumar, K. P. Sanal
    Bhavani, R.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [40] Human activity recognition using deep electroencephalography learning
    Salehzadeh, Amirsaleh
    Calitz, Andre P.
    Greyling, Jean
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 62