Deep appearance and motion learning for egocentric activity recognition

被引:35
|
作者
Wang, Xuanhan [1 ]
Gao, Lianli [1 ]
Song, Jingkuan [2 ]
Zhen, Xiantong [3 ]
Sebe, Nicu [4 ]
Shen, Heng Tao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Columbia Univ, Sch Engn & Appl Sci, New York, NY 10027 USA
[3] Univ Western Ontario, Digital Imaging Grp, London, ON N6A 4V2, Canada
[4] Univ Trento, Dept Informat Engn & Comp Sci, I-38100 Trento, Italy
基金
中国国家自然科学基金;
关键词
Multiple feature learning; Deep learning; Autoencoder; Egocentric video; Activity recognition;
D O I
10.1016/j.neucom.2017.08.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric activity recognition has recently generated great popularity in computer vision due to its widespread applications in egocentric video analysis. However, it poses new challenges comparing to the conventional third-person activity recognition tasks, which are caused by significant body shaking, varied lengths, and poor recoding quality, etc. To handle these challenges, in this paper, we propose deep appearance and motion learning (DAML) for egocentric activity recognition, which leverages the great strength of deep learning networks in feature learning. In contrast to hand- crafted visual features or pre-trained convolutional neural network (CNN) features with limited generality to new egocentric videos, the proposed DAML is built on the deep autoencoder (DAE), and directly extracts appearance and motion feature, the main cue of activities, from egocentric videos. The DAML takes advantages of the great effectiveness and efficiency of the DAE in unsupervised feature learning, which provides a new representation learning framework of egocentric videos. The learned appearance and motion features by the DAML are seamlessly fused to accomplish a rich informative egocentric activity representation which can be readily fed into any supervised learning models for activity recognition. Experimental results on two challenging benchmark datasets show that the DAML achieves high performance on both short- and long-term egocentric activity recognition tasks, which is comparable to or even better than the state-of-the-art counterparts. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:438 / 447
页数:10
相关论文
共 50 条
  • [1] Fusion of Appearance and Motion Features for Daily Activity Recognition from Egocentric Perspective
    Lye, Mohd Haris
    AlDahoul, Nouar
    Abdul Karim, Hezerul
    SENSORS, 2023, 23 (15)
  • [2] Egocentric Vision for Human Activity Recognition Using Deep Learning
    Douache, Malika
    Benmoussat, Badra Nawal
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (06): : 730 - 744
  • [3] Multimodal Multi-stream Deep Learning for Egocentric Activity Recognition
    Song, Sibo
    Chandrasekhar, Vijay
    Mandal, Bappaditya
    Li, Liyuan
    Lim, Joo-Hwee
    Babu, Giduthuri Sateesh
    San, Phyo Phyo
    Cheung, Ngai-Man
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 378 - 385
  • [4] Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning
    Xu, Linfeng
    Wu, Qingbo
    Pan, Lili
    Meng, Fanman
    Li, Hongliang
    He, Chiyuan
    Wang, Hanxin
    Cheng, Shaoxu
    Dai, Yu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2430 - 2443
  • [5] Human Activity Recognition using Binary Motion Image and Deep Learning
    Dobhal, Tushar
    Shitole, Vivswan
    Thomas, Gabriel
    Navada, Girisha
    SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 178 - 185
  • [6] Egocentric Activity Recognition on a Budget
    Possas, Rafael
    Caceres, Sheila Pinto
    Ramos, Fabio
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5967 - 5976
  • [7] RECOGNIZING ACTIVITIES FROM EGOCENTRIC IMAGES WITH APPEARANCE AND MOTION FEATURES
    Chen, Yanhua
    Pei, Mingtao
    Nie, Zhengang
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [8] Recognition of human motion with deep reinforcement learning
    Seok W.
    Park C.
    IEIE Transactions on Smart Processing and Computing, 2018, 7 (03): : 245 - 250
  • [9] Motion Recognition Based on Deep Learning Algorithm
    Wang, Xue
    Liu, Li
    Zhang, Yingxing
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (14)
  • [10] Deep Attention Network for Egocentric Action Recognition
    Lu, Minlong
    Li, Ze-Nian
    Wang, Yueming
    Pan, Gang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3703 - 3713