Egocentric Vision-based Action Recognition: A survey

被引:23
|
作者
Nunez-Marcos, Adrian [1 ]
Azkune, Gorka [2 ]
Arganda-Carreras, Ignacio [3 ,4 ,5 ]
机构
[1] Univ Deusto, Deustotech Inst, Ave Universidades 24, Bilbao 48007, Spain
[2] Euskal Herriko Unibertsitatea EHU UPV, IXA NLP Grp, Fac Comp Sci, M Lardizabal 1, Donostia San Sebastian 20008, Spain
[3] Donostia Int Phys Ctr DIPC, Manuel Lardizabal 4, Donostia San Sebastian 20018, Spain
[4] Ikerbasque, Basque Fdn Sci, Plaza Euskadi 5, Bilbao 48009, Spain
[5] Univ Basque Country, Dept Comp Sci & Artificial Intelligence, M Lardizabal 1, Donostia San Sebastian 20008, Spain
关键词
Deep learning; Computer vision; Human action recognition; Egocentric vision; Few-shot learning; 1ST-PERSON ACTION RECOGNITION; TEXTURE MEASURES; FEATURES; OBJECTS;
D O I
10.1016/j.neucom.2021.11.081
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The egocentric action recognition EAR field has recently increased its popularity due to the affordable and lightweight wearable cameras available nowadays such as GoPro and similars. Therefore, the amount of egocentric data generated has increased, triggering the interest in the understanding of egocentric videos. More specifically, the recognition of actions in egocentric videos has gained popularity due to the challenge that it poses: the wild movement of the camera and the lack of context make it hard to recognise actions with a performance similar to that of third-person vision solutions. This has ignited the research interest on the field and, nowadays, many public datasets and competitions can be found in both the machine learning and the computer vision communities. In this survey, we aim to analyse the literature on egocentric vision methods and algorithms. For that, we propose a taxonomy to divide the literature into various categories with subcategories, contributing a more fine-grained classification of the available methods. We also provide a review of the zero-shot approaches used by the EAR community, a methodology that could help to transfer EAR algorithms to real-world applications. Finally, we summarise the datasets used by researchers in the literature. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:175 / 197
页数:23
相关论文
共 50 条
  • [1] A survey on vision-based human action recognition
    Poppe, Ronald
    IMAGE AND VISION COMPUTING, 2010, 28 (06) : 976 - 990
  • [2] A Comprehensive Survey of Vision-Based Human Action Recognition Methods
    Zhang, Hong-Bo
    Zhang, Yi-Xiang
    Zhong, Bineng
    Lei, Qing
    Yang, Lijie
    Du, Ji-Xiang
    Chen, Duan-Sheng
    SENSORS, 2019, 19 (05)
  • [3] A survey of vision-based methods for action representation, segmentation and recognition
    Weinland, Daniel
    Ronfard, Remi
    Boyer, Edmond
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (02) : 224 - 241
  • [4] Vision-Based Gait Recognition: A Survey
    Singh, Jasvinder Pal
    Jain, Sanjeev
    Arora, Sakshi
    Singh, Uday Pratap
    IEEE ACCESS, 2018, 6 : 70497 - 70527
  • [5] Vision-based human activity recognition: a survey
    Djamila Romaissa Beddiar
    Brahim Nini
    Mohammad Sabokrou
    Abdenour Hadid
    Multimedia Tools and Applications, 2020, 79 : 30509 - 30555
  • [6] Vision-based human activity recognition: a survey
    Beddiar, Djamila Romaissa
    Nini, Brahim
    Sabokrou, Mohammad
    Hadid, Abdenour
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30509 - 30555
  • [7] WIFI ACTION RECOGNITION VIA VISION-BASED METHODS
    Chang, Jen-Yin
    Lee, Kuan-Ying
    Lin, Kate Ching-Ju
    Hsu, Winston
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2782 - 2786
  • [8] Episodic Reasoning for Vision-Based Human Action Recognition
    Santofimia, Maria J.
    Martinez-del-Rincon, Jesus
    Nebel, Jean-Christophe
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [9] An Overview of the Vision-Based Human Action Recognition Field
    Camarena, Fernando
    Gonzalez-Mendoza, Miguel
    Chang, Leonardo
    Cuevas-Ascencio, Ricardo
    MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2023, 28 (02)
  • [10] Survey on vision-based dynamic hand gesture recognition
    Tripathi, Reena
    Verma, Bindu
    VISUAL COMPUTER, 2024, 40 (09): : 6171 - 6199