Egocentric Vision-based Action Recognition: A survey

被引：23

作者：

Nunez-Marcos, Adrian ^{[1
]}

Azkune, Gorka ^{[2
]}

Arganda-Carreras, Ignacio ^{[3
,4
,5
]}

机构：

[1] Univ Deusto, Deustotech Inst, Ave Universidades 24, Bilbao 48007, Spain

[2] Euskal Herriko Unibertsitatea EHU UPV, IXA NLP Grp, Fac Comp Sci, M Lardizabal 1, Donostia San Sebastian 20008, Spain

[3] Donostia Int Phys Ctr DIPC, Manuel Lardizabal 4, Donostia San Sebastian 20018, Spain

[4] Ikerbasque, Basque Fdn Sci, Plaza Euskadi 5, Bilbao 48009, Spain

[5] Univ Basque Country, Dept Comp Sci & Artificial Intelligence, M Lardizabal 1, Donostia San Sebastian 20008, Spain

来源：

NEUROCOMPUTING | 2022年 / 472卷

关键词：

Deep learning; Computer vision; Human action recognition; Egocentric vision; Few-shot learning; 1ST-PERSON ACTION RECOGNITION; TEXTURE MEASURES; FEATURES; OBJECTS;

D O I：

10.1016/j.neucom.2021.11.081

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The egocentric action recognition EAR field has recently increased its popularity due to the affordable and lightweight wearable cameras available nowadays such as GoPro and similars. Therefore, the amount of egocentric data generated has increased, triggering the interest in the understanding of egocentric videos. More specifically, the recognition of actions in egocentric videos has gained popularity due to the challenge that it poses: the wild movement of the camera and the lack of context make it hard to recognise actions with a performance similar to that of third-person vision solutions. This has ignited the research interest on the field and, nowadays, many public datasets and competitions can be found in both the machine learning and the computer vision communities. In this survey, we aim to analyse the literature on egocentric vision methods and algorithms. For that, we propose a taxonomy to divide the literature into various categories with subcategories, contributing a more fine-grained classification of the available methods. We also provide a review of the zero-shot approaches used by the EAR community, a methodology that could help to transfer EAR algorithms to real-world applications. Finally, we summarise the datasets used by researchers in the literature. (c) 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

引用

页码：175 / 197

页数：23

共 50 条

[1] A survey on vision-based human action recognition
Poppe, Ronald
IMAGE AND VISION COMPUTING, 2010, 28 (06) : 976 - 990
[2] A Comprehensive Survey of Vision-Based Human Action Recognition Methods
Zhang, Hong-Bo
Zhang, Yi-Xiang
Zhong, Bineng
Lei, Qing
Yang, Lijie
Du, Ji-Xiang
Chen, Duan-Sheng
SENSORS, 2019, 19 (05)
[3] A survey of vision-based methods for action representation, segmentation and recognition
Weinland, Daniel
Ronfard, Remi
Boyer, Edmond
COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (02) : 224 - 241
[4] Vision-Based Gait Recognition: A Survey
Singh, Jasvinder Pal
Jain, Sanjeev
Arora, Sakshi
Singh, Uday Pratap
IEEE ACCESS, 2018, 6 : 70497 - 70527
[5] Vision-based human activity recognition: a survey
Djamila Romaissa Beddiar
Brahim Nini
Mohammad Sabokrou
Abdenour Hadid
Multimedia Tools and Applications, 2020, 79 : 30509 - 30555
[6] Vision-based human activity recognition: a survey
Beddiar, Djamila Romaissa
Nini, Brahim
Sabokrou, Mohammad
Hadid, Abdenour
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 30509 - 30555
[7] WIFI ACTION RECOGNITION VIA VISION-BASED METHODS
Chang, Jen-Yin
Lee, Kuan-Ying
Lin, Kate Ching-Ju
Hsu, Winston
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2782 - 2786
[8] Episodic Reasoning for Vision-Based Human Action Recognition
Santofimia, Maria J.
Martinez-del-Rincon, Jesus
Nebel, Jean-Christophe
SCIENTIFIC WORLD JOURNAL, 2014,
[9] An Overview of the Vision-Based Human Action Recognition Field
Camarena, Fernando
Gonzalez-Mendoza, Miguel
Chang, Leonardo
Cuevas-Ascencio, Ricardo
MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2023, 28 (02)
[10] Survey on vision-based dynamic hand gesture recognition
Tripathi, Reena
Verma, Bindu
VISUAL COMPUTER, 2024, 40 (09): : 6171 - 6199

← 1 2 3 4 5 →