Recognizing Personal Locations From Egocentric Videos

被引：28

作者：

Furnari, Antonino ^{[1
]}

Farinella, Giovanni Maria ^{[1
]}

Battiato, Sebastiano ^{[1
]}

机构：

[1] Univ Catania, Dept Math & Comp Sci, I-95124 Catania, Italy

来源：

IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS | 2017年 / 47卷 / 01期

关键词：

Context-aware computing; egocentric dataset; egocentric vision; first person vision; personal location recognition; CONTEXT; CLASSIFICATION; RECOGNITION; SCENE; SHAPE;

D O I：

10.1109/THMS.2016.2612002

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Contextual awareness in wearable computing allows for construction of intelligent systems, which are able to interact with the user in a more natural way. In this paper, we study how personal locations arising from the user's daily activities can be recognized from egocentric videos. We assume that few training samples are available for learning purposes. Considering the diversity of the devices available on the market, we introduce a benchmark dataset containing egocentric videos of eight personal locations acquired by a user with four different wearable cameras. To make our analysis useful in real-world scenarios, we propose a method to reject negative locations, i.e., those not belonging to any of the categories of interest for the end-user. We assess the performances of the main state-of-the-art representations for scene and object classification on the considered task, as well as the influence of device-specific factors such as the field of view and the wearing modality. Concerning the different device-specific factors, experiments revealed that the best results are obtained using a head-mounted wide-angular device. Our analysis shows the effectiveness of using representations based on convolutional neural networks, employing basic transfer learning techniques and an entropy-based rejection algorithm.

引用

页码：6 / 18

页数：13

共 50 条

[41] Human detection from images and videos: A survey
Duc Thanh Nguyen
Li, Wanqing
Ogunbona, Philip O.
PATTERN RECOGNITION, 2016, 51 : 148 - 175
[42] Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts
Rebecca Lawson
Attention, Perception, & Psychophysics, 2014, 76 : 541 - 558
[43] Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts
Lawson, Rebecca
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2014, 76 (02) : 541 - 558
[44] Machine learning for recognizing minerals from multispectral data
Jahoda, Pavel
Drozdovskiy, Igor
Payler, Samuel J.
Turchi, Leonardo
Bessone, Loredana
Sauro, Francesco
ANALYST, 2021, 146 (01) : 184 - 195
[45] Using 3D Convolutional Neural Network in Surveillance Videos for Recognizing Human Actions
Pushparaj, Sathyashrisharmilha
Arumugam, Sakthivel
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (04) : 693 - 700
[46] Goal-oriented top-down probabilistic visual attention model for recognition of manipulated objects in egocentric videos
Buso, Vincent
Gonzalez-Diaz, Ivan
Benois-Pineau, Jenny
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 39 : 418 - 431
[47] Obscenity detection transformer for detecting inappropriate contents from videos
Rautela, Kamakshi
Sharma, Dhruv
Kumar, Vijay
Kumar, Dinesh
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10799 - 10814
[48] Recognizing activities of daily living from UWB radars and deep learning
Maitre J.
Bouchard K.
Bertuglia C.
Gaboury S.
1600, Elsevier Ltd (164):
[49] Random Deep Belief Networks for Recognizing Emotions from Speech Signals
Wen, Guihua
Li, Huihui
Huang, Jubing
Li, Danyang
Xun, Eryang
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
[50] Whose hand is this? Person Identification from Egocentric Hand Gestures
Tsutsui, Satoshi
Fu, Yanwei
Crandall, David
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3398 - 3407

← 1 2 3 4 5 →