Recognizing Personal Locations From Egocentric Videos

被引:28
|
作者
Furnari, Antonino [1 ]
Farinella, Giovanni Maria [1 ]
Battiato, Sebastiano [1 ]
机构
[1] Univ Catania, Dept Math & Comp Sci, I-95124 Catania, Italy
关键词
Context-aware computing; egocentric dataset; egocentric vision; first person vision; personal location recognition; CONTEXT; CLASSIFICATION; RECOGNITION; SCENE; SHAPE;
D O I
10.1109/THMS.2016.2612002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Contextual awareness in wearable computing allows for construction of intelligent systems, which are able to interact with the user in a more natural way. In this paper, we study how personal locations arising from the user's daily activities can be recognized from egocentric videos. We assume that few training samples are available for learning purposes. Considering the diversity of the devices available on the market, we introduce a benchmark dataset containing egocentric videos of eight personal locations acquired by a user with four different wearable cameras. To make our analysis useful in real-world scenarios, we propose a method to reject negative locations, i.e., those not belonging to any of the categories of interest for the end-user. We assess the performances of the main state-of-the-art representations for scene and object classification on the considered task, as well as the influence of device-specific factors such as the field of view and the wearing modality. Concerning the different device-specific factors, experiments revealed that the best results are obtained using a head-mounted wide-angular device. Our analysis shows the effectiveness of using representations based on convolutional neural networks, employing basic transfer learning techniques and an entropy-based rejection algorithm.
引用
收藏
页码:6 / 18
页数:13
相关论文
共 50 条
  • [41] Human detection from images and videos: A survey
    Duc Thanh Nguyen
    Li, Wanqing
    Ogunbona, Philip O.
    PATTERN RECOGNITION, 2016, 51 : 148 - 175
  • [42] Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts
    Rebecca Lawson
    Attention, Perception, & Psychophysics, 2014, 76 : 541 - 558
  • [43] Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts
    Lawson, Rebecca
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2014, 76 (02) : 541 - 558
  • [44] Machine learning for recognizing minerals from multispectral data
    Jahoda, Pavel
    Drozdovskiy, Igor
    Payler, Samuel J.
    Turchi, Leonardo
    Bessone, Loredana
    Sauro, Francesco
    ANALYST, 2021, 146 (01) : 184 - 195
  • [45] Using 3D Convolutional Neural Network in Surveillance Videos for Recognizing Human Actions
    Pushparaj, Sathyashrisharmilha
    Arumugam, Sakthivel
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (04) : 693 - 700
  • [46] Goal-oriented top-down probabilistic visual attention model for recognition of manipulated objects in egocentric videos
    Buso, Vincent
    Gonzalez-Diaz, Ivan
    Benois-Pineau, Jenny
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 39 : 418 - 431
  • [47] Obscenity detection transformer for detecting inappropriate contents from videos
    Rautela, Kamakshi
    Sharma, Dhruv
    Kumar, Vijay
    Kumar, Dinesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10799 - 10814
  • [48] Recognizing activities of daily living from UWB radars and deep learning
    Maitre J.
    Bouchard K.
    Bertuglia C.
    Gaboury S.
    1600, Elsevier Ltd (164):
  • [49] Random Deep Belief Networks for Recognizing Emotions from Speech Signals
    Wen, Guihua
    Li, Huihui
    Huang, Jubing
    Li, Danyang
    Xun, Eryang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [50] Whose hand is this? Person Identification from Egocentric Hand Gestures
    Tsutsui, Satoshi
    Fu, Yanwei
    Crandall, David
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3398 - 3407