Recognizing Personal Locations From Egocentric Videos

Cited by: 28
Authors
Furnari, Antonino [1]
Farinella, Giovanni Maria [1]
Battiato, Sebastiano [1]
Affiliations
[1] Univ Catania, Dept Math & Comp Sci, I-95124 Catania, Italy
Keywords
Context-aware computing; egocentric dataset; egocentric vision; first person vision; personal location recognition; CONTEXT; CLASSIFICATION; RECOGNITION; SCENE; SHAPE
DOI
10.1109/THMS.2016.2612002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Contextual awareness in wearable computing allows for the construction of intelligent systems that can interact with the user in a more natural way. In this paper, we study how personal locations arising from the user's daily activities can be recognized from egocentric videos. We assume that few training samples are available for learning purposes. Considering the diversity of the devices available on the market, we introduce a benchmark dataset containing egocentric videos of eight personal locations acquired by a user with four different wearable cameras. To make our analysis useful in real-world scenarios, we propose a method to reject negative locations, i.e., those not belonging to any of the categories of interest for the end-user. We assess the performance of the main state-of-the-art representations for scene and object classification on the considered task, as well as the influence of device-specific factors such as the field of view and the wearing modality. Concerning the device-specific factors, experiments revealed that the best results are obtained using a head-mounted wide-angle device. Our analysis shows the effectiveness of using representations based on convolutional neural networks, employing basic transfer learning techniques and an entropy-based rejection algorithm.
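The abstract names an entropy-based rejection algorithm but does not spell out its rule. The following is a minimal sketch of one common reading, assuming a frame is rejected as a negative location when the Shannon entropy of the classifier's posterior over the eight known locations exceeds a cutoff. The function names, the example posteriors, and the threshold value are all hypothetical and are not taken from the paper.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def classify_with_rejection(probs, threshold):
    """Return the index of the most probable class, or -1 to reject.

    `probs` is a hypothetical posterior over the known locations.
    `threshold` is a hypothetical entropy cutoff; in practice it would
    be tuned on held-out data rather than fixed by hand.
    """
    if entropy(probs) > threshold:
        return -1  # posterior too flat: treat as a negative location
    return max(range(len(probs)), key=lambda i: probs[i])


# Illustrative posteriors over eight locations (made-up values).
confident = [0.93, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01, 0.01]
uncertain = [0.125] * 8

print(classify_with_rejection(confident, threshold=1.0))
print(classify_with_rejection(uncertain, threshold=1.0))
```

A peaked posterior has low entropy and is accepted with its most probable class, while a near-uniform posterior has entropy close to ln 8 and is rejected. The paper's actual criterion may differ from this simple threshold.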
Pages: 6-18
Page count: 13
Related papers
50 in total
  • [31] Recognizing Gestures from Videos using a Network with Two-branch Structure and Additional Motion Cues
    Zhou, Jiaxin
    Komuro, Takashi
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 133 - 137
  • [32] Ego2Top: Matching Viewers in Egocentric and Top-View Videos
    Ardeshir, Shervin
    Borji, Ali
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 253 - 268
  • [33] ViSSa: Recognizing the appropriateness of videos on social media with on-demand crowdsourcing
    Mridha, Sankar Kumar
    Sarkar, Braznev
    Chatterjee, Sujoy
    Bhattacharyya, Malay
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [34] Predicting the future from first person (egocentric) vision: A survey
    Rodin, Ivan
    Furnari, Antonino
    Mavroeidis, Dimitrios
    Farinella, Giovanni Maria
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 211
  • [35] Online annotation of faces in personal videos by sequential learning
    Yilmazturk, M. C.
    Ulusoy, I.
    Cicekli, N. K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 63 (03) : 591 - 613
  • [36] Into the Wild: Transitioning from Recognizing Mood in Clinical Interactions to Personal Conversations for Individuals with Bipolar Disorder
    Matton, Katie
    McInnis, Melvin G.
    Provost, Emily Mower
    INTERSPEECH 2019, 2019, : 1438 - 1442
  • [37] Exploring STIP-based models for recognizing human interactions in TV videos
    Marin-Jimenez, Manuel J.
    Yeguas, Enrique
    Perez de la Blanca, Nicolas
    PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1819 - 1828
  • [38] Recognizing Food Places in Egocentric Photo-Streams Using Multi-Scale Atrous Convolutional Networks and Self-Attention Mechanism
    Sarker, Md Mostafa Kamal
    Rashwan, Hatem A.
    Akram, Farhan
    Talavera, Estefania
    Banu, Syeda Furruka
    Radeva, Petia
    Puig, Domenec
    IEEE ACCESS, 2019, 7 : 39069 - 39082
  • [39] INTERACTION-GCN: A GRAPH CONVOLUTIONAL NETWORK BASED FRAMEWORK FOR SOCIAL INTERACTION RECOGNITION IN EGOCENTRIC VIDEOS
    Felicioni, Simone
    Dimiccoli, Mariella
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2348 - 2352
  • [40] On Recognizing Faces in Videos Using Clustering-Based Re-Ranking and Fusion
    Bhatt, Himanshu S.
    Singh, Richa
    Vatsa, Mayank
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (07) : 1056 - 1068