Visual Object Tracking in First Person Vision

被引:18
作者
Dunnhofer, Matteo [1 ]
Furnari, Antonino [2 ]
Farinella, Giovanni Maria [2 ]
Micheloni, Christian [1 ]
机构
[1] Univ Udine, Machine Learning & Percept Lab, Via Sci 206, I-33100 Udine, Italy
[2] Univ Catania, Image Proc Lab, Viale A Doria 6, I-95125 Catania, Italy
关键词
First person vision; Egocentric vision; Visual object tracking; Single object tracking; ROBUST;
D O I
10.1007/s11263-022-01694-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has significantly improved the performance of tracking algorithms for a large variety of target objects and scenarios. Despite a few previous attempts to exploit trackers in the FPV domain, a methodical analysis of the performance of state-of-the-art trackers is still missing. This research gap raises the question of whether current solutions can be used "off-the-shelf" or more domain-specific investigations should be carried out. This paper aims to provide answers to such questions. We present the first systematic investigation of single object tracking in FPV. Our study extensively analyses the performance of 42 algorithms including generic object trackers and baseline FPV-specific trackers. The analysis is carried out by focusing on different aspects of the FPV setting, introducing new performance measures, and in relation to FPV-specific tasks. The study is made possible through the introduction of TREK-150, a novel benchmark dataset composed of 150 densely annotated video sequences. Our results show that object tracking in FPV poses new challenges to current visual trackers. We highlight the factors causing such behavior and point out possible research directions. Despite their difficulties, we prove that trackers bring benefits to FPV downstream tasks requiring short-term object tracking. We expect that generic object tracking will gain popularity in FPV as new and FPV-specific methodologies are investigated.
引用
收藏
页码:259 / 283
页数:25
相关论文
共 117 条
[1]  
Aghaei M., 2016, ICPR DETECTING SOCIA
[2]   Multi-face tracking by extended bag-of-tracklets in egocentric photo-streams [J].
Aghaei, Maedeh ;
Dimiccoli, Mariella ;
Radeva, Petia .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 149 :146-156
[3]   Egocentric Object Tracking: An Odometry-Based Solution [J].
Alletto, Stefano ;
Serra, Giuseppe ;
Cucchiara, Rita .
IMAGE ANALYSIS AND PROCESSING - ICIAP 2015, PT II, 2015, 9280 :687-696
[4]  
[Anonymous], 2018, Construction and Analysis of a Large Scale Image Ontology
[5]  
Bertasius G, 2017, ROBOTICS: SCIENCE AND SYSTEMS XIII
[6]   Unsupervised Learning of Important Objects from First-Person Videos [J].
Bertasius, Gedas ;
Park, Hyun Soo ;
Yu, Stella X. ;
Shi, Jianbo .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1974-1982
[7]   Staple: Complementary Learners for Real-Time Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Golodetz, Stuart ;
Miksik, Ondrej ;
Torr, Philip H. S. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1401-1409
[8]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[9]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[10]   Know Your Surroundings: Exploiting Scene Information for Object Tracking [J].
Bhat, Goutam ;
Danelljan, Martin ;
Van Gool, Luc ;
Timofte, Radu .
COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :205-221