Unsupervised understanding of location and illumination changes in egocentric videos

被引:3
作者
Betancourt, Alejandro [1 ,2 ]
Diaz-Rodriguez, Natalia [3 ]
Barakova, Emilia [2 ]
Marcenaro, Lucio [1 ]
Rauterberg, Matthias [2 ]
Regazzoni, Carlo [1 ]
机构
[1] Univ Genoa, Dept Engn DITEN, Genoa, Italy
[2] Eindhoven Univ Technol, Dept Ind Design, Eindhoven, Netherlands
[3] Univ Calif Santa Cruz, Comp Sci Dept, Santa Cruz, CA 95064 USA
关键词
Machine learning; Unsupervised learning; Egocentric videos; First person vision; Wearable camera; ACTIVITY RECOGNITION; VISION; PRESERVATION; MAPS;
D O I
10.1016/j.pmcj.2017.03.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wearable cameras stand out as one of the most promising devices for the upcoming years, and as a consequence, the demand of computer algorithms to automatically understand the videos recorded with them is increasing quickly. An automatic understanding of these videos is not an easy task, and its mobile nature implies important challenges to be faced, such as the changing light conditions and the unrestricted locations recorded. This paper proposes an unsupervised strategy based on global features and manifold learning to endow wearable cameras with contextual information regarding the light conditions and the location captured. Results show that non-linear manifold methods can capture contextual patterns from global features without compromising large computational resources. The proposed strategy is used, as an application case, as a switching mechanism to improve the hand-detection problem in egocentric videos. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:414 / 429
页数:16
相关论文
共 49 条
[21]  
Fathi A., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3281, DOI 10.1109/CVPR.2011.5995444
[22]  
Fathi A, 2011, IEEE I CONF COMP VIS, P407, DOI 10.1109/ICCV.2011.6126269
[23]   On bias, variance, 0/1 - Loss, and the curse-of-dimensionality [J].
Friedman, JH .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (01) :55-77
[24]  
Fritzke Bernd., 1995, Advances in neural information processing systems, V7, P625
[25]   Remembering through lifelogging: A survey of human memory augmentation [J].
Harvey, Morgan ;
Langheinrich, Marc ;
Ward, Geoff .
PERVASIVE AND MOBILE COMPUTING, 2016, 27 :14-26
[26]  
Hori T., 2003, ACM SIGMM INT WORKSH, P31, DOI DOI 10.1145/973264.973270
[27]   A survey of skin-color modeling and detection methods [J].
Kakumanu, P. ;
Makrogiannis, S. ;
Bourbakis, N. .
PATTERN RECOGNITION, 2007, 40 (03) :1106-1122
[28]   THE SELF-ORGANIZING MAP [J].
KOHONEN, T .
PROCEEDINGS OF THE IEEE, 1990, 78 (09) :1464-1480
[29]  
Kohonen T., 2001, SPRINGER SERIES INFO, V30, P1
[30]   A Survey on Human Activity Recognition using Wearable Sensors [J].
Lara, Oscar D. ;
Labrador, Miguel A. .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2013, 15 (03) :1192-1209