Everyday concept detection in visual lifelogs: validation, relationships and trends

被引:26
作者
Byrne, Daragh [1 ]
Doherty, Aiden R. [1 ]
Snoek, Cees G. M. [2 ]
Jones, Gareth J. F. [3 ]
Smeaton, Alan F. [1 ]
机构
[1] Dublin City Univ, CLARITY Ctr Sensor Web Technol, Dublin 9, Ireland
[2] Univ Amsterdam, Intelligent Syst Lab Amsterdam, NL-1098 XG Amsterdam, Netherlands
[3] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland
基金
爱尔兰科学基金会;
关键词
Microsoft SenseCam; Lifelog; Passive photos; Concept detection; Supervised learning;
D O I
10.1007/s11042-009-0403-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Microsoft SenseCam is a small lightweight wearable camera used to passively capture photos and other sensor readings from a user's day-to-day activities. It captures on average 3,000 images in a typical day, equating to almost 1 million images per year. It can be used to aid memory by creating a personal multimedia lifelog, or visual recording of the wearer's life. However the sheer volume of image data captured within a visual lifelog creates a number of challenges, particularly for locating relevant content. Within this work, we explore the applicability of semantic concept detection, a method often used within video retrieval, on the domain of visual lifelogs. Our concept detector models the correspondence between low-level visual features and high-level semantic concepts (such as indoors, outdoors, people, buildings, etc.) using supervised machine learning. By doing so it determines the probability of a concept's presence. We apply detection of 27 everyday semantic concepts on a lifelog collection composed of 257,518 SenseCam images from 5 users. The results were evaluated on a subset of 95,907 images, to determine the accuracy for detection of each semantic concept. We conducted further analysis on the temporal consistency, co-occurance and relationships within the detected concepts to more extensively investigate the robustness of the detectors within this domain.
引用
收藏
页码:119 / 144
页数:26
相关论文
共 37 条
[1]  
[Anonymous], CIVR 07
[2]  
[Anonymous], 2007, COLUMBIA U BASELINE
[3]  
[Anonymous], BIOMETRICS
[4]  
BELL G, 2007, SCI AM NEW YORK
[5]   MULTICHANNEL TEXTURE ANALYSIS USING LOCALIZED SPATIAL FILTERS [J].
BOVIK, AC ;
CLARK, M ;
GEISLER, WS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (01) :55-73
[6]  
Byrne D., 2007, IMAI'07-5th International Conference on Intelligent Multimedia and Ambient Intelligence, P1454
[7]  
Byrne D, 2008, LECT NOTES COMPUT SC, V5392, P15, DOI 10.1007/978-3-540-92235-3_4
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]  
CHANG SF, 2008, P TRECVID WORKSH GAI
[10]  
DeVaul R.W., 2001, Real-time motion classification for wearable computing applications