Everyday concept detection in visual lifelogs: validation, relationships and trends

被引:25
作者
Byrne, Daragh [1 ]
Doherty, Aiden R. [1 ]
Snoek, Cees G. M. [2 ]
Jones, Gareth J. F. [3 ]
Smeaton, Alan F. [1 ]
机构
[1] Dublin City Univ, CLARITY Ctr Sensor Web Technol, Dublin 9, Ireland
[2] Univ Amsterdam, Intelligent Syst Lab Amsterdam, NL-1098 XG Amsterdam, Netherlands
[3] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland
基金
爱尔兰科学基金会;
关键词
Microsoft SenseCam; Lifelog; Passive photos; Concept detection; Supervised learning;
D O I
10.1007/s11042-009-0403-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Microsoft SenseCam is a small lightweight wearable camera used to passively capture photos and other sensor readings from a user's day-to-day activities. It captures on average 3,000 images in a typical day, equating to almost 1 million images per year. It can be used to aid memory by creating a personal multimedia lifelog, or visual recording of the wearer's life. However the sheer volume of image data captured within a visual lifelog creates a number of challenges, particularly for locating relevant content. Within this work, we explore the applicability of semantic concept detection, a method often used within video retrieval, on the domain of visual lifelogs. Our concept detector models the correspondence between low-level visual features and high-level semantic concepts (such as indoors, outdoors, people, buildings, etc.) using supervised machine learning. By doing so it determines the probability of a concept's presence. We apply detection of 27 everyday semantic concepts on a lifelog collection composed of 257,518 SenseCam images from 5 users. The results were evaluated on a subset of 95,907 images, to determine the accuracy for detection of each semantic concept. We conducted further analysis on the temporal consistency, co-occurance and relationships within the detected concepts to more extensively investigate the robustness of the detectors within this domain.
引用
收藏
页码:119 / 144
页数:26
相关论文
共 37 条
  • [1] [Anonymous], CIVR 07
  • [2] [Anonymous], 2007, COLUMBIA U BASELINE
  • [3] [Anonymous], BIOMETRICS
  • [4] BELL G, 2007, SCI AM NEW YORK
  • [5] MULTICHANNEL TEXTURE ANALYSIS USING LOCALIZED SPATIAL FILTERS
    BOVIK, AC
    CLARK, M
    GEISLER, WS
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (01) : 55 - 73
  • [6] Byrne D., 2007, IMAI'07-5th International Conference on Intelligent Multimedia and Ambient Intelligence, P1454
  • [7] Byrne D, 2008, LECT NOTES COMPUT SC, V5392, P15, DOI 10.1007/978-3-540-92235-3_4
  • [8] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [9] CHANG SF, 2008, P TRECVID WORKSH GAI
  • [10] DeVaul R.W., 2001, Real-time motion classification for wearable computing applications