Everyday concept detection in visual lifelogs: validation, relationships and trends

被引：26

作者：

Byrne, Daragh ^{[1
]}

Doherty, Aiden R. ^{[1
]}

Snoek, Cees G. M. ^{[2
]}

Jones, Gareth J. F. ^{[3
]}

Smeaton, Alan F. ^{[1
]}

机构：

[1] Dublin City Univ, CLARITY Ctr Sensor Web Technol, Dublin 9, Ireland

[2] Univ Amsterdam, Intelligent Syst Lab Amsterdam, NL-1098 XG Amsterdam, Netherlands

[3] Dublin City Univ, Ctr Digital Video Proc, Dublin 9, Ireland

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2010年 / 49卷 / 01期

基金：

爱尔兰科学基金会;

关键词：

Microsoft SenseCam; Lifelog; Passive photos; Concept detection; Supervised learning;

D O I：

10.1007/s11042-009-0403-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Microsoft SenseCam is a small lightweight wearable camera used to passively capture photos and other sensor readings from a user's day-to-day activities. It captures on average 3,000 images in a typical day, equating to almost 1 million images per year. It can be used to aid memory by creating a personal multimedia lifelog, or visual recording of the wearer's life. However the sheer volume of image data captured within a visual lifelog creates a number of challenges, particularly for locating relevant content. Within this work, we explore the applicability of semantic concept detection, a method often used within video retrieval, on the domain of visual lifelogs. Our concept detector models the correspondence between low-level visual features and high-level semantic concepts (such as indoors, outdoors, people, buildings, etc.) using supervised machine learning. By doing so it determines the probability of a concept's presence. We apply detection of 27 everyday semantic concepts on a lifelog collection composed of 257,518 SenseCam images from 5 users. The results were evaluated on a subset of 95,907 images, to determine the accuracy for detection of each semantic concept. We conducted further analysis on the temporal consistency, co-occurance and relationships within the detected concepts to more extensively investigate the robustness of the detectors within this domain.

引用

页码：119 / 144

页数：26

共 37 条

[1]

[Anonymous], CIVR 07

[2]

[Anonymous], 2007, COLUMBIA U BASELINE

[3]

[Anonymous], BIOMETRICS

[4]

BELL G, 2007, SCI AM NEW YORK

[5] MULTICHANNEL TEXTURE ANALYSIS USING LOCALIZED SPATIAL FILTERS [J].

BOVIK, AC ;

CLARK, M ;

GEISLER, WS .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (01) :55-73

[6]

Byrne D., 2007, IMAI'07-5th International Conference on Intelligent Multimedia and Ambient Intelligence, P1454

[7]

Byrne D, 2008, LECT NOTES COMPUT SC, V5392, P15, DOI 10.1007/978-3-540-92235-3_4

[8] LIBSVM: A Library for Support Vector Machines [J].

Chang, Chih-Chung ;

Lin, Chih-Jen .

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)

[9]

CHANG SF, 2008, P TRECVID WORKSH GAI

[10]

DeVaul R.W., 2001, Real-time motion classification for wearable computing applications

← 1 2 3 4 →