A Scalable Approach to Activity Recognition Based on Object Use

Cited by: 74

Authors:

Wu, Jianxin [1]

Osuntogun, Adebola [1]

Choudhury, Tanzeem [2]

Philipose, Matthai [2]

Rehg, James M. [1]

Affiliations:

[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA

[2] Intel Res Seattle, Washington, DC USA

Source:

2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6 | 2007

Keywords:

DOI:

10.1109/ICCV.2007.4408865

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose an approach to activity recognition based on detecting and analyzing the sequence of objects that are being manipulated by the user. In domains such as cooking, where many activities involve similar actions, object-use information can be a valuable cue. In order for this approach to scale to many activities and objects, however, it is necessary to minimize the amount of human-labeled data that is required for modeling. We describe a method for automatically acquiring object models from video without any explicit human supervision. Our approach leverages sparse and noisy readings from RFID-tagged objects, along with common-sense knowledge about which objects are likely to be used during a given activity, to bootstrap the learning process. We present a dynamic Bayesian network model which combines RFID and video data to jointly infer the most likely activity and object labels. We demonstrate that our approach can achieve activity recognition rates of more than 80% on a real-world dataset consisting of 16 household activities involving 33 objects with significant background clutter. We show that the combination of visual object recognition with RFID data is significantly more effective than the RFID sensor alone. Our work demonstrates that it is possible to automatically learn object models from video of household activities and employ these models for activity recognition, without requiring any explicit human labeling.

Citation:

Pages: 290 / +

Page count: 2

