Integrated person tracking using stereo, color, and pattern detection

Cited by: 140
Authors
Darrell, T [1]
Gordon, G [1]
Harville, M [1]
Woodfill, J [1]
Affiliations
[1] Interval Res Corp, Palo Alto, CA 94304 USA
Keywords
face detection; tracking and recognition; human-computer interface; frame-rate stereo; multi-modal integration
DOI
10.1023/A:1008103604354
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach to real-time person tracking in crowded and/or unknown environments using integration of multiple visual modalities. We combine stereo, color, and face detection modules into a single robust system, and show an initial application in an interactive, face-responsive display. Dense, real-time stereo processing is used to isolate users from other objects and people in the background. Skin-hue classification identifies and tracks likely body parts within the silhouette of a user. Face pattern detection discriminates and localizes the face within the identified body parts. Faces and bodies of users are tracked over several temporal scales: short-term (user stays within the field of view), medium-term (user exits and reenters within minutes), and long-term (user returns after hours or days). Short-term tracking is performed using simple region position and size correspondences, while medium- and long-term tracking are based on statistics of user appearance. We discuss the failure modes of each individual module, describe our integration method, and report results with the complete system in trials with thousands of users.
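The abstract says only that short-term tracking uses "simple region position and size correspondences" and gives no algorithm. The following is a minimal sketch of one way such a correspondence step could look, not the authors' method. The `Region` type, the cost function (centroid distance plus width and height differences), the greedy assignment, and the `max_cost` threshold are all assumptions made for illustration.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class Region:
    """Hypothetical tracked region: centroid (x, y) and size (w, h) in pixels."""
    x: float
    y: float
    w: float
    h: float


def match_cost(a: Region, b: Region) -> float:
    """Assumed cost: centroid distance plus absolute width and height change."""
    position_term = math.hypot(a.x - b.x, a.y - b.y)
    size_term = abs(a.w - b.w) + abs(a.h - b.h)
    return position_term + size_term


def match_regions(prev: list[Region], curr: list[Region],
                  max_cost: float = 50.0) -> dict[int, int]:
    """Greedily pair regions across two frames, cheapest pairs first.

    Returns a mapping from index in `prev` to index in `curr`.
    Regions with no partner under `max_cost` are left unmatched.
    """
    candidates = sorted(
        (match_cost(p, c), i, j)
        for i, p in enumerate(prev)
        for j, c in enumerate(curr)
    )
    matches: dict[int, int] = {}
    used_curr: set[int] = set()
    for cost, i, j in candidates:
        if cost > max_cost:
            break  # all remaining candidates are more expensive
        if i in matches or j in used_curr:
            continue  # each region takes part in at most one pair
        matches[i] = j
        used_curr.add(j)
    return matches


prev_frame = [Region(10, 10, 20, 40), Region(100, 50, 22, 44)]
curr_frame = [Region(102, 52, 22, 45), Region(12, 11, 20, 40),
              Region(300, 300, 10, 10)]
frame_matches = match_regions(prev_frame, curr_frame)
```

In this example the third region in `curr_frame` has no nearby predecessor and stays unmatched. A system like the one described would presumably hand such cases to the medium- and long-term appearance statistics, which this sketch does not model.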
Pages: 175-185
Page count: 11