Multiple camera tracking of interacting and occluded human motion

Cited by: 140
Authors
Dockstader, SL [1]
Tekalp, AM [1]
Affiliation
[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14627 USA
Keywords
Bayesian network; human motion; Kalman filtering; multiple camera fusion; occlusion; real-time tracking;
DOI
10.1109/5.959340
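Chinese Library Classification
TM [Electrical technology]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract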
We propose a distributed, real-time computing platform for tracking multiple interacting persons in motion. To combat the negative effects of occlusion and articulated motion, we use a multiview implementation, where each view is first independently processed on a dedicated processor. This monocular processing uses a predictor-corrector filter to weigh reprojections of three-dimensional (3-D) position estimates, obtained by the central processor, against observations of measurable image motion. The corrected state vectors from each view provide input observations to a Bayesian belief network, in the central processor, with a dynamic, multidimensional topology that varies as a function of scene content and feature confidence. The Bayesian net fuses independent observations from multiple cameras by iteratively resolving independency relationships and confidence levels within the graph, thereby producing the most likely vector of 3-D state estimates given the available data. To maintain temporal continuity, we follow the network with a layer of Kalman filtering that updates the 3-D state estimates. We demonstrate the efficacy of the proposed system using a multiview sequence of several people in motion. Our experiments suggest that, when compared with data fusion based on averaging, the proposed technique yields a noticeable improvement in tracking accuracy.
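The fuse-then-filter idea in the abstract can be illustrated with a toy one-dimensional sketch. This is not the authors' method: the Bayesian belief network is replaced here by plain inverse-variance weighting, the Kalman layer is a scalar filter with a static-state model, and every name and number (`fuse_inverse_variance`, `kalman_step`, `views`, the noise value `q`) is a hypothetical chosen for illustration. It only shows why confidence-weighted fusion can outperform simple averaging when one view is unreliable, for example because of occlusion.

```python
def fuse_inverse_variance(observations):
    """Fuse (value, variance) pairs from several views.

    Assumption: a stand-in for the paper's Bayesian network, which is
    not reproduced here. Each view is weighted by 1 / variance.
    """
    weights = [1.0 / var for _, var in observations]
    total = sum(weights)
    fused = sum(w * val for w, (val, _) in zip(weights, observations)) / total
    return fused, 1.0 / total


def kalman_step(x, p, z, r, q=0.01):
    """One predict/correct step of a scalar Kalman filter (static-state model)."""
    # Predict: the state is carried over, uncertainty grows by process noise q.
    x_pred, p_pred = x, p + q
    # Correct: blend the prediction with measurement z (variance r).
    gain = p_pred / (p_pred + r)
    return x_pred + gain * (z - x_pred), (1.0 - gain) * p_pred


# Hypothetical per-view position estimates as (value, variance).
# The third view is given a large variance, as if it were occluded.
views = [(2.0, 0.1), (2.2, 0.1), (5.0, 10.0)]

fused, fused_var = fuse_inverse_variance(views)
average = sum(value for value, _ in views) / len(views)

# Feed the fused estimate into the temporal filter as its measurement.
x, p = kalman_step(x=2.0, p=1.0, z=fused, r=fused_var)
```

In this sketch the low-confidence view barely moves the fused estimate, whereas the unweighted average is pulled well away from the two views that agree.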
Pages: 1441-1455
Page count: 15
Related papers
53 in total
  • [11] Invariant features for 3-D gesture recognition
    Campbell, LW
    Becker, DA
    Azarbayejani, A
    Bobick, AF
    Pentland, A
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 157 - 162
  • [12] CHU GW, 1999, P INT C MULT FUS INT, P261
  • [13] Introduction to the special section on video surveillance
    Collins, RT
    Lipton, AJ
    Kanade, T
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 745 - 746
  • [14] Integrated person tracking using stereo, color, and pattern detection
    Darrell, T
    Gordon, G
    Harville, M
    Woodfill, J
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2000, 37 (02) : 175 - 185
  • [15] Tracking multiple objects in the presence of articulated and occluded motion
    Dockstader, SL
    Tekalp, AM
    [J]. WORKSHOP ON HUMAN MOTION, PROCEEDINGS, 2000, : 88 - 95
  • [16] The visual analysis of human movement: A survey
    Gavrila, DM
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 73 (01) : 82 - 98
  • [17] 3-D model-based tracking of humans in action: A multi-view approach
    Gavrila, DM
    Davis, LS
    [J]. 1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, : 73 - 80
  • [18] W4: Real-time surveillance of people and their activities
    Haritaoglu, I
    Harwood, D
    Davis, LS
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) : 809 - 830
  • [19] Hydra: Multiple people detection and tracking using silhouettes
    Haritaoglu, I
    Harwood, D
    Davis, LS
    [J]. SECOND IEEE WORKSHOP ON VISUAL SURVEILLANCE (VS'99), PROCEEDINGS, 1999, : 6 - 13
  • [20] HOGG D, 1983, IMAGE VISION COMPUT, V1, P5, DOI DOI 10.1016/0262-8856(83)90003-3