ArtTrack: Articulated Multi-person Tracking in the Wild

被引:136
作者
Insafutdinov, Eldar [1 ]
Andriluka, Mykhaylo [1 ]
Pishchulin, Leonid [1 ]
Tang, Siyu [1 ]
Levinkov, Evgeny [1 ]
Andres, Bjoern [1 ]
Schiele, Bernt [1 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
D O I
10.1109/CVPR.2017.142
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose an approach for articulated tracking of multiple people in unconstrained videos. Our starting point is a model that resembles existing architectures for single-frame pose estimation but is substantially faster. We achieve this in two ways: (1) by simplifying and sparsifying the body-part relationship graph and leveraging recent methods for faster inference, and (2) by offloading a substantial share of computation onto a feed-forward convolutional architecture that is able to detect and associate body joints of the same person even in clutter. We use this model to generate proposals for body joint locations and formulate articulated tracking as spatio-temporal grouping of such proposals. This allows to jointly solve the association problem for all people in the scene by propagating evidence from strong detections through time and enforcing constraints that each proposal can be assigned to one person only. We report results on a public "MPII Human Pose" benchmark and on a new "MPII Video Pose" dataset of image sequences with multiple people. We demonstrate that our model achieves state-of-the-art results while using only a fraction of time and is able to leverage temporal information to improve state-of-the-art for crowded scenes(1).
引用
收藏
页码:1293 / 1301
页数:9
相关论文
共 30 条
  • [1] Andriluka M., CVPR 08
  • [2] Andriluka M., CVPR 14
  • [3] Monocular 3D Pose Estimation and Tracking by Detection
    Andriluka, Mykhaylo
    Roth, Stefan
    Schiele, Bernt
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 623 - 630
  • [4] [Anonymous], ICCV 13
  • [5] Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics
    Bernardin, Keni
    Stiefelhagen, Rainer
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
  • [6] Bulat A., ECCV 16
  • [7] Charles J., CVPR 16
  • [8] Cherian A., CVPR 14
  • [9] Eichner M., ECCV 10
  • [10] Elhayek A., CVPR 15