AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time

被引:290
作者
Fang, Hao-Shu [1 ]
Li, Jiefeng [1 ]
Tang, Hongyang [1 ]
Xu, Chao [1 ]
Zhu, Haoyi [1 ]
Xiu, Yuliang [2 ]
Li, Yong-Lu [1 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect & Comp Engn, Shanghai 200240, Peoples R China
[2] Max Planck Inst Intelligent Syst, Perceiving Syst, D-70569 Stuttgart, Germany
关键词
Pose estimation; Detectors; Heating systems; Faces; Proposals; Location awareness; Training; Human pose estimation; pose tracking; whole-body pose estimation; hand pose estimation; realtime; multi-person;
D O I
10.1109/TPAMI.2022.3222784
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate whole-body multi-person pose estimation and tracking is an important yet challenging topic in computer vision. To capture the subtle actions of humans for complex behavior analysis, whole-body pose estimation including the face, body, hand and foot is essential over conventional body-only pose estimation. In this article, we present AlphaPose, a system that can perform accurate whole-body pose estimation and tracking jointly while running in realtime. To this end, we propose several new techniques: Symmetric Integral Keypoint Regression (SIKR) for fast and fine localization, Parametric Pose Non-Maximum-Suppression (P-NMS) for eliminating redundant human detections and Pose Aware Identity Embedding for jointly pose estimation and tracking. During training, we resort to Part-Guided Proposal Generator (PGPG) and multi-domain knowledge distillation to further improve the accuracy. Our method is able to localize whole-body keypoints accurately and tracks humans simultaneously given inaccurate bounding boxes and redundant detections. We show a significant improvement over current state-of-the-art methods in both speed and accuracy on COCO-wholebody, COCO, PoseTrack, and our proposed Halpe-FullBody pose estimation dataset. Our model, source codes and dataset are made publicly available at https://github.com/MVIG-SJTU/AlphaPose.
引用
收藏
页码:7157 / 7173
页数:17
相关论文
共 89 条
[1]   PoseTrack: A Benchmark for Human Pose Estimation and Tracking [J].
Andriluka, Mykhaylo ;
Iqbal, Umar ;
Insafutdinov, Eldar ;
Pishchulin, Leonid ;
Milan, Anton ;
Gall, Juergen ;
Schiele, Bernt .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5167-5176
[2]   2D Human Pose Estimation: New Benchmark and State of the Art Analysis [J].
Andriluka, Mykhaylo ;
Pishchulin, Leonid ;
Gehler, Peter ;
Schiele, Bernt .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3686-3693
[3]   Pose-Guided Tracking-by-Detection: Robust Multi-Person Pose Tracking [J].
Bao, Qian ;
Liu, Wu ;
Cheng, Yuhao ;
Zhou, Boyan ;
Mei, Tao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :161-175
[4]   PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation [J].
Benzine, Abdallah ;
Chabot, Florian ;
Luvison, Bertrand ;
Quoc Cuong Pham ;
Achard, Catherine .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6855-6864
[5]  
Bertasius G., 2019, NeurIPS, P3021
[6]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[7]   Merging Pose Estimates Across Space and Time [J].
Burgos-Artizzu, Xavier P. ;
Hall, David ;
Perona, Pietro ;
Dollar, Piotr .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[8]   Robust face landmark estimation under occlusion [J].
Burgos-Artizzu, Xavier P. ;
Perona, Pietro ;
Dollar, Piotr .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1513-1520
[9]  
Cao Z, 2019, Arxiv, DOI [arXiv:1812.08008, 10.48550/arXiv.1812.08008]
[10]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310