2D human pose estimation using multi-level dynamic model

被引:0
|
作者
Ma M. [1 ]
Li Y. [1 ]
机构
[1] School of Control Science and Engineering, Shandong University, Ji'nan
来源
Li, Yibin (liyb@sdu.edu.cn) | 1600年 / Chinese Academy of Sciences卷 / 38期
关键词
Human activity understanding; Human pose estimation; Multi-level dynamic model; Video understanding;
D O I
10.13973/j.cnki.robot.2016.0578
中图分类号
学科分类号
摘要
A human pose estimation algorithm with a multi-level dynamic model for monocular videos is presented. Firstly, a multi-level dynamic model of human pose is constructed to decompose human entire pose into articulated pose parts, and approach optimal human pose candidates by optimizing pose parts candidates. This model solves the ambiguity problem caused by the entire pose estimation method. Secondly, an algorithm for calculating the pose consistency between the adjacent video frames is proposed by constructing virtual poses. This method can make use of the continuity of appearance features and motion features between the adjacent frames to improve the estimation accuracy. Thirdly, particle swarm optimization method is utilized to search for the best pose parts candidates with a small amount of candidates, and then the achieved pose parts are recomposed into the optimal human entire poses. The efficiency of the proposed method is tested and experimentally compared with several related state-of-the-art methods on challenging video sequences, which shows significant improvements. © 2016, Science Press. All right reserved.
引用
收藏
页码:578 / 587
页数:9
相关论文
共 28 条
  • [1] Dautenhahn K., Socially intelligent robots: Dimensions of human-robot interaction, Philosophical Transactions of the Royal Society of London, B: Biological Sciences, 362, 1480, pp. 679-704, (2007)
  • [2] Atkeson C.G., Hale J.G., Pollick F.E., Et al., Using humanoid robots to study human behavior, IEEE Intelligent Systems and Their Applications, 15, 4, pp. 46-55, (2000)
  • [3] Tian G.H., Yin J.Q., Han X., Et al., A novel human activity recognition method using joint points information, Robot, 36, 3, pp. 285-292, (2014)
  • [4] Yang Y.Z., Li Y., Fermuller C., Et al., Robot learning manipulation action plans by "watching" unconstrained videos from the World Wide Web, Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 3686-3693, (2015)
  • [5] Koppula H.S., Gupta R., Saxena A., Learning human activities and object affordances from RGB-D videos, International Journal of Robotics Research, 32, 8, pp. 951-970, (2013)
  • [6] Yang Y., Ramanan D., Articulated human detection with flexible mixtures of parts, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 12, pp. 2878-2890, (2013)
  • [7] Dantone M., Gall J., Leistner C., Et al., Human pose estimation using body parts dependent joint regressors, IEEE Conference on Computer Vision and Pattern Recognition, pp. 3041-3048, (2013)
  • [8] Fischler M.A., Elschlager R.A., The representation and matching of pictorial structures, IEEE Transactions on Computers, 22, 1, pp. 67-92, (1973)
  • [9] Freifeld O., Weiss A., Zuffi S., Et al., Contour people: A parameterized model of 2D articulated human shape, IEEE Conference on Computer Vision and Pattern Recognition, pp. 639-646, (2010)
  • [10] Zuffi S., Freifeld O., Black M.J., From pictorial structures to deformable structures, IEEE Conference on Computer Vision and Pattern Recognition, pp. 3546-3553, (2012)