An Efficient Approach Using Knowledge Distillation Methods to Stabilize Performance in a Lightweight Top-Down Posture Estimation Network

被引:2
作者
Park, Changhyun [1 ]
Lee, Hean Sung [1 ]
Kim, Woo Jin [1 ]
Bae, Han Byeol [2 ]
Lee, Jaeho [1 ]
Lee, Sangyoun [1 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 03722, South Korea
[2] Kwangju Womens Univ, Dept Artificial Intelligence Convergence, Gwangju 62396, South Korea
关键词
pose estimation; convolutional neural network; lightweight; knowledge distillation; POSE ESTIMATION; SYSTEM;
D O I
10.3390/s21227640
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multi-person pose estimation has been gaining considerable interest due to its use in several real-world applications, such as activity recognition, motion capture, and augmented reality. Although the improvement of the accuracy and speed of multi-person pose estimation techniques has been recently studied, limitations still exist in balancing these two aspects. In this paper, a novel knowledge distilled lightweight top-down pose network (KDLPN) is proposed that balances computational complexity and accuracy. For the first time in multi-person pose estimation, a network that reduces computational complexity by applying a "Pelee" structure and shuffles pixels in the dense upsampling convolution layer to reduce the number of channels is presented. Furthermore, to prevent performance degradation because of the reduced computational complexity, knowledge distillation is applied to establish the pose estimation network as a teacher network. The method performance is evaluated on the MSCOCO dataset. Experimental results demonstrate that our KDLPN network significantly reduces 95% of the parameters required by state-of-the-art methods with minimal performance degradation. Moreover, our method is compared with other pose estimation methods to substantiate the importance of computational complexity reduction and its effectiveness.
引用
收藏
页数:18
相关论文
共 62 条
  • [41] Optimizing Distributed Actor Systems for Dynamic Interactive Services
    Newell, Andrew
    Kliot, Gabriel
    Menache, Ishai
    Gopalan, Aditya
    Akiyama, Soramichi
    Silberstein, Mark
    [J]. PROCEEDINGS OF THE ELEVENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, (EUROSYS 2016), 2016,
  • [42] Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation
    Ning, Guanghan
    Zhang, Zhi
    He, Zhiquan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (05) : 1246 - 1259
  • [43] Towards Accurate Multi-person Pose Estimation in the Wild
    Papandreou, George
    Zhu, Tyler
    Kanazawa, Nori
    Toshev, Alexander
    Tompson, Jonathan
    Bregler, Chris
    Murphy, Kevin
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3711 - 3719
  • [44] DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation
    Pishchulin, Leonid
    Insafutdinov, Eldar
    Tang, Siyu
    Andres, Bjoern
    Andriluka, Mykhaylo
    Gehler, Peter
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4929 - 4937
  • [45] Ren S., 2015, PROC ADVNEURAL INF P, P91
  • [46] Romero A., 2014, ARXIV14126550
  • [47] A Survey of Marker-Less Tracking and Registration Techniques for Health & Environmental Applications to Augmented Reality and Ubiquitous Geospatial Information Systems
    Sadeghi-Niaraki, Abolghasem
    Choi, Soo-Mi
    [J]. SENSORS, 2020, 20 (10)
  • [48] MobileNetV2: Inverted Residuals and Linear Bottlenecks
    Sandler, Mark
    Howard, Andrew
    Zhu, Menglong
    Zhmoginov, Andrey
    Chen, Liang-Chieh
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4510 - 4520
  • [49] Tan MX, 2019, PROC CVPR IEEE, P2815, DOI [arXiv:1807.11626, 10.1109/CVPR.2019.00293]
  • [50] Deeply Learned Compositional Models for Human Pose Estimation
    Tang, Wei
    Yu, Pei
    Wu, Ying
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 197 - 214