Human Pose Estimation by a Series of Residual Auto-Encoders

Cited by: 2
Authors
Farrajota, M. [1 ]
Rodrigues, Joao M. F. [1 ]
du Buf, J. M. H. [1 ]
Affiliation
[1] Univ Algarve, LARSyS, Vis Lab, P-8005139 Faro, Portugal
Keywords
Human pose; ConvNet; Neural networks; Auto-encoders;
DOI
10.1007/978-3-319-58838-4_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Pose estimation is the task of predicting the pose of an object in an image or in a sequence of images. Here, we focus on articulated human pose estimation in scenes with a single person. We employ a series of residual auto-encoders to produce multiple predictions, which are then combined to provide a heatmap prediction of body joints. In this network topology, features are processed across all scales, which captures the various spatial relationships associated with the body. Repeated bottom-up and top-down processing is applied, with intermediate supervision for each auto-encoder network. We propose some improvements to this type of regression-based network to further increase performance, namely: (a) increase the number of parameters of the auto-encoder networks in the pipeline, (b) use stronger regularization along with heavy data augmentation, (c) use sub-pixel precision for more precise joint localization, and (d) combine the output heatmaps of all auto-encoders into a single prediction, which further increases body joint prediction accuracy. We demonstrate state-of-the-art results on the popular FLIC and LSP datasets.
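Items (c) and (d) of the abstract can be illustrated with a small sketch. The abstract does not say how the heatmaps are combined or how sub-pixel precision is obtained, so the choices here are assumptions made for illustration only: the per-stage heatmaps are averaged, and the peak is refined with a one-dimensional quadratic fit along each axis. The function names (combine_heatmaps, subpixel_peak, predict_joints) and the array layout (stages, joints, height, width) are hypothetical and are not taken from the paper.

```python
import numpy as np


def combine_heatmaps(stage_heatmaps):
    """Average heatmaps over stages: (S, J, H, W) -> (J, H, W).

    Assumption: a plain mean. The paper may combine stages differently.
    """
    return np.mean(np.asarray(stage_heatmaps, dtype=np.float64), axis=0)


def _quadratic_offset(left, center, right):
    """Offset of the vertex of the parabola through three equally spaced samples."""
    denom = left - 2.0 * center + right
    if denom >= 0.0:  # flat or not a maximum: keep the integer position
        return 0.0
    return 0.5 * (left - right) / denom


def subpixel_peak(heatmap):
    """Return (x, y) of the heatmap maximum, refined to sub-pixel precision."""
    h, w = heatmap.shape
    y, x = np.unravel_index(np.argmax(heatmap), heatmap.shape)
    fx, fy = float(x), float(y)
    # Refine only when both neighbours exist along the axis.
    if 0 < x < w - 1:
        fx += _quadratic_offset(heatmap[y, x - 1], heatmap[y, x], heatmap[y, x + 1])
    if 0 < y < h - 1:
        fy += _quadratic_offset(heatmap[y - 1, x], heatmap[y, x], heatmap[y + 1, x])
    return fx, fy


def predict_joints(stage_heatmaps):
    """Combine all stages, then locate each joint: returns an array of shape (J, 2)."""
    combined = combine_heatmaps(stage_heatmaps)
    return np.array([subpixel_peak(hm) for hm in combined])
```

With an integer argmax alone, the localization error can be up to half a heatmap pixel per axis, which is magnified when heatmaps are smaller than the input image. The quadratic fit is one common, inexpensive way to reduce that error.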
Pages: 131-139
Page count: 9