Coarse-to-fine Animal Pose and Shape Estimation

被引:0
|
作者
Li, Chen [1 ]
Lee, Gim Hee [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing animal pose and shape estimation approaches reconstruct animal meshes with a parametric SMAL model. This is because the low-dimensional pose and shape parameters of the SMAL model makes it easier for deep networks to learn the high-dimensional animal meshes. However, the SMAL model is learned from scans of toy animals with limited pose and shape variations, and thus may not be able to represent highly varying real animals well. This may result in poor fittings of the estimated meshes to the 2D evidences, e.g. 2D keypoints or silhouettes. To mitigate this problem, we propose a coarse-to-fine approach to reconstruct 3D animal mesh from a single image. The coarse estimation stage first estimates the pose, shape and translation parameters of the SMAL model. The estimated meshes are then used as a starting point by a graph convolutional network (GCN) to predict a per-vertex deformation in the refinement stage. This combination of SMAL-based and vertex-based representations benefits from both parametric and non-parametric representations. We design our mesh refinement GCN (MRGCN) as an encoderdecoder structure with hierarchical feature representations to overcome the limited receptive field of traditional GCNs. Moreover, we observe that the global image feature used by existing animal mesh reconstruction works is unable to capture detailed shape information for mesh refinement. We thus introduce a local feature extractor to retrieve a vertex-level feature and use it together with the global feature as the input of the MRGCN. We test our approach on the StanfordExtra dataset and achieve state-of-the-art results. Furthermore, we test the generalization capacity of our approach on the Animal Pose and BADJA datasets. Our code is available at the project website(1).
引用
收藏
页数:12
相关论文
共 50 条
  • [21] C2F-CCPE: Coarse-to-Fine Cross-View Camera Pose Estimation
    Tang, Yong
    Huang, Qiang
    Zhu, Yingying
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [22] Coarse-to-Fine Hand-Object Pose Estimation with Interaction-Aware Graph Convolutional Network
    Zhang, Maomao
    Li, Ao
    Liu, Honglei
    Wang, Minghui
    SENSORS, 2021, 21 (23)
  • [23] A Coarse-to-Fine Indoor Layout Estimation (CFILE) Method
    Ren, Yuzhuo
    Li, Shangwen
    Chen, Chen
    Kuo, C. -C. Jay
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 36 - 51
  • [24] A novel coarse-to-fine search algorithm for motion estimation
    Korah, Reeba
    2006 IEEE International Conference on Industrial Technology, Vols 1-6, 2006, : 1488 - 1493
  • [25] Coarse-to-Fine Homography Estimation for Infrared and Visible Images
    Wang, Xingyi
    Luo, Yinhui
    Fu, Qiang
    He, Yuanqing
    Shu, Chang
    Wu, Yuezhou
    Liao, Yanhao
    ELECTRONICS, 2023, 12 (21)
  • [26] DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation
    Liu, Chunpu
    Yang, Guanglei
    Zuo, Wangmeng
    Zang, Tianyi
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (05)
  • [27] Coarse-to-Fine Combinatorial Matching for Dense Isometric Shape Correspondence
    Sahillioglu, Y.
    Yemez, Y.
    COMPUTER GRAPHICS FORUM, 2011, 30 (05) : 1461 - 1470
  • [28] Coarse-to-fine multiscale affine invariant shape matching and classification
    El Rube, IA
    Ahmed, M
    Kamel, M
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 163 - 166
  • [29] Coarse-to-Fine Isometric Shape Correspondence by Tracking Symmetric Flips
    Sahillioglu, Y.
    Yemez, Y.
    COMPUTER GRAPHICS FORUM, 2013, 32 (01) : 177 - 189
  • [30] Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features
    Ma, Wufei
    Wang, Angtian
    Yuille, Alan
    Kortylewski, Adam
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 492 - 508