Skeleton Merger: an Unsupervised Aligned Keypoint Detector

被引:26
|
作者
Shi, Ruoxi [1 ]
Xue, Zhengrong [1 ]
You, Yang [1 ]
Lu, Cewu [1 ,2 ,3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Res Inst, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai, Peoples R China
[4] Shanghai Qizhi Res Inst, Shanghai, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1109/CVPR46437.2021.00011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting aligned 3D keypoints is essential under many scenarios such as object tracking, shape retrieval and robotics. However, it is generally hard to prepare a high-quality dataset for all types of objects due to the ambiguity of keypoint itself. Meanwhile, current unsupervised detectors are unable to generate aligned keypoints with good coverage. In this paper, we propose an unsupervised aligned keypoint detector, Skeleton Merger, which utilizes skeletons to reconstruct objects. It is based on an Autoencoder architecture. The encoder proposes keypoints and predicts activation strengths of edges between keypoints. The decoder performs uniform sampling on the skeleton and refines it into small point clouds with pointwise offsets. Then the activation strengths are applied and the sub-clouds are merged. Composite Chamfer Distance (CCD) is proposed as a distance between the input point cloud and the reconstruction composed of sub-clouds masked by activation strengths. We demonstrate that Skeleton Merger is capable of detecting semantically-rich salient keypoints with good alignment, and shows comparable performance to supervised methods on the KeypointNet dataset. It is also shown that the detector is robust to noise and subsampling.
引用
收藏
页码:43 / 52
页数:10
相关论文
共 50 条
  • [31] USSD: Unsupervised Sleep Spindle Detector
    Ramirez, Edgardo
    Estevez, Pablo A.
    Adams, Martin D.
    Perez, Claudio A.
    Garrido Gonzalez, Marcelo
    Peirano, Patricio
    IEEE ACCESS, 2025, 13 : 18644 - 18659
  • [32] A Scale-Invariant Keypoint Detector in Log-Polar Space
    Tao, Tao
    Zhang, Yun
    EIGHTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2016), 2017, 10225
  • [33] Learning a Descriptor-specific 3D Keypoint Detector
    Salti, Samuele
    Tombari, Federico
    Spezialetti, Riccardo
    Di Stefano, Luigi
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2318 - 2326
  • [34] Semi-supervised Keypoint Detector and Descriptor for Retinal Image Matching
    Liu, Jiazhen
    Li, Xirong
    Wei, Qijie
    Xu, Jie
    Ding, Dayong
    COMPUTER VISION, ECCV 2022, PT XXI, 2022, 13681 : 593 - 609
  • [35] CVAD - An unsupervised image anomaly detector
    Guo, Xiaoyuan
    Gichoya, Judy Wawira
    Purkayastha, Saptarshi
    Banerjee, Imon
    SOFTWARE IMPACTS, 2022, 11
  • [36] UNSUPERVISED AND ADAPTIVE PERIMETER INTRUSION DETECTOR
    Lohani, Devashish
    Crispim-Junior, Carlos
    Barthelemy, Quentin
    Bertrand, Sarah
    Robinault, Lionel
    Rodet, Laure Tougne
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3621 - 3625
  • [37] Unsupervised Generative Fake Image Detector
    Qiao T.
    Shao H.
    Xie S.
    Shi R.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (09) : 1 - 1
  • [38] KeypointDETR: An End-to-End 3D Keypoint Detector
    Jin, Hairong
    Shen, Yuefan
    Lou, Jianwen
    Zhou, Kun
    Zheng, Youyi
    COMPUTER VISION - ECCV 2024, PT LXXIV, 2025, 15132 : 374 - 390
  • [39] 3D human skeleton keypoint detection using RGB and depth image
    Jeong J.
    Park B.
    Yoon K.
    Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (09): : 1354 - 1361
  • [40] Forward-looking sonar image compression by integrating keypoint clustering and morphological skeleton
    Avola, Danilo
    Bernardi, Marco
    Cinque, Luigi
    Foresti, Gian Luca
    Pannone, Daniele
    Petrioli, Chiara
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 1625 - 1639