Joint Object Recognition and Pose Estimation using a Nonlinear View-Invariant Latent Generative Model

Cited by: 0
Authors
Bakry, Amr [1]
Elgaaly, Tarek [1]
Elhoseiny, Mohamed [1]
Elgammal, Ahmed [1]
Affiliation
[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08901 USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object recognition and pose estimation are two fundamental problems in computer vision. Recognizing objects and their poses (viewpoints) is a critical component of many vision and robotic systems. Images of an object taken from multiple viewpoints lie on an intrinsic low-dimensional manifold in the input space (i.e., the descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper we exploit this common topology by learning a low-dimensional latent space that maps non-linearly between a common unified manifold and each object manifold in the input space. The latent space is computed with a supervised embedding approach and is used to jointly infer the category and pose of objects. We validate the model empirically using multiple inference approaches on multiple challenging datasets. Compared with the state of the art, we report improved category recognition and pose estimation accuracy.
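The idea of a shared view manifold with a per-object nonlinear mapping can be illustrated with a small toy sketch. It is not the paper's model: it assumes the common manifold is a unit circle indexed by viewing angle, uses ridge-regressed Gaussian RBF maps as a stand-in for the learned nonlinear mapping, and infers category and pose by nearest synthesized descriptor. All function names, the synthetic descriptors, and the parameter values are hypothetical.

```python
import numpy as np

def circle_embedding(angles):
    """Embed viewing angles (radians) on the unit circle (assumed shared view manifold)."""
    return np.stack([np.cos(angles), np.sin(angles)], axis=1)

def rbf_features(points, centers, width=0.5):
    """Gaussian RBF features of manifold points with respect to fixed centers."""
    sq_dist = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dist / (2.0 * width ** 2))

def fit_object_mapping(angles, descriptors, centers, ridge=1e-6):
    """Ridge-regress descriptors on RBF features: one nonlinear map per object."""
    phi = rbf_features(circle_embedding(angles), centers)
    gram = phi.T @ phi + ridge * np.eye(phi.shape[1])
    return np.linalg.solve(gram, phi.T @ descriptors)

def infer(descriptor, mappings, centers, candidate_angles):
    """Return the (category, angle) whose synthesized descriptor is closest to the input."""
    phi = rbf_features(circle_embedding(candidate_angles), centers)
    best = None
    for category, weights in mappings.items():
        errors = np.linalg.norm(phi @ weights - descriptor, axis=1)
        idx = int(np.argmin(errors))
        if best is None or errors[idx] < best[0]:
            best = (errors[idx], category, float(candidate_angles[idx]))
    return best[1], best[2]

# Synthetic "descriptors" for two hypothetical objects (assumption, not real data).
def descriptor_a(theta):
    return np.stack([np.cos(theta), np.sin(theta), np.cos(2 * theta), np.ones_like(theta)], axis=1)

def descriptor_b(theta):
    return np.stack([np.sin(2 * theta), np.cos(theta), np.sin(theta), -np.ones_like(theta)], axis=1)

# Both objects are observed from the same set of viewpoints.
train_angles = np.linspace(0.0, 2.0 * np.pi, 24, endpoint=False)
centers = circle_embedding(np.linspace(0.0, 2.0 * np.pi, 12, endpoint=False))
mappings = {
    "A": fit_object_mapping(train_angles, descriptor_a(train_angles), centers),
    "B": fit_object_mapping(train_angles, descriptor_b(train_angles), centers),
}

# Jointly infer category and pose for an unseen view of object A.
candidates = np.linspace(0.0, 2.0 * np.pi, 360, endpoint=False)
query = descriptor_a(np.array([1.0]))[0]
category, angle = infer(query, mappings, centers, candidates)
print(category, round(angle, 2))
```

The sketch searches over both the category and a grid of candidate angles, so the two quantities are estimated together rather than in sequence, which is the sense of "joint" inference used in the abstract.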
Pages: 9