Joint Object Recognition and Pose Estimation using a Nonlinear View-Invariant Latent Generative Model

被引:0
|
作者
Bakry, Amr [1 ]
Elgaaly, Tarek [1 ]
Elhoseiny, Mohamed [1 ]
Elgammal, Ahmed [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08901 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and pose estimation are two fundamental problems in the field of computer vision. Recognizing objects and their poses/viewpoints are critical components of ample vision and robotic systems. Multiple viewpoints of an object lie on an intrinsic low-dimensional manifold in the input space (i.e. descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper we utilize this common topology between object manifolds by learning a low-dimensional latent space which non-linearly maps between a common unified manifold and the object manifold in the input space. Using a supervised embedding approach, the latent space is computed and used to jointly infer the category and pose of objects. We empirically validate our model by using multiple inference approaches and testing on multiple challenging datasets. We compare our results with the state-of-the-art and present our increased category recognition and pose estimation accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Cross-View Action Recognition Using View-Invariant Pose Feature Learned from Synthetic Data with Domain Adaptation
    Yang, Yu-Huan
    Liu, An-Sheng
    Liu, Yu-Hung
    Yeh, Tso-Hsin
    Li, Zi-Jun
    Fu, Li-Chen
    COMPUTER VISION - ACCV 2018, PT II, 2019, 11362 : 431 - 446
  • [22] Invariant Object Recognition and Pose Estimation with Slow Feature Analysis
    Franzius, Mathias
    Wilbert, Niko
    Wiskott, Laurenz
    NEURAL COMPUTATION, 2011, 23 (09) : 2289 - 2323
  • [23] A Discriminative Model of Motion and Cross Ratio for View-Invariant Action Recognition
    Huang, Kaiqi
    Zhang, Yeying
    Tan, Tieniu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (04) : 2187 - 2197
  • [24] Neural Substrates of View-Invariant Object Recognition Developed without Experiencing Rotations of the Objects
    Okamura, Jun-ya
    Yamaguchi, Reona
    Honda, Kazunari
    Wang, Gang
    Tanaka, Keiji
    JOURNAL OF NEUROSCIENCE, 2014, 34 (45): : 15047 - 15059
  • [25] GEINet: View-Invariant Gait Recognition Using a Convolutional Neural Network
    Shiraga, Kohei
    Makihara, Yasushi
    Muramatsu, Daigo
    Echigo, Tomio
    Yagi, Yasushi
    2016 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2016,
  • [26] VIEW-INVARIANT ACTION RECOGNITION USING CROSS RATIOS ACROSS FRAMES
    Zhang, Yeyin
    Huang, Kaiqi
    Huang, Yongzhen
    Tan, Tieniu
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3549 - 3552
  • [27] Development of novel tasks for studying view-invariant object recognition in rodents: Sensitivity to scopolamine
    Mitchnick, Krista A.
    Wideman, Cassidy E.
    Huff, Andrew E.
    Palmer, Daniel
    McNaughton, Bruce L.
    Winters, Boyer D.
    BEHAVIOURAL BRAIN RESEARCH, 2018, 344 : 48 - 56
  • [28] View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems
    Ravichandran, Avinash
    Chaudhry, Rizwan
    Vidal, Rene
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 1651 - 1657
  • [29] Object Recognition and Pose Estimation Using KLT
    Kim, Hye-Jin
    Lee, Jae Yeon
    Kim, Jae Hong
    Kim, Joong Bae
    Han, Woo Yong
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2012, : 214 - 217
  • [30] View-Invariant Human Activity Recognition using Topic Model on Combined ORB-OF Feature
    Dinh Tuan Tran
    Yamazoe, Hirotake
    Lee, Joo-Ho
    2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 369 - 374