Joint Object Recognition and Pose Estimation using a Nonlinear View-Invariant Latent Generative Model

被引:0
|
作者
Bakry, Amr [1 ]
Elgaaly, Tarek [1 ]
Elhoseiny, Mohamed [1 ]
Elgammal, Ahmed [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ 08901 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and pose estimation are two fundamental problems in the field of computer vision. Recognizing objects and their poses/viewpoints are critical components of ample vision and robotic systems. Multiple viewpoints of an object lie on an intrinsic low-dimensional manifold in the input space (i.e. descriptor space). Different objects captured from the same set of viewpoints have manifolds with a common topology. In this paper we utilize this common topology between object manifolds by learning a low-dimensional latent space which non-linearly maps between a common unified manifold and the object manifold in the input space. Using a supervised embedding approach, the latent space is computed and used to jointly infer the category and pose of objects. We empirically validate our model by using multiple inference approaches and testing on multiple challenging datasets. We compare our results with the state-of-the-art and present our increased category recognition and pose estimation accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] View-invariant recognition using corresponding object fragments
    Bart, E
    Byvatov, E
    Ullman, S
    COMPUTER VISION - ECCV 2004, PT 2, 2004, 3022 : 152 - 165
  • [2] VIEW-INVARIANT OBJECT RECOGNITION USING HOMOGRAPHY CONSTRAINTS
    Lotfian, Sina
    Foroosh, Hassan
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 605 - 609
  • [3] View-Invariant Pose Recognition Using Multilinear Analysis and the Universum
    Peng, Bo
    Qian, Gang
    Ma, Yunqian
    ADVANCES IN VISUAL COMPUTING, PT II, PROCEEDINGS, 2008, 5359 : 581 - +
  • [4] View-Invariant Action Recognition Using Latent Kernelized Structural SVM
    Wu, Xinxiao
    Jia, Yunde
    COMPUTER VISION - ECCV 2012, PT V, 2012, 7576 : 411 - 424
  • [5] Latent Multitask Learning for View-Invariant Action Recognition
    Mahasseni, Behrooz
    Todorovic, Sinisa
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3128 - 3135
  • [6] View-Invariant Gait Recognition Using a Joint-DLDA Framework
    Portillo, Jose
    Leyva, Roberto
    Sanchez, Victor
    Sanchez, Gabriel
    Perez-Meana, Hector
    Olivares, Jesus
    Toscano, Karina
    Nakano, Mariko
    TRENDS IN APPLIED KNOWLEDGE-BASED SYSTEMS AND DATA SCIENCE, 2016, 9799 : 398 - 408
  • [7] Factorization of view-object manifolds for joint object recognition and pose estimation
    Zhang, Haopeng
    El-Gaaly, Tarek
    Elgammal, Ahmed
    Jiang, Zhiguo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 139 : 89 - 103
  • [8] Joint Subspace Learning for View-Invariant Gait Recognition
    Liu, Nini
    Lu, Jiwen
    Tan, Yap-Peng
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (07) : 431 - 434
  • [9] VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION
    Baptista, Renato
    Ghorbel, Enjie
    Papadopoulos, Konstantinos
    Demisse, Girum G.
    Aouada, Djamila
    Ottersten, Bjorn
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2542 - 2546
  • [10] A developmental dissociation of view-dependent and view-invariant object recognition in adolescence
    Juettner, Martin
    Mueller, Alexander
    Rentschler, Ingo
    BEHAVIOURAL BRAIN RESEARCH, 2006, 175 (02) : 420 - 424