Learning orthographic transformations for object recognition

被引:0
|
作者
Bebis, G
Georgiopoulos, M
Bhatia, S
机构
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider the problem of learning to predict the correct pose of a 3D object, assuming orthographic projection and 3D linear transformations. A neural network is trained to learn the desired mapping. First, we consider the problem of predicting all possible views that an object can produce. This is performed by representing the object with a small number of reference views and using algebraic functions of views to construct the space of all possible views that the object can produce. Fundamental to this procedure is a methodology based on Singular Value Decomposition and interval Arithmetic for estimating of the ranges of values that the parameters of algebraic functions can assume. Then, a neural network is trained using a number of views (training views) which are generated by sampling the space of views of the object. During learning, a training view is presented to the inputs of the network which is required to respond at its outputs with the parameters of the algebraic functions used to generate the view from the reference views. Compared to similar approaches in the literature, the proposed approach has the advantage that it does not require the 3D models of the objects or a large number of views, it is extendible to other types of projections, and it is more practical for object recognition.
引用
收藏
页码:3576 / 3581
页数:6
相关论文
共 50 条
  • [1] An Orthographic Descriptor for 3D Object Learning and Recognition
    Kasaei, S. Hamidreza
    Lopes, Luis Seabra
    Tome, Ana Maria
    Oliveira, Miguel
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4158 - 4163
  • [2] Learning invariant object recognition in the visual system with continuous transformations
    Stringer, SM
    Perry, G
    Rolls, ET
    Proske, JH
    BIOLOGICAL CYBERNETICS, 2006, 94 (02) : 128 - 142
  • [3] Learning invariant object recognition in the visual system with continuous transformations
    S. M. Stringer
    G. Perry
    E. T. Rolls
    J. H. Proske
    Biological Cybernetics, 2006, 94 : 128 - 142
  • [4] Coordinate transformations in object recognition
    Graf, Markus
    PSYCHOLOGICAL BULLETIN, 2006, 132 (06) : 920 - 945
  • [5] ORTHOGRAPHIC DISTINCTIVENESS OF CONSONANTS AND RECOGNITION LEARNING
    KAUSLER, DH
    PAVUR, EJ
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1974, 102 (03): : 435 - 438
  • [6] GOOD: A global orthographic object descriptor for 3D object recognition and manipulation
    Kasaei, S. Hamidreza
    Tome, Ana Maria
    Lopes, Luis Seabra
    Oliveira, Miguel
    PATTERN RECOGNITION LETTERS, 2016, 83 : 312 - 320
  • [7] ORTHOGRAPHIC REDUNDANCY IN LETTER RECOGNITION - ORTHOGRAPHIC NEIGHBORHOOD OR ORTHOGRAPHIC CONTEXT
    DYDEWALLE, G
    AUWERS, T
    EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 1994, 6 (03): : 287 - 310
  • [8] Learning features for object recognition
    Lin, YQ
    Bhanu, B
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 2227 - 2239
  • [9] Object detection and recognition by learning
    Choksuriwong, Anant
    Emile, Bruno
    Laurent, Helene
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 795 - 798
  • [10] Learning models for object recognition
    Felzenszwalb, PF
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, : 1056 - 1062