Metric learning based object recognition and retrieval

被引:14
作者
Yang, Jianyu [1 ]
Xu, Haoran [1 ]
机构
[1] Soochow Univ, Sch Urban Rail Transportat, 8 Jixue Rd, Suzhou, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Metric learning; Object recognition; Object retrieval; Robot learning; Intelligent analysis; NONRIGID SHAPES; REPRESENTATION; ROBUST; DESCRIPTORS; SYSTEMS;
D O I
10.1016/j.neucom.2016.01.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and retrieval is an important topic in intelligent robotics and pattern recognition, where an effective recognition engine plays an important role. To achieve a good performance, we propose a metric learning based object recognition algorithm. To represent the invariant object features, including local shape details and global body parts, a novel multi-scale invariant descriptor is proposed. Different types of invariant features are represented in multiple scales, which makes the following metric learning algorithm effective. To reduce the effect of noise and improve the computing efficiency, an adaptive discrete contour evolution method is also proposed to extract the salient feature points of object. The recognition algorithm is explored based on metric learning method and the object features are summarized as histograms inspired from the Bag of Words (BoW). The metric learning methods are employed to learn object features according to their scales. The proposed method is invariant to rotation, scale variation, intra-class variation, articulated deformation and partial occlusion. The recognition process is fast and robust for noise. This method is evaluated on multiple benchmark datasets and the comparable experimental results indicate the effectiveness of our method. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [1] A multiscale representation method for nonrigid shapes with a single closed contour
    Adamek, T
    O'Connor, NE
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (05) : 742 - 753
  • [2] Alajlan N., 2007, PATTERN RECOGNIT, V40
  • [3] Geometry-based image retrieval in binary image databases
    Alajlan, Naif
    Kamel, Mohamed S.
    Freeman, George H.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (06) : 1003 - 1013
  • [4] Shape retrieval using triangle-area representation and dynamic space warping
    Alajlan, Naif
    El Rube, Ibrahim
    Kamel, Mohamed S.
    Freeman, George
    [J]. PATTERN RECOGNITION, 2007, 40 (07) : 1911 - 1920
  • [5] [Anonymous], 2003, ICML
  • [6] [Anonymous], 1997, Image Databases and Multi-Media Search, DOI DOI 10.1142/9789812797988_
  • [7] [Anonymous], 2004, WORKSH STAT LEARN CO
  • [8] Robust shape similarity retrieval based on contour segmentation polygonal multiresolution and elastic matching
    Attalla, E
    Siy, P
    [J]. PATTERN RECOGNITION, 2005, 38 (12) : 2229 - 2241
  • [9] Path similarity skeleton graph matching
    Bai, Xiang
    Latecki, Longin Jan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (07) : 1282 - 1292
  • [10] Shape Vocabulary: A Robust and Efficient Shape Representation for Shape Matching
    Bai, Xiang
    Rao, Cong
    Wang, Xinggang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3935 - 3949