Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

被引:102
作者
Seo, Hae Jong [1 ]
Milanfar, Peyman [1 ]
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
关键词
Object detection; image representation; correlation and regression analysis; BAYES DECISION RULE; IMAGE; RECOGNITION; REPRESENTATION; SCENE; SHAPE;
D O I
10.1109/TPAMI.2009.153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a generic detection/localization algorithm capable of searching for a visual object of interest without training. The proposed method operates using a single example of an object of interest to find similar matches, does not require prior knowledge (learning) about objects being sought, and does not require any preprocessing step or segmentation of a target image. Our method is based on the computation of local regression kernels as descriptors from a query, which measure the likeness of a pixel to its surroundings. Salient features are extracted from said descriptors and compared against analogous features from the target image. This comparison is done using a matrix generalization of the cosine similarity measure. We illustrate optimality properties of the algorithm using a naive-Bayes framework. The algorithm yields a scalar resemblance map, indicating the likelihood of similarity between the query and all patches in the target image. By employing nonparametric significance tests and nonmaxima suppression, we detect the presence and location of objects similar to the given query. The approach is extended to account for large variations in scale and rotation. High performance is demonstrated on several challenging data sets, indicating successful detection of objects in diverse contexts and under different imaging conditions.
引用
收藏
页码:1688 / 1704
页数:17
相关论文
共 67 条
  • [1] Learning to detect objects in images via a sparse, part-based representation
    Agarwal, S
    Awan, A
    Roth, D
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (11) : 1475 - 1490
  • [2] Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient
    Ahlgren, P
    Jarneving, B
    Rousseau, R
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (06): : 550 - 560
  • [3] Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning
    Ali, Saad
    Shah, Mubarak
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) : 288 - 303
  • [4] [Anonymous], 1970, Applied factor analysis
  • [5] [Anonymous], 2007, 2007 IEEE COMP SOC C, DOI DOI 10.1109/IMTC.2007.379340
  • [6] [Anonymous], 1973, Pattern Classification and Scene Analysis
  • [7] Shape matching and object recognition using shape contexts
    Belongie, S
    Malik, J
    Puzicha, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 509 - 522
  • [8] BENGIO Y, 2005, ADV NEURAL INFORM PR, V18, P115
  • [9] Correlational spectral clustering
    Blaschko, Matthew B.
    Lampert, Christoph H.
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 93 - +
  • [10] In defense of Nearest-Neighbor based image classification
    Boiman, Oren
    Shechtman, Eli
    Irani, Michal
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 1992 - +