Modeling Image Patches with a Generic Dictionary of Mini-Epitomes

被引:6
|
作者
Papandreou, George [1 ]
Chen, Liang-Chieh [2 ]
Yuille, Alan L. [2 ]
机构
[1] TTI Chicago, Chicago, IL USA
[2] UC Los Angeles, Los Angeles, CA 90095 USA
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
CLASSIFICATION; TEXTURE; SPARSE; FEATURES;
D O I
10.1109/CVPR.2014.264
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this paper is to question the necessity of features like SIFT in categorical visual recognition tasks. As an alternative, we develop a generative model for the raw intensity of image patches and show that it can support image classification performance on par with optimized SIFT-based techniques in a bag-of-visual-words setting. Key ingredient of the proposed model is a compact dictionary of mini-epitomes, learned in an unsupervised fashion on a large collection of images. The use of epitomes allows us to explicitly account for photometric and position variability in image appearance. We show that this flexibility considerably increases the capacity of the dictionary to accurately approximate the appearance of image patches and support recognition tasks. For image classification, we develop histogram-based image encoding methods tailored to the epitomic representation, as well as an "epitomic footprint" encoding which is easy to visualize and highlights the generative nature of our model. We discuss in detail computational aspects and develop efficient algorithms to make the model scalable to large tasks. The proposed techniques are evaluated with experiments on the challenging PASCAL VOC 2007 image classification benchmark.
引用
收藏
页码:2059 / 2066
页数:8
相关论文
共 19 条
  • [1] Learning a Dictionary of Shape Epitomes with Applications to Image Labeling
    Chen, Liang-Chieh
    Papandreou, George
    Yuille, Alan L.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 337 - 344
  • [2] K-SVD dictionary learning and image reconstruction based on variance of image patches
    Cong, Yuliang
    Zhang, Shuyang
    Lian, Yuying
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 254 - 257
  • [3] NEIGHBORING IMAGE PATCHES EMBEDDING FOR BACKGROUND MODELING
    Zhong, Bineng
    Yao, Hongxun
    Liu, Shaohui
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3209 - 3212
  • [4] Hard Patches Mining for Masked Image Modeling
    Wang, Haochen
    Song, Kaiyou
    Fan, Junsong
    Wang, Yuxi
    Xie, Jin
    Zhang, Zhaoxiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10375 - 10385
  • [5] Dictionary Learning Algorithm for Compressed-Sensing Based on the Entropy of Image Patches
    Liu L.
    Wang X.-T.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2019, 39 (05): : 520 - 523
  • [6] Deep Convolutional Dictionary Learning Denoising Method Based on Distributed Image Patches
    Yin, Luqiao
    Gao, Wenqing
    Liu, Jingjing
    ELECTRONICS, 2024, 13 (07)
  • [7] Blind Image Deblurring Using Sparse Dictionary And Orthogonal Projections Of Filtered Patches
    EL-Henawy, I. M.
    Ali, A. A.
    Ahmed, Kareem
    Adel, Hadeer
    2015 TENTH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2015, : 274 - 280
  • [8] JOINT DICTIONARY LEARNING AND TOPIC MODELING FOR IMAGE CLUSTERING
    Li, Lingbo
    Zhou, Mingyuan
    Wang, Eric
    Carin, Lawrence
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2168 - 2171
  • [9] Sparse and Redundant Modeling of Image Content Using an Image-Signature-Dictionary
    Aharon, Michal
    Elad, Michael
    SIAM JOURNAL ON IMAGING SCIENCES, 2008, 1 (03): : 228 - 247