Biologically inspired task oriented gist model for scene classification

被引:8
作者
Han, Yina [1 ,2 ]
Liu, Guizhong [3 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Engn, Xian 710072, Peoples R China
[2] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Scene classification; Scene gist; Localized multiple kernel learning; SVM; IMAGE FEATURES; RECOGNITION;
D O I
10.1016/j.cviu.2012.10.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Capturing the scene gist is account for rapid and accurate scene classification in human visual system. This paper presents a biologically inspired task oriented gist model (BT-Gist) that attempts to emulate two important attributes of biological gist: holistic scene centered spatial layout representation and task oriented resolution determination. For the first attribute, we enrich the model of Oliva and Torralba by refining the low-level features in several biological plausible ways, extending the spatial layout to multiple resolution and followed by perceptually meaningful manifold analysis for a set of multi-resolution biologically inspired intrinsic manifold spatial layouts (BMSLs). Since the optimal resolution that best represents the spatial layout varies from task to task, we embody the second attribute as learning the combination of BMSLs of multiple resolution with respect to their optimal discriminative invariance trade-off for the task at hand, and then cast it in the SVM based localized multiple kernel learning (LMKL) framework, by which the kernel of each scene gist is approximated as a local combination of kernels associated to multi-resolution BMSLs. By exploring the task specific category distribution pattern over BMSL, we define the local model as a category distribution sensitive (CDS) kernel, which can accommodate both the diverse individuality of specific BMSL and the universality shared within the whole category space. Via CDS-LMKL, both the optimal resolution for spatial layouts and the final classifier can be efficiently obtained in a joint manner. We evaluate BT-Gist on four natural scene databases and one cluttered indoor scene database with a range of comparison: From different MKL methods, to various biologically inspired models and BoF based computer vision models. CDS-LMKL leads to better results compared to several existing MKL algorithms. Given the two biological attributes that the framework has to follow, BT-Gist, despite its holistic nature, outperforms existing biologically inspired models and BoF based computer vision models in natural scene classification, and competes with the object segmentation based ROI-Gist in cluttered indoor scene classification. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:76 / 95
页数:20
相关论文
共 62 条
  • [1] [Anonymous], 2006, Pattern recognition and machine learning
  • [2] Bosch A., ECCV, V4, P517
  • [3] Scene classification using a hybrid generative/discriminative approach
    Bosch, Anna
    Zisserman, Andrew
    Munoz, Xavier
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 712 - 727
  • [4] Christoudias M., 2009, TECHNICAL REPORT
  • [5] Cortes C., NIPS, P396
  • [6] Cristianini N. S., NIPS, P367
  • [7] Colour tuning in human visual cortex measured with functional magnetic resonance imaging
    Engel, S
    Zhang, XM
    Wandell, B
    [J]. NATURE, 1997, 388 (6637) : 68 - 71
  • [8] Fei-Fei L, 2005, PROC CVPR IEEE, P524
  • [9] What do we perceive in a glance of a real-world scene?
    Fei-Fei, Li
    Iyer, Asha
    Koch, Christof
    Perona, Pietro
    [J]. JOURNAL OF VISION, 2007, 7 (01):
  • [10] Frome A., ICCV, P1