Scene categorization based on local-global feature fusion and multi-scale multi-spatial resolution encoding

被引:2
作者
Qin, Jianzhao [1 ]
Deng, Fuqin [2 ]
Yung, Nelson H. C. [3 ]
机构
[1] Smart China Holdings Ltd, Smart China Res, Shatin, Hong Kong, Peoples R China
[2] ASM Pacific Technol Ltd, Kwai Chung, Hong Kong, Peoples R China
[3] Univ Hong Kong, Dept Elect & Elect Engn, Lab Intelligent Transportat Syst Res, Pokfulam, Hong Kong, Peoples R China
关键词
Scene categorization; Local-global feature fusion; Multi-scale multi-spatial resolution encoding; CLASSIFICATION;
D O I
10.1007/s11760-014-0650-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the bag-of-contextual-visual-word (BOCVW) models, we propose a scene categorization method based on local-global feature fusion and multi-scale multi-spatial resolution encoding. First, the performances of the BOCVW models belonging to different features are mutually reinforced by fusing other types of features within local regions. Then, the spatial configuration information is explored using a multi-scale multi-spatial resolution encoding approach. Furthermore, these encoded BOCVW models are globally fused using an improved maximum-margin optimization strategy, which considers the margin between input vectors of different categories and the diameter of the smallest ball containing feature vectors simultaneously. The proposed method has been evaluated on three scene categorization datasets consisting of scene categories 8, 15, and 67, respectively. And its effectiveness has been verified by these experimental results.
引用
收藏
页码:S145 / S154
页数:10
相关论文
共 37 条
  • [1] [Anonymous], 2007, Computer Vision
  • [2] [Anonymous], P 2005 IEEE COMP SOC
  • [3] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [4] Scene classification using a hybrid generative/discriminative approach
    Bosch, Anna
    Zisserman, Andrew
    Munoz, Xavier
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 712 - 727
  • [5] Bosch A, 2006, LECT NOTES COMPUT SC, V3954, P517
  • [6] Learning Mid-Level Features For Recognition
    Boureau, Y-Lan
    Bach, Francis
    LeCun, Yann
    Ponce, Jean
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2559 - 2566
  • [7] Fei-Fei L, 2005, PROC CVPR IEEE, P524
  • [8] On Feature Combination for Multiclass Object Classification
    Gehler, Peter
    Nowozin, Sebastian
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 221 - 228
  • [9] Unsupervised learning by probabilistic latent semantic analysis
    Hofmann, T
    [J]. MACHINE LEARNING, 2001, 42 (1-2) : 177 - 196
  • [10] Kwitt Roland, 2012, Computer Vision - ECCV 2012. Proceedings of the 12th European Conference on Computer Vision, P359, DOI 10.1007/978-3-642-33765-9_26