Perceptual multi-channel visual feature fusion for scene categorization

被引:14
|
作者
Sun, Xiao [1 ]
Liu, Zhenguang [2 ]
Hu, Yuxing [3 ]
Zhang, Luming [1 ]
Zimmermann, Roger [2 ]
机构
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Sch Aerosp Engn, Beijing, Peoples R China
关键词
Image kernel; Feature fusion; Scene categoriztion; Perception; MACHINE; CLASSIFICATION; MODEL;
D O I
10.1016/j.ins.2017.10.051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming at discovering the mechanism of humans perceiving visually/semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto the pre-defined semantic space. Thereby, we describe each graphlet by multiple visual features at both low-level and high-level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual clues at both the low-level and the high-level. According to human visual perception, these top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated based on the learned paths from a collection of scene images. And a kernel SVM is employed for calculating the scene categories. Comprehensive experiments on a series of well-known scene image sets shown the competitiveness and robustness of our GSK. We also demonstrated the high consistency of the predicted path with real human gaze shifting path. (C) 2017 Published by Elsevier Inc.
引用
收藏
页码:37 / 48
页数:12
相关论文
共 50 条
  • [31] Feature fusion of fruit image categorization using machine learning
    Fatima, Shameem
    Seshashayee, M.
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 : 71 - 76
  • [32] A fusion way of feature extraction for automatic categorization of music genres
    Sharma, Dhruv
    Taran, Sachin
    Pandey, Anukul
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (16) : 25015 - 25038
  • [33] Scene Categorization by Introducing Contextual Information to the Visual Words
    Qiu, Jianzhao
    Yung, Nelson H. C.
    ADVANCES IN VISUAL COMPUTING, PT 1, PROCEEDINGS, 2009, 5875 : 297 - 306
  • [34] Multi-modal sarcasm detection based on Multi-Channel Enhanced Fusion model
    Fang, Hong
    Liang, Dahao
    Xiang, Weiyu
    NEUROCOMPUTING, 2024, 578
  • [35] Retinal artery/vein classification by multi-channel multi-scale fusion network
    Junyan Yi
    Chouyu Chen
    Gang Yang
    Applied Intelligence, 2023, 53 : 26400 - 26417
  • [36] Retinal artery/vein classification by multi-channel multi-scale fusion network
    Yi, Junyan
    Chen, Chouyu
    Yang, Gang
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26400 - 26417
  • [37] Feature Fusion for Scene Text Detection
    Zhu, Zhen
    Liao, Minghui
    Shi, Baoguang
    Bai, Xiang
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 193 - 198
  • [38] CNN Based Multi-Object Segmentation and Feature Fusion for Scene Recognition
    Rafique, Adnan Ahmed
    Ghadi, Yazeed Yasin
    Alsuhibany, Suliman A.
    Chelloug, Samia Allaoua
    Jalal, Ahmad
    Park, Jeongmin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (03): : 4657 - 4675
  • [39] A Modified Multi-Channel EMG Feature for Upper Limb Motion Pattern Recognition
    Tsai, An-Chih
    Luh, Jer-Junn
    Lin, Ta-Te
    2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 3596 - 3599
  • [40] Accelerated Feature Extraction and Refinement for Improved Aerial Scene Categorization
    Tu, Xiaohan
    Yang, Laurence Tianruo
    Liu, Siping
    Li, Renfa
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17