Perceptual multi-channel visual feature fusion for scene categorization

被引:14
作者
Sun, Xiao [1 ]
Liu, Zhenguang [2 ]
Hu, Yuxing [3 ]
Zhang, Luming [1 ]
Zimmermann, Roger [2 ]
机构
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Sch Aerosp Engn, Beijing, Peoples R China
关键词
Image kernel; Feature fusion; Scene categoriztion; Perception; MACHINE; CLASSIFICATION; MODEL;
D O I
10.1016/j.ins.2017.10.051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming at discovering the mechanism of humans perceiving visually/semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto the pre-defined semantic space. Thereby, we describe each graphlet by multiple visual features at both low-level and high-level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual clues at both the low-level and the high-level. According to human visual perception, these top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated based on the learned paths from a collection of scene images. And a kernel SVM is employed for calculating the scene categories. Comprehensive experiments on a series of well-known scene image sets shown the competitiveness and robustness of our GSK. We also demonstrated the high consistency of the predicted path with real human gaze shifting path. (C) 2017 Published by Elsevier Inc.
引用
收藏
页码:37 / 48
页数:12
相关论文
共 50 条
  • [41] Adaptive Multi-Channel Event Segmentation and Feature Extraction for Monitoring Health Outcomes
    She, Xichen
    Zhai, Yaya
    Henao, Ricardo
    Woods, Christopher W.
    Chiu, Christopher
    Ginsburg, Geoffrey S.
    Song, Peter X. K.
    Hero, Alfred O.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2021, 68 (08) : 2377 - 2388
  • [42] Multi-channel physiological signal emotion recognition based on ReliefF feature selection
    Zhang, Yong
    Cheng, Cheng
    Chen, Tianzhen
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 725 - 730
  • [43] Bird Species Identification Using Spectrogram Based on Multi-Channel Fusion of DCNNs
    Zhang, Feiyu
    Zhang, Luyang
    Chen, Hongxiang
    Xie, Jiangjian
    ENTROPY, 2021, 23 (11)
  • [44] ConFuse: Convolutional Transform Learning Fusion Framework For Multi-Channel Data Analysis
    Gupta, Pooja
    Maggu, Jyoti
    Majumdar, Angshul
    Chouzenoux, Emilie
    Chierchia, Giovanni
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 1986 - 1990
  • [45] Multi-View Scene Classification Based on Feature Integration and Evidence Decision Fusion
    Zhou, Weixun
    Shi, Yongxin
    Huang, Xiao
    REMOTE SENSING, 2024, 16 (05)
  • [46] Perceptual authentication hashing for digital images based on multi-domain feature fusion
    Cao, Fang
    Yao, Shifei
    Zhou, Yuanding
    Yao, Heng
    Qin, Chuan
    SIGNAL PROCESSING, 2024, 223
  • [47] Multi-Channel Fusion Classification Method Based on Time-Series Data
    Jin, Xue-Bo
    Yang, Aiqiang
    Su, Tingli
    Kong, Jian-Lei
    Bai, Yuting
    SENSORS, 2021, 21 (13)
  • [48] Convolutional Feature Fusion for Multi-Language Text Detection in Natural Scene Images
    Chandio, Asghar Ali
    Pickering, Mark
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET), 2019,
  • [49] Scene categorization with multiscale category-specific visual words
    Qin, Jianzhao
    Yung, Nelson H. C.
    OPTICAL ENGINEERING, 2009, 48 (04)
  • [50] Multi-channel Hammerstein Identification
    Wei Yue
    Zhang Hai-Tao
    Chen Michael ZhiQiang
    Zhou Tao
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 6227 - 6232