Perceptual multi-channel visual feature fusion for scene categorization

Cited by: 14
Authors
Sun, Xiao [1]
Liu, Zhenguang [2]
Hu, Yuxing [3]
Zhang, Luming [1]
Zimmermann, Roger [2]
Affiliations
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Sch Aerosp Engn, Beijing, Peoples R China
Keywords
Image kernel; Feature fusion; Scene categorization; Perception; MACHINE; CLASSIFICATION; MODEL;
DOI
10.1016/j.ins.2017.10.051
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming to discover the mechanism by which humans perceive visually or semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto a pre-defined semantic space. We thereby describe each graphlet by multiple visual features at both the low level and the high level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual cues at both the low level and the high level. According to human visual perception, the top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated from the paths learned on a collection of scene images, and a kernel SVM is employed to predict the scene categories. Comprehensive experiments on a series of well-known scene image sets show the competitiveness and robustness of our GSK. We also demonstrate the high consistency of the predicted path with real human gaze shifting paths. (C) 2017 Published by Elsevier Inc.
Pages: 37-48
Page count: 12
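
The pipeline outlined in the abstract (rank graphlets, keep a few, connect them into a path, compare paths with a kernel) can be illustrated with a small sketch. This is not the authors' GSK: the scores are taken as given rather than learned by the sparsity-constrained ranking, the path simply follows decreasing score, and the path kernel (an average of Gaussian similarities between graphlets at the same path position) is an assumption chosen for simplicity. The names `top_k_path`, `rbf`, `path_kernel`, and `gram_matrix` are all hypothetical.

```python
import math


def top_k_path(graphlets, scores, k):
    """Keep the k highest-scoring graphlets, ordered by decreasing score.

    Stand-in for the ranking step: the scores are assumed to be given,
    not learned as in the paper.
    """
    ranked = sorted(range(len(graphlets)), key=lambda i: scores[i], reverse=True)
    return [graphlets[i] for i in ranked[:k]]


def rbf(u, v, gamma=1.0):
    """Gaussian (RBF) similarity between two feature vectors."""
    d2 = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * d2)


def path_kernel(p, q, gamma=1.0):
    """Average RBF similarity between graphlets at the same path position.

    Assumed form, not the GSK from the paper. Both paths must have the
    same, non-zero length.
    """
    if len(p) != len(q) or not p:
        raise ValueError("paths must be non-empty and of equal length")
    return sum(rbf(a, b, gamma) for a, b in zip(p, q)) / len(p)


def gram_matrix(paths, gamma=1.0):
    """Pairwise kernel values for a list of equal-length paths."""
    return [[path_kernel(p, q, gamma) for q in paths] for p in paths]


# Toy example: three graphlets with 2-D features and assumed saliency scores.
graphlets = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
scores = [0.2, 0.9, 0.5]
path_a = top_k_path(graphlets, scores, k=2)   # [(1.0, 0.0), (0.0, 2.0)]
path_b = [(1.0, 0.5), (0.0, 1.5)]             # path from a second, made-up image
K = gram_matrix([path_a, path_b])
```

A Gram matrix like `K` is the kind of input a kernel SVM with a precomputed kernel would accept, for example scikit-learn's `SVC(kernel="precomputed")`. Whether the paper's experiments were run that way is not stated in the abstract.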