Fusing Attention Features and Contextual Information for Scene Recognition

被引:1
作者
Peng, Yuqing [1 ]
Liu, Xianzi [2 ]
Wang, Chenxi [1 ]
Xiao, Tengfei [1 ]
Li, Tiejun [3 ]
机构
[1] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
[2] China Shenhua Int Engn Co Ltd, Beijing 100007, Peoples R China
[3] Hebei Univ Technol, Sch Mech Engn, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Scene recognition; muti-scale attention; joint supervision; context information; CLASSIFICATION;
D O I
10.1142/S0218001422500148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming to obtain more discriminative features in scene images and overcome the impacts of intra-class differences and inter-class similarities, the paper proposes a scene recognition method that combines attention and context information. First, we introduce the attention mechanism and build a multi-scale attention model. Discriminative information considers salient objects and regions by means of channel attention and spatial attention. Besides, the central loss function joint supervision strategy is introduced to further reduce the misjudgment of intra-class differences. Second, a model based on multi-level context information is proposed to describe the positional relationship between objects, which can effectively alleviate the influence of the similarity of objects between classes. Finally, the two models are merged to give full play to the compatibility of features, so that the final feature representation not only focuses on the effective discriminant information, but also manifests the relative position relationship between significant objects. Extensive experiments have proved that the method in this paper effectively solves the problem of insufficient feature representation in scene recognition tasks, and improves the accuracy of scene recognition.
引用
收藏
页数:21
相关论文
共 31 条
  • [1] Coordinate CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction
    Bai, Shuang
    Tang, Huadong
    An, Shan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 : 298 - 309
  • [2] CBSA: Content-based soft annotation for multimodal image retrieval using Bayes point machines
    Chang, E
    Goh, K
    Sychay, G
    Wu, G
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (01) : 26 - 38
  • [3] Scene Recognition With Prototype-Agnostic Scene Layout
    Chen, Gongwei
    Song, Xinhang
    Zeng, Haitao
    Jiang, Shuqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5877 - 5888
  • [4] Cheng G, 2015, PROC CVPR IEEE, P1173, DOI 10.1109/CVPR.2015.7298721
  • [5] Scene recognition with objectness
    Cheng, Xiaojuan
    Lu, Jiwen
    Feng, Jianjiang
    Yuan, Bo
    Zhou, Jie
    [J]. PATTERN RECOGNITION, 2018, 74 : 474 - 487
  • [6] Locally Supervised Deep Hybrid Model for Scene Recognition
    Guo, Sheng
    Huang, Weilin
    Wang, Limin
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) : 808 - 820
  • [7] Biologically inspired task oriented gist model for scene classification
    Han, Yina
    Liu, Guizhong
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (01) : 76 - 95
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] A Discriminative Representation of Convolutional Features for Indoor Scene Recognition
    Khan, Salman H.
    Hayat, Munawar
    Bennamoun, Mohammed
    Togneri, Roberto
    Sohel, Ferdous A.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3372 - 3383
  • [10] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90