Fusing Attention Features and Contextual Information for Scene Recognition

被引：1

作者：

Peng, Yuqing ^{[1
]}

Liu, Xianzi ^{[2
]}

Wang, Chenxi ^{[1
]}

Xiao, Tengfei ^{[1
]}

Li, Tiejun ^{[3
]}

机构：

[1] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China

[2] China Shenhua Int Engn Co Ltd, Beijing 100007, Peoples R China

[3] Hebei Univ Technol, Sch Mech Engn, Tianjin 300401, Peoples R China

来源：

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE | 2022年 / 36卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Scene recognition; muti-scale attention; joint supervision; context information; CLASSIFICATION;

D O I：

10.1142/S0218001422500148

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Aiming to obtain more discriminative features in scene images and overcome the impacts of intra-class differences and inter-class similarities, the paper proposes a scene recognition method that combines attention and context information. First, we introduce the attention mechanism and build a multi-scale attention model. Discriminative information considers salient objects and regions by means of channel attention and spatial attention. Besides, the central loss function joint supervision strategy is introduced to further reduce the misjudgment of intra-class differences. Second, a model based on multi-level context information is proposed to describe the positional relationship between objects, which can effectively alleviate the influence of the similarity of objects between classes. Finally, the two models are merged to give full play to the compatibility of features, so that the final feature representation not only focuses on the effective discriminant information, but also manifests the relative position relationship between significant objects. Extensive experiments have proved that the method in this paper effectively solves the problem of insufficient feature representation in scene recognition tasks, and improves the accuracy of scene recognition.

引用

页数：21

共 31 条

[1] Coordinate CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction
Bai, Shuang
Tang, Huadong
An, Shan
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 : 298 - 309
[2] CBSA: Content-based soft annotation for multimodal image retrieval using Bayes point machines
Chang, E
Goh, K
Sychay, G
Wu, G
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (01) : 26 - 38
[3] Scene Recognition With Prototype-Agnostic Scene Layout
Chen, Gongwei
Song, Xinhang
Zeng, Haitao
Jiang, Shuqiang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5877 - 5888
[4] Cheng G, 2015, PROC CVPR IEEE, P1173, DOI 10.1109/CVPR.2015.7298721
[5] Scene recognition with objectness
Cheng, Xiaojuan
Lu, Jiwen
Feng, Jianjiang
Yuan, Bo
Zhou, Jie
[J]. PATTERN RECOGNITION, 2018, 74 : 474 - 487
[6] Locally Supervised Deep Hybrid Model for Scene Recognition
Guo, Sheng
Huang, Weilin
Wang, Limin
Qiao, Yu
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) : 808 - 820
[7] Biologically inspired task oriented gist model for scene classification
Han, Yina
Liu, Guizhong
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (01) : 76 - 95
[8] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[9] A Discriminative Representation of Convolutional Features for Indoor Scene Recognition
Khan, Salman H.
Hayat, Munawar
Bennamoun, Mohammed
Togneri, Roberto
Sohel, Ferdous A.
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3372 - 3383
[10] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90

← 1 2 3 4 →