Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

被引:0
|
作者
Behera, Ardhendu [1 ]
Wharton, Zachary [1 ]
Hewage, Pradeep R. P. G. [1 ]
Bera, Asish [1 ]
机构
[1] Edge Hill Univ, Dept Comp Sci, St Helen Rd, Ormskirk L39 4QP, Lancs, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative object pose and parts information for image recognition. For fine-grained recognition, context-aware rich feature representation of object/scene plays a key role since it exhibits a significant variance in the same subcategory and subtle variance among different subcategories. Finding the subtle variance that fully characterizes the object/scene is not straightforward. To address this, we propose a novel context-aware attentional pooling (CAP) that effectively captures subtle changes via sub-pixel gradients, and learns to attend informative integral regions and their importance in discriminating different subcategories without requiring the bounding-box and/or distinguishable part annotations. We also introduce a novel feature encoding by considering the intrinsic consistency between the informativeness of the integral regions and their spatial structures to capture the semantic correlation among them. Our approach is simple yet extremely effective and can be easily applied on top of a standard classification backbone network. We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets. Our method significantly outperforms the SotA approaches on six datasets and is very competitive with the remaining two.
引用
收藏
页码:929 / 937
页数:9
相关论文
共 50 条
  • [31] Pairwise Confusion for Fine-Grained Visual Classification
    Dubey, Abhimanyu
    Gupta, Otkrist
    Guo, Pei
    Raskar, Ramesh
    Farrell, Ryan
    Naik, Nikhil
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 71 - 88
  • [32] Fine-Grained Meetup Events Extraction Through Context-Aware Event Argument Positioning and Recognition
    Lin, Yuan-Hao
    Chang, Chia-Hui
    Chuang, Hsiu-Min
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [33] Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization
    Hu, Yutao
    Yang, Yandan
    Zhang, Jun
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 301 - 314
  • [34] Fine-Grained Vehicle Classification With Channel Max Pooling Modified CNNs
    Ma, Zhanyu
    Chang, Dongliang
    Xie, Jiyang
    Ding, Yifeng
    Wen, Shaoguo
    Li, Xiaoxu
    Si, Zhongwei
    Guo, Jun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (04) : 3224 - 3233
  • [35] A fine-grained context-aware access control model for health care and life science linked data
    Liu, Zhengtao
    Wang, Jiandong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (22) : 14263 - 14280
  • [36] A fine-grained context-aware access control model for health care and life science linked data
    Zhengtao Liu
    Jiandong Wang
    Multimedia Tools and Applications, 2016, 75 : 14263 - 14280
  • [37] Fine-grained entity type classification with adaptive context
    Liu, Jin
    Wang, Lina
    Zhou, Mingji
    Wang, Jin
    Lee, Sungyoung
    SOFT COMPUTING, 2018, 22 (13) : 4307 - 4318
  • [38] Fine-grained entity type classification with adaptive context
    Jin Liu
    Lina Wang
    Mingji Zhou
    Jin Wang
    Sungyoung Lee
    Soft Computing, 2018, 22 : 4307 - 4318
  • [39] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [40] Adaptive Destruction Learning for Fine-grained Visual Classification
    Zhang, Riheng
    Tan, Min
    Mao, Xiaoyang
    Gao, Zhigang
    Gu, Xiaoling
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 946 - 950