Cross-Layer Feature based Multi-Granularity Visual Classification

被引:0
|
作者
Chen, Junhan [1 ]
Chang, Dongliang [1 ]
Xie, Jiyang [1 ]
Du, Ruoyi [1 ]
Ma, Zhanyu [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Pattern Recognit & Intelligent Syst Lab, Beijing, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2022年
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
multi granularity visual classification; feature pyramid structure; disentanglement; reinforcement;
D O I
10.1109/VCIP56404.2022.10008879
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In contrast to traditional fine-grained visual classification, multi-granularity visual classification is no longer limited to identifying the different sub-classes belonging to the same super-class (e.g., bird species, cars, and aircraft models). Instead, it gives a sequence of labels from coarse to fine (e.g., Passeriformes -> Corvidae -> Fish Crow), which is more convenient in practice. The key to solving this task is how to use the relationships between the different levels of labels to learn feature representations that contain different levels of granularity. Interestingly, the feature pyramid structure naturally implies different granularity of feature representation, with the shallow layers representing coarse-grained features and the deep layers representing fine-grained features. Therefore, in this paper, we exploit this property of the feature pyramid structure to decouple features and obtain feature representations corresponding to different granularities. Specifically, we use shallow features for coarse-grained classification and deep features for fine-grained classification. In addition, to enable fine-grained features to enhance the coarse-grained classification, we propose a feature reinforcement module based on the feature pyramid structure, where deep features are first upsampled and then combined with shallow features to make decisions. Experimental results on three widely used fine-grained image classification datasets such as CUB-200-2011, Stanford Cars, and FGVC-Aircraft validate the method's effectiveness. Code available at https://github.com/PRIS-CV/CGVC
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Multi-Frequency Feature Enhancement for Multi-Granularity Visual Classification
    Fu, Meijiang
    Zheng, Yixiao
    Chang, Dongliang
    Li, Wenpan
    Ma, Zhanyu
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 484 - 489
  • [2] A cross-layer optimization based integrated routing and grooming algorithm for green multi-granularity transport networks
    Wang, Xingwei
    Cheng, Hui
    Li, Keqin
    Li, Jie
    Sun, Jiajia
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (06) : 807 - 822
  • [3] Multi-Granularity Feature Distillation Learning Network for Fine-Grained Visual Classification
    Cai, Yuhang
    Ke, Xiao
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 300 - 303
  • [4] Multi-Granularity Feature Fusion for Enhancing Encrypted Traffic Classification
    Ding, Quan
    Zha, Zhengpeng
    Li, Yanjun
    Ling, Zhenhua
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 1090 - 1097
  • [5] A Pedestrian Tracking Algorithm Based on Multi-Granularity Feature
    Wang Z.
    Miao D.
    Zhao C.
    Luo S.
    Wei Z.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (05): : 996 - 1002
  • [6] Multi-granularity visual explanations for CNN
    Bao, Huanan
    Wang, Guoyin
    Li, Shuai
    Liu, Qun
    KNOWLEDGE-BASED SYSTEMS, 2022, 253
  • [7] Deconfounded hierarchical multi-granularity classification
    Zhao, Ziyu
    Gan, Leilei
    Shen, Tao
    Kuang, Kun
    Wu, Fei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [8] Multi-Granularity Cross-Attention Network for Visual Question Answering
    Wang, Yue
    Gao, Wei
    Cheng, Xinzhou
    Wang, Xin
    Zhao, Huiying
    Xie, Zhipu
    Xu, Lexi
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 2098 - 2103
  • [9] Automatic ICD Coding Based on Multi-granularity Feature Fusion
    Yu, Ying
    Duan, Junwen
    Jiang, Han
    Wang, Jianxin
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2022, 2022, 13760 : 19 - 29
  • [10] Multi-Granularity Ensemble Classification Algorithm Based on Attribute Representation
    Zhang Q.-H.
    Zhi X.-C.
    Wang G.-Y.
    Yang F.
    Xue F.-Z.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (08): : 1712 - 1729