An Attention-Based Architecture for Hierarchical Classification With CNNs

被引:5
作者
Pizarro, Ivan [1 ]
Nanculef, Ricardo [1 ]
Valle, Carlos [2 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Informat, Valparaiso 2390123, Chile
[2] Univ Playa Ancha, Dept Data Sci & Informat, Valparaiso 2360072, Chile
关键词
Taxonomy; Measurement; Computer architecture; Training; Convolutional neural networks; Classification algorithms; Predictive models; Attention mechanisms; deep learning; hierarchical classification; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2023.3263472
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Branch Convolutional Neural Nets have become a popular approach for hierarchical classification in computer vision and other areas. Unfortunately, these models often led to hierarchical inconsistency: predictions for the different hierarchy levels do not necessarily respect the class-subclass constraints imposed by the hierarchy. Several architectures to connect the branches have arisen to overcome this limitation. In this paper, we propose a more straightforward and flexible method: let the neural net decide how these branches must be connected. We achieve this by formulating an attention mechanism that dynamically determines how branches influence each other during training and inference. Experiments on image classification benchmarks show that the proposed method can outperform state-of-the-art models in terms of hierarchical performance metrics and consistency. Furthermore, although sometimes we found a slightly lower performance at the deeper level of the hierarchy, the model predicts much more accurately the ground-truth path between a concept and its ancestors in the hierarchy. This result suggests that the model does learn not only local class memberships but also hierarchical dependencies between concepts.
引用
收藏
页码:32972 / 32995
页数:24
相关论文
共 54 条
  • [21] Kiritchenko S, 2006, LECT NOTES ARTIF INT, V4013, P395, DOI 10.1007/11766247_34
  • [22] Condition-CNN: A hierarchical multi-label fashion image classification model
    Kolisnik, Brendan
    Hogan, Isaac
    Zulkernine, Farhana
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182
  • [23] Evaluation measures for hierarchical classification: a unified view and novel approaches
    Kosmopoulos, Aris
    Partalas, Ioannis
    Gaussier, Eric
    Paliouras, Georgios
    Androutsopoulos, Ion
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (03) : 820 - 865
  • [24] Kowsari Kamran., 2019, Proceedings of the Future Technologies Conference, P750
  • [25] Krizhevsky A., 2009, Handbook of Systemic Autoimmune Diseases, V1
  • [26] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [27] Integrating multi-level deep learning and concept ontology for large-scale visual recognition
    Kuang, Zhenzhong
    Yu, Jun
    Li, Zongmin
    Zhang, Baopeng
    Fan, Jianping
    [J]. PATTERN RECOGNITION, 2018, 78 : 198 - 214
  • [28] Learn class hierarchy using convolutional neural networks
    La Grassa, Riccardo
    Gallo, Ignazio
    Landro, Nicola
    [J]. APPLIED INTELLIGENCE, 2021, 51 (10) : 6622 - 6632
  • [29] Yahoo! as an ontology - Using Yahoo! categories to describe documents
    Labrou, Y
    Finin, T
    [J]. PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 180 - 187
  • [30] Emotion recognition using a hierarchical binary decision tree approach
    Lee, Chi-Chun
    Mower, Emily
    Busso, Carlos
    Lee, Sungbok
    Narayanan, Shrikanth
    [J]. SPEECH COMMUNICATION, 2011, 53 (9-10) : 1162 - 1171