Semantic Hierarchy-based Convolutional Neural Networks for Image Classification

被引:2
作者
Inoue, Matheus [1 ,2 ]
Forster, Carlos Henrique [2 ]
dos Santos, Antonio Carlos [3 ]
机构
[1] Univ Sao Paulo, Polytech Sch, Sao Paulo, Brazil
[2] Aeronaut Inst Technol, Comp Sci Div, Sao Jose Dos Campos, Brazil
[3] Itau Unibanco, Data Sci Team, Sao Paulo, Brazil
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
Convolutional Neural Networks; Hierarchical Image classification; Deep Learning; Computer Vision;
D O I
10.1109/ijcnn48605.2020.9207246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, three variations of hierarchical topologies of Convolutional Neural Networks (CNNs), two of which being original proposals introduced by this work, were tested to assess their impact on image classification problems. The hierarchical structure groups the images based on the semantic meaning of the classes, from the coarsest classes to the finest classes, forming hierarchical levels. The hierarchical models made were compared to a baseline regular CNN on benchmark image classification datasets, the Fashion-MNIST and CIFAR-100 datasets. Another contribution of this work is a new training strategy for hierarchical CNNs, that aims to be simple to implement and to produce a smooth loss during training, increasing stability, while maintaining characteristics like the transitioning from coarse-to-fine level emphasis during training, learning first high-level details and then specific details that differentiate the fine level classes. The hierarchical models produce outputs for each hierarchical level, which can lead to more interpretable results. Results suggest that providing semantic hierarchies can improve fine level accuracy on CNNs, while bringing relevant hierarchical information from their other coarser level outputs.
引用
收藏
页数:8
相关论文
共 20 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2018, ARXIV180205800
[3]  
[Anonymous], 2015, Tech. Rep.
[4]  
[Anonymous], 2018, ARXIV180600712
[5]   Performance Analysis of Google Colaboratory as a Tool for Accelerating Deep Learning Applications [J].
Carneiro, Tiago ;
Medeiros Da Nobrega, Raul Victor ;
Nepomuceno, Thiago ;
Bian, Gui-Bin ;
De Albuquerque, Victor Hugo C. ;
Reboucas Filho, Pedro Pedrosa .
IEEE ACCESS, 2018, 6 :61677-61685
[6]   Do Semantic Parts Emerge in Convolutional Neural Networks? [J].
Gonzalez-Garcia, Abel ;
Modolo, Davide ;
Ferrari, Vittorio .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (05) :476-494
[7]   Feasibility study on quality evaluation of Jadeite-jade color green based on GemDialogue color chip [J].
Guo, Ying ;
Zong, Xiang ;
Qi, Ming .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (01) :841-856
[8]  
Hinton G., 2009, Handbook of Systemic Autoimmune Diseases
[9]   DualNet: Learn Complementary Features for Image Recognition [J].
Hou, Saihui ;
Liu, Xu ;
Wang, Zilei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :502-510
[10]  
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167