Semantic Hierarchy-based Convolutional Neural Networks for Image Classification

被引:2
作者
Inoue, Matheus [1 ,2 ]
Forster, Carlos Henrique [2 ]
dos Santos, Antonio Carlos [3 ]
机构
[1] Univ Sao Paulo, Polytech Sch, Sao Paulo, Brazil
[2] Aeronaut Inst Technol, Comp Sci Div, Sao Jose Dos Campos, Brazil
[3] Itau Unibanco, Data Sci Team, Sao Paulo, Brazil
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
Convolutional Neural Networks; Hierarchical Image classification; Deep Learning; Computer Vision;
D O I
10.1109/ijcnn48605.2020.9207246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, three variations of hierarchical topologies of Convolutional Neural Networks (CNNs), two of which being original proposals introduced by this work, were tested to assess their impact on image classification problems. The hierarchical structure groups the images based on the semantic meaning of the classes, from the coarsest classes to the finest classes, forming hierarchical levels. The hierarchical models made were compared to a baseline regular CNN on benchmark image classification datasets, the Fashion-MNIST and CIFAR-100 datasets. Another contribution of this work is a new training strategy for hierarchical CNNs, that aims to be simple to implement and to produce a smooth loss during training, increasing stability, while maintaining characteristics like the transitioning from coarse-to-fine level emphasis during training, learning first high-level details and then specific details that differentiate the fine level classes. The hierarchical models produce outputs for each hierarchical level, which can lead to more interpretable results. Results suggest that providing semantic hierarchies can improve fine level accuracy on CNNs, while bringing relevant hierarchical information from their other coarser level outputs.
引用
收藏
页数:8
相关论文
共 20 条
[11]   Tree-CNN: from generalization to specialization [J].
Jiang, Shenwang ;
Xu, Tingfa ;
Guo, Jie ;
Zhang, Jizhou .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
[12]  
LeCun Y., 2010, MNIST handwritten digit database, V2
[13]  
Rawat W, 2017, NEURAL COMPUT, V29, P2352, DOI [10.1162/neco_a_00990, 10.1162/NECO_a_00990]
[14]   Cyclical Learning Rates for Training Neural Networks [J].
Smith, Leslie N. .
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :464-472
[15]  
Srivastava N, 2014, J MACH LEARN RES, V15, P1929
[16]  
Wang L, LEARNING IMAGE REPRE
[17]  
Xiao H., 2017, Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms
[18]   HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition [J].
Yan, Zhicheng ;
Zhang, Hao ;
Piramuthu, Robinson ;
Jagadeesh, Vignesh ;
DeCoste, Dennis ;
Di, Wei ;
Yu, Yizhou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2740-2748
[19]   A new image classification model based on brain parallel interaction mechanism [J].
Yu, Yingchao ;
Hao, Kuangrong ;
Ding, Yongsheng .
NEUROCOMPUTING, 2018, 315 :190-197
[20]  
Zhu X., 2017, ARXIV170909890