Zernike Pooling: Generalizing Average Pooling Using Zernike Moments

被引:6
作者
Theodoridis, Thomas [1 ]
Loumponias, Kostas [1 ]
Vretos, Nicholas [1 ]
Daras, Petros [1 ]
机构
[1] Ctr Res & Technol Hellas CERTH, Informat Technol Inst ITI, Thessaloniki 57001, Greece
关键词
Neural networks; pooling; Zernike moments; image classification; RECOGNITION; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3108630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the established neural network architectures in computer vision are essentially composed of the same building blocks (e.g., convolutional, normalization, regularization, pooling layers, etc.), with their main difference being the connectivity of these components within the architecture and not the components themselves. In this paper we propose a generalization of the traditional average pooling operator. Based on the requirements of effciency (to provide information without repetition), equivalence (to be able to produce the same output as average pooling) and extendability (to provide a natural way of obtaining novel information), we arrive at a formulation that generalizes average pooling using the Zernike moments. Experimental results on Cifar 10, Cifar 100 and Rotated MNIST data-sets showed that the proposed method was able to outperform the two baseline approaches, global average pooling and average pooling 2 x 2, as well as the two variants of Stochastic pooling and AlphaMEX in every case. A worst-case performance analysis on Cifar-100 showed that significant gains in classification accuracy can be realised with only a modest 10% increase in training time.
引用
收藏
页码:121128 / 121136
页数:9
相关论文
共 43 条
  • [1] Interpretation of intelligence in CNN-pooling processes: a methodological survey
    Akhtar, Nadeem
    Ragavendran, U.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (03) : 879 - 898
  • [2] Arora S, 2009, COMPUTATIONAL COMPLEXITY: A MODERN APPROACH, P1, DOI 10.1017/CBO9780511804090
  • [3] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [4] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
  • [5] A NOVEL METHOD FOR EARLY DIAGNOSIS OF ALZHEIMER'S DISEASE BASED ON PSEUDO ZERNIKE MOMENT FROM STRUCTURAL MRI
    Gorji, H. T.
    Haddadnia, J.
    [J]. NEUROSCIENCE, 2015, 305 : 361 - 371
  • [6] Grauman K, 2005, IEEE I CONF COMP VIS, P1458
  • [7] An efficient feature extraction method with pseudo-Zernike moment in RBF neural network-based human face recognition system
    Haddadnia, J
    Ahmadi, M
    Faez, K
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (09) : 890 - 901
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
  • [10] Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
    Hou, Qibin
    Zhang, Li
    Cheng, Ming-Ming
    Feng, Jiashi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4002 - 4011