GOING SHALLOWER WITH MOBILENETS: ON THE IMPACT OF WAVELET POOLING

被引:4
作者
El-Khamy, Said [1 ]
Al-Kabbany, Ahmad [2 ,3 ,4 ]
EL-Band, Shimaa [5 ]
机构
[1] Alexandria Univ, Fac Engn, Alexandria 11432, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Intelligent Syst Lab, Alexandria 21937, Egypt
[3] Arab Acad Sci Technol & Maritime Transport, Elect & Commun Engn Dept, Alexandria 21937, Egypt
[4] VRapeutic, Dept Res & Dev, Cairo 11613, Egypt
[5] Alexandria Higher Inst Engn & Technol AIET, Alexandria 21311, Egypt
来源
PROCEEDINGS OF 2021 38TH NATIONAL RADIO SCIENCE CONFERENCE (NRSC) | 2021年
关键词
MobileNet; CIFAR; wavelet; pooling; shallow; classification;
D O I
10.1109/NRSC52299.2021.9509825
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
MobileNet is a light-weight neural network model that has facilitated harnessing the power of deep learning on mobile devices. The advances in pervasive computing and the ever-increasing interest in deep learning has resulted in a growing research attention on the enhancement of the MobileNet architecture. Beside the enhancement in convolution layers, recent literature has featured new directions for implementing the pooling layers. In this work, we propose a new model based on the MobileNet-V1 architecture, and we investigate the impact of wavelet pooling on the performance of the proposed model. While traditional neighborhood pooling can result in information loss, which negatively impacts any succeeding feature extraction, wavelet pooling allows us to utilize spectral information which is useful in most image processing tasks. On two widely adopted datasets, we evaluated the performance of the proposed model, and compared to the baseline MobileNet, we attained a 10% and a 16% increase in classification accuracy on CIFAR-10 and CIFAR-100 respectively. We also evaluated a shallow version of the proposed architecture with wavelet pooling, and we showed that it maintained the classification accuracy either higher than, or <1% less than, the deep versions of MobileNet while decreasing the number of model parameters by almost 40%.
引用
收藏
页码:126 / 138
页数:13
相关论文
共 19 条
  • [1] Deep Learning Framework for Vehicle and Pedestrian Detection in Rural Roads on an Embedded GPU
    Barba-Guaman, Luis
    Eugenio Naranjo, Jose
    Ortiz, Anthony
    [J]. ELECTRONICS, 2020, 9 (04)
  • [2] Chen HY, 2018, INT CONF AWARE SCI, P308, DOI 10.1109/ICAwST.2018.8517177
  • [3] A Two-Stage Framework for Automated Malignant Pulmonary Nodule Detection in CT Scans
    EL-Bana, Shimaa
    Al-Kabbany, Ahmad
    Sharkas, Maha
    [J]. DIAGNOSTICS, 2020, 10 (03)
  • [4] Freeman I, 2018, IEEE IMAGE PROC, P6, DOI 10.1109/ICIP.2018.8451339
  • [5] Fujieda S., 2018, Wavelet convolutional neural networks
  • [6] Howard A.G., 2017, CoRR abs/1704.04861
  • [7] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [8] Jin J., 2014, Flattened Convolutional Neural Networks for Feedforward Acceleration
  • [9] Krizhevsky A., 2009, LEARNING MULTIPLE LA
  • [10] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90