Less Is More: Matched Wavelet Pooling-Based Light-Weight CNNs With Application to Image Classification

被引:3
作者
El-Khamy, Said [1 ]
Al-Kabbany, Ahmad [2 ,3 ,4 ]
El-Bana, Shimaa [5 ]
机构
[1] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Intelligent Syst Lab, Alexandria 21937, Egypt
[3] Arab Acad Sci Technol & Maritime Transport, Dept Elect & Commun Engn, Alexandria 21937, Egypt
[4] VRapeutic Inc, Dept Res & Dev, Ottawa, ON K1V 6T8, Canada
[5] Alexandria Higher Inst Engn & Technol, Alexandria 21311, Egypt
关键词
Discrete wavelet transforms; Convolution; Computer architecture; Training; Low-pass filters; Testing; Deep learning; CNNs; classification; MobileNet; pooling; wavelet; spectral information;
D O I
10.1109/ACCESS.2022.3180498
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We are concerned with the development of effective training strategies for light-weight deep networks, which facilitates harnessing the power of deep learning on hand-held devices. A vast literature has focused on enhancing the performance of deep convolution models (CNNs) by introducing changes in the model architecture and/or developing new training procedures. Recent literature has witnessed a growing focus on pooling layers, one example for this is wavelet pooling. Pooling is often adopted by CNNs to enlarge the receptive field, yet traditional pooling methods cause information loss which undermines further operations such as feature extraction. By capitalizing on spectral information, wavelet pooling (WP) has addressed this drawback in neighborhood pooling, and we have demonstrated its positive impact on light-weight CNN architectures such as MobileNets in previous research. In this paper, we report the following significant observation: Including all sub-bands of the wavelet decomposition does not necessarily yield higher accuracies than using certain sub- set of sub-bands, i.e., less is more. By identifying which sub-bands to be included in the pooling process for every image during training and testing, we propose a new training and inference procedures, namely, Matched Wavelet Pooling (MWP), for light-weight CNN architectures, and in particular, MobileNets. As an example, the image classification application is considered. On two widely adopted datasets, the proposed MWP algorithm consistently outperform previous pooling methods and attained an average of 20% accuracy increase compared to the recent literature.
引用
收藏
页码:59592 / 59602
页数:11
相关论文
共 38 条
  • [1] Amer H., ARXIV210408314, P2021
  • [2] [Anonymous], 2018, Cinic-10 is not imagenet or cifar-10
  • [3] Less is More: Video Trimming for Action Recognition
    Antic, Borislav
    Milbich, Timo
    Ommer, Bjoern
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 515 - 521
  • [4] Deep Learning Framework for Vehicle and Pedestrian Detection in Rural Roads on an Embedded GPU
    Barba-Guaman, Luis
    Eugenio Naranjo, Jose
    Ortiz, Anthony
    [J]. ELECTRONICS, 2020, 9 (04)
  • [5] Chen HY, 2018, INT CONF AWARE SCI, P308, DOI 10.1109/ICAwST.2018.8517177
  • [6] Coates A., 2011, JMLR WORKSHOP C P, P215
  • [7] A multi-task pipeline with specialized streams for classification and segmentation of infection manifestations in COVID-19 scans
    El-bana, Shimaa
    Al-Kabbany, Ahmad
    Sharkas, Maha
    [J]. PEERJ COMPUTER SCIENCE, 2020, 6 : 1 - 27
  • [8] A Two-Stage Framework for Automated Malignant Pulmonary Nodule Detection in CT Scans
    EL-Bana, Shimaa
    Al-Kabbany, Ahmad
    Sharkas, Maha
    [J]. DIAGNOSTICS, 2020, 10 (03)
  • [9] El-Khamy S. E., 2021, PROC INT TELECOMMUN, P1
  • [10] GOING SHALLOWER WITH MOBILENETS: ON THE IMPACT OF WAVELET POOLING
    El-Khamy, Said
    Al-Kabbany, Ahmad
    EL-Band, Shimaa
    [J]. PROCEEDINGS OF 2021 38TH NATIONAL RADIO SCIENCE CONFERENCE (NRSC), 2021, : 126 - 138