Less Is More: Matched Wavelet Pooling-Based Light-Weight CNNs With Application to Image Classification

被引:4
作者
El-Khamy, Said [1 ]
Al-Kabbany, Ahmad [2 ,3 ,4 ]
El-Bana, Shimaa [5 ]
机构
[1] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Intelligent Syst Lab, Alexandria 21937, Egypt
[3] Arab Acad Sci Technol & Maritime Transport, Dept Elect & Commun Engn, Alexandria 21937, Egypt
[4] VRapeutic Inc, Dept Res & Dev, Ottawa, ON K1V 6T8, Canada
[5] Alexandria Higher Inst Engn & Technol, Alexandria 21311, Egypt
关键词
Discrete wavelet transforms; Convolution; Computer architecture; Training; Low-pass filters; Testing; Deep learning; CNNs; classification; MobileNet; pooling; wavelet; spectral information;
D O I
10.1109/ACCESS.2022.3180498
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We are concerned with the development of effective training strategies for light-weight deep networks, which facilitates harnessing the power of deep learning on hand-held devices. A vast literature has focused on enhancing the performance of deep convolution models (CNNs) by introducing changes in the model architecture and/or developing new training procedures. Recent literature has witnessed a growing focus on pooling layers, one example for this is wavelet pooling. Pooling is often adopted by CNNs to enlarge the receptive field, yet traditional pooling methods cause information loss which undermines further operations such as feature extraction. By capitalizing on spectral information, wavelet pooling (WP) has addressed this drawback in neighborhood pooling, and we have demonstrated its positive impact on light-weight CNN architectures such as MobileNets in previous research. In this paper, we report the following significant observation: Including all sub-bands of the wavelet decomposition does not necessarily yield higher accuracies than using certain sub- set of sub-bands, i.e., less is more. By identifying which sub-bands to be included in the pooling process for every image during training and testing, we propose a new training and inference procedures, namely, Matched Wavelet Pooling (MWP), for light-weight CNN architectures, and in particular, MobileNets. As an example, the image classification application is considered. On two widely adopted datasets, the proposed MWP algorithm consistently outperform previous pooling methods and attained an average of 20% accuracy increase compared to the recent literature.
引用
收藏
页码:59592 / 59602
页数:11
相关论文
共 38 条
[31]  
Williams T., 2018, P INT C LEARN REPR
[32]   Less is More: Learning Highlight Detection from Video Duration [J].
Xiong, Bo ;
Kalantidis, Yannis ;
Ghadiyaram, Deepti ;
Grauman, Kristen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1258-1267
[33]   Mixed Pooling for Convolutional Neural Networks [J].
Yu, Dingjun ;
Wang, Hanli ;
Chen, Peiqiu ;
Wei, Zhihua .
ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 :364-375
[34]  
Yu Fisher, 2015, 1511 ARXIV
[35]   RMAF: Relu-Memristor-Like Activation Function for Deep Learning [J].
Yu, Yongbin ;
Adu, Kwabena ;
Tashi, Nyima ;
Anokye, Patrick ;
Wang, Xiangxiang ;
Ayidzoe, Mighty Abra .
IEEE ACCESS, 2020, 8 :72727-72741
[36]  
Zeiler M., 2013, P INT C LEARN REPR
[37]   ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices [J].
Zhang, Xiangyu ;
Zhou, Xinyu ;
Lin, Mengxiao ;
Sun, Ran .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6848-6856
[38]   Multiactivation Pooling Method in Convolutional Neural Networks for Image Recognition [J].
Zhao, Qi ;
Lyu, Shuchang ;
Zhang, Boxue ;
Feng, Wenquan .
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,