Exploring filter placement in convolutional layer topologies based on ResNet for image classification

被引:0
作者
Liu, Haixia [1 ]
Brailsford, Tim [2 ]
Bull, Larry [1 ]
机构
[1] Univ West England, Bristol, England
[2] Univ Kurdistan Hewler, 30 Metre Ave, Erbil, Iraq
关键词
CNN; Topology; Image classification; ResNet; Number of filters; MedMNIST; Medical imaging; NETWORK;
D O I
10.1007/s00138-025-01674-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate the impact that altering the convolutional layer topology has upon the performance of computer vision tasks using a variety of widely used benchmark image datasets. Despite the widespread convention in convolutional neural networks, of incrementally doubling the filter count at each layer, there is little evidence substantiating the superiority of this method over other possible topologies. Our research reveals that a contrarian strategy-reducing the filters by half-can achieve performance on par with, if not superior to, this usual approach. We have extended our investigation to include a variety of novel topological structures. These empirical results challenge the prevailing assumption, that the sequential doubling of number of filters in the network configuration will always yield the best results with all datasets. Our findings advocate for a more nuanced approach to neural network design, incorporating a flexible approach to filter topologies into workflows. This could potentially have a significant impact upon the architectural standards in deep learning for visual recognition tasks.
引用
收藏
页数:11
相关论文
共 31 条
[1]  
Agrawal A., 2020, Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy, V36, P405, DOI [10.1007/s00371-019-01630-9, DOI 10.1007/S00371-019-01630-9]
[2]  
Ahmed Wafaa Shihab, 2020, 2020 International Conference on Computer Science and Software Engineering (CSASE). Proceedings, P88, DOI 10.1109/CSASE48920.2020.9142089
[3]   Explainable attention based breast tumor segmentation using a combination of UNet, ResNet, DenseNet, and EfficientNet models [J].
Anari, Shokofeh ;
Sadeghi, Soroush ;
Sheikhi, Ghazaal ;
Ranjbarzadeh, Ramin ;
Bendechache, Malika .
SCIENTIFIC REPORTS, 2025, 15 (01)
[4]   Image channel and spatial information integrated method for fall detection [J].
Cheng, Xinmin ;
Ran, Maoke ;
Chen, Benyao ;
Yin, Hongwei .
MULTIMEDIA SYSTEMS, 2025, 31 (01)
[5]   Dual-Model Synergy for Fingerprint Spoof Detection Using VGG16 and ResNet50 [J].
Cheniti, Mohamed ;
Akhtar, Zahid ;
Chandaliya, Praveen Kumar .
JOURNAL OF IMAGING, 2025, 11 (02)
[6]   Finite-region asynchronous H∞ filtering for 2-D Markov jump systems in Roesser model [J].
Fang, Jiankang ;
Ren, Chengcheng ;
Wang, Hai ;
Stojanovic, Vladimir ;
He, Shuping .
APPLIED MATHEMATICS AND COMPUTATION, 2024, 470
[7]   Biomedical Image Classification via Dynamically Early Stopped Artificial Neural Network [J].
Franchini, Giorgia ;
Verucchi, Micaela ;
Catozzi, Ambra ;
Porta, Federica ;
Prato, Marco .
ALGORITHMS, 2022, 15 (10)
[8]   Reshaping inputs for convolutional neural network: Some common and uncommon methods [J].
Ghosh, Swarnendu ;
Das, Nibaran ;
Nasipuri, Mita .
PATTERN RECOGNITION, 2019, 93 :79-94
[9]   Gray-to-color image conversion in the classification of breast lesions on ultrasound using pre-trained deep neural networks [J].
Gomez-Flores, Wilfrido ;
Pereira, Wagner Coelho de Albuquerque .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (12) :3193-3207
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778