Gen-CNN: a framework for the automatic generation of CNNs for image classification

被引:0
|
作者
Rogelio García-Aguirre [1 ]
Eva María Navarro-López [2 ]
Luis Torres-Treviño [3 ]
机构
[1] Facultad de Ingeniería Mecánica y Eléctrica, Universidad Autónoma de Nuevo León, Ave. Universidad S/N, San Nicolás de los Garza, Nuevo León
[2] School of Interactive Games and Media, Golisano College of Computing and Information Sciences, Rochester Institute of Technology, 20 Lomb Memorial Drive, Rochester, 14623, NY
[3] School of Environment, Education and Development, University of Manchester, Oxford Road, Manchester
关键词
Convolutional neural network; Genetic algorithm; Hyperparameter optimization; Image classification;
D O I
10.1007/s00521-024-10398-6
中图分类号
学科分类号
摘要
Convolutional neural networks (CNNs) have become widely adopted for computer vision tasks. However, the vast amount of design choices and the complex interactions among their hyperparameters, which ultimately influence the model’s performance, impede their accessibility to users who are not experts in machine learning (ML). To address this challenge, we present AutoML as a solution, leveraging hyperparameter optimization (HPO) for effective parameter selection. Particularly good at handling non-convex, non-differentiable optimization tasks, genetic algorithms are easy to implement and parallelize, making them well suited for deep learning applications. In this context, we introduce Gen-CNN, an AutoML framework based on a genetic algorithm that generates CNN models for image classification. Our framework incorporates transfer learning and operates in a low-compute regime to accelerate the hyperparameter optimization phase. We test Gen-CNN on four datasets, including Sign Language Digits for convergence assessment and KVASIR-v2, ISIC-2019, and BreakHis for performance evaluation. Our results prove that Gen-CNN automatically generates CNN models with classification performance comparable to state-of-the-art custom models already published in the literature. Moreover, in the recommended testing regime for heuristic optimization techniques, we surpassed other HPO algorithms by achieving better mean categorical accuracy. Gen-CNN code is available at—omitted for anonymous review. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:149 / 168
页数:19
相关论文
共 50 条
  • [41] CNN Based Image Classification of Malicious UAVs
    Brown, Jason
    Gharineiat, Zahra
    Raj, Nawin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [42] A Modification-Free Steganography Algorithm Based on Image Classification and CNN
    Wu, Jian Bin
    Zhang, Yang
    Luo, Chu Wei
    Yuan, Lin Feng
    Shen, Xiao Kang
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2021, 13 (03) : 47 - 58
  • [43] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [44] Morph-CNN: A Morphological Convolutional Neural Network for Image Classification
    Mellouli, Dorra
    Hamdani, Tarek M.
    Ben Ayed, Mounir
    Alimi, Adel M.
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 110 - 117
  • [45] Application of CNN based image classification technique for oil spill detection
    Das, K.
    Janardhan, P.
    Narayana, H.
    INDIAN JOURNAL OF GEO-MARINE SCIENCES, 2023, 52 (01): : 5 - 14
  • [46] Image Classification with Caffe Deep Learning Framework
    Cengil, Emine
    Cinar, Ahmet
    Ozbay, Erdal
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 440 - 444
  • [47] Medical Image Classification with a Hybrid SSM Model Based on CNN and Transformer
    Hu, Can
    Cao, Ning
    Zhou, Han
    Guo, Bin
    ELECTRONICS, 2024, 13 (15)
  • [48] Development and Classification of Image Dataset for Text-to-Image Generation
    Kumar M.
    Mittal M.
    Singh S.
    Journal of The Institution of Engineers (India): Series B, 2024, 105 (04) : 787 - 796
  • [49] Transformer-Based Fused Attention Combined with CNNs for Image Classification
    Jielin Jiang
    Hongxiang Xu
    Xiaolong Xu
    Yan Cui
    Jintao Wu
    Neural Processing Letters, 2023, 55 : 11905 - 11919
  • [50] RanNet: Learning Residual-Attention Structure in CNNs for Automatic Modulation Classification
    Huynh-The, Thien
    Pham, Quoc-Viet
    Nguyen, Toan-Van
    Nguyen, Thanh Thi
    da Costa, Daniel Benevides
    Kim, Dong-Seong
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (06) : 1243 - 1247