Analysis of non-linear activation functions for classification tasks using convolutional neural networks

Cited by: 9
Authors
Dureja A. [1]
Pahwa P. [2]
Affiliations
[1] Computer Science & Engineering, USICT, GGSIPU, New Delhi
[2] Computer Science & Engineering, BPIT, Rohini, New Delhi
Keywords
Activation function; CNN; Deep neural networks; Hidden layers; Machine learning; Non-linear problems
DOI
10.2174/2213275911666181025143029
Abstract
Background: Activation functions play an important role in building deep neural networks, and the choice of activation function affects both how well the network optimizes and the quality of its results. Several activation functions have been introduced in machine learning for many practical applications, but which one should be used in the hidden layers of a deep neural network has not been established.
Objective: The primary objective of this analysis was to determine which activation function should be used in the hidden layers of deep neural networks to solve complex non-linear problems.
Methods: The comparative model was configured on a two-class (Cat/Dog) dataset. The network used three convolutional layers, each followed by a pooling layer. The dataset was divided into two parts: the first 8000 images were used for training the network and the next 2000 images for testing it.
Results: The experimental comparison was carried out by applying different activation functions at each layer of the CNN. The validation error and accuracy on the Cat/Dog dataset were analyzed for the activation functions ReLU, Tanh, SELU, PReLU, and ELU in the hidden layers. Overall, ReLU gave the best performance, with a validation loss of 0.3912 and a validation accuracy of 0.8320 at the 25th epoch.
Conclusion: A CNN model with ReLU hidden layers (three hidden layers here) gave the best results and improved overall performance in terms of accuracy and speed. These advantages of ReLU in the hidden layers of a CNN are helpful for effective and fast retrieval of images from databases.
© 2019 Bentham Science Publishers.
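For readers unfamiliar with the five activation functions named in the abstract, the following is a minimal scalar sketch of their standard textbook definitions. It is not the authors' code. The ELU `alpha` default, the PReLU slope `a` (which is a learned parameter in practice; 0.25 here is only an illustrative starting value), and all function names are assumptions made for this example.

```python
import math

# SELU constants from the standard self-normalizing formulation.
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def relu(x):
    """Rectified linear unit: passes positive inputs, zeroes the rest."""
    return max(0.0, x)


def tanh(x):
    """Hyperbolic tangent: squashes inputs into (-1, 1)."""
    return math.tanh(x)


def elu(x, alpha=1.0):
    """Exponential linear unit; alpha=1.0 is an assumed default."""
    return x if x > 0 else alpha * math.expm1(x)


def selu(x):
    """Scaled ELU with fixed alpha and scale constants."""
    return SELU_SCALE * (x if x > 0 else SELU_ALPHA * math.expm1(x))


def prelu(x, a=0.25):
    """Parametric ReLU; the slope `a` is learned in a real network,
    so the fixed value here is purely illustrative."""
    return x if x > 0 else a * x


# Hypothetical lookup table for comparing the functions side by side.
ACTIVATIONS = {
    "ReLU": relu,
    "Tanh": tanh,
    "SELU": selu,
    "PReLU": prelu,
    "ELU": elu,
}

if __name__ == "__main__":
    for name, fn in ACTIVATIONS.items():
        print(name, [round(fn(v), 4) for v in (-2.0, 0.0, 2.0)])
```

All five functions agree closely for positive inputs (SELU only adds a constant scale) and differ mainly in how they treat negative inputs, which is the behavior a comparison such as the one described in the abstract would be probing.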
Pages: 156 - 161
Number of pages: 5