Inception recurrent convolutional neural network for object recognition

被引:0
|
作者
Md Zahangir Alom
Mahmudul Hasan
Chris Yakopcic
Tarek M. Taha
Vijayan K. Asari
机构
[1] University of Dayton,Department of Electrical and Computer Engineering
[2] Comcast Labs,undefined
来源
Machine Vision and Applications | 2021年 / 32卷
关键词
IRCNN; RCNN; DCNN; Deep Learning; Object recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Deep convolutional neural network (DCNN) is an influential tool for solving various problems in machine learning and computer vision. Recurrent connectivity is a very important component of visual information processing within the human brain. The idea of recurrent connectivity is rarely applied within convolutional layers, the exceptions being a couple of DCNN architectures including recurrent convolutional neural network (RCNN) in Liang and Hu (in: Proceedings of the IEEE conference on computer vision and pattern recognition, 2015) and Pinheiro and Collobert (in: ICML, 2014). On the other hand, the Inception network architecture has become popular among the computer vision community (Szegedy et al. in Inception-v4, Inception-ResNet and the impact of Residual connections on learning, 2016. arXiv:1602.07261). In this paper, we introduce a deep learning architecture called the Inception Recurrent Convolutional Neural Network (IRCNN), which utilizes the power of an Inception network combined with recurrent convolutional layers. Although the inputs are static, the recurrent property plays a huge role in modeling the contextual information for object recognition tasks and thus improves overall training and testing accuracy. In addition, this proposed architecture generalizes both Inception and RCNN models. We have empirically evaluated the recognition performance of the proposed IRCNN model using different benchmark datasets such as MNIST, CIFAR-10, CIFAR-100, and SVHN. The experimental results show higher recognition accuracy when compared to most of the popular DCNNs including the RCNN. Furthermore, we have investigated IRCNN performance against equivalent Inception networks (EIN) and equivalent Inception–Residual networks (EIRN) using the CIFAR-100 dataset. When using the augmented CIFAR-100 dataset, we achieved about 3.5%, 3.47% and 2.54% improvement in classification accuracy compared to the RCNN, EIN, and EIRN respectively. We have also conducted experiment on Tiny ImageNet-200 dataset with IRCNN, EIN, EIRN, RCNN, DenseNet in Huang et al. (Densely connected convolutional networks, 2016. arXiv:1608.06993), and DenseNet with Recurrent Convolution Layer, where the proposed model shows significantly better performance against baseline models.
引用
收藏
相关论文
共 50 条
  • [21] Recognition and Classification of Concrete Surface Cracks with an Inception Quantum Convolutional Neural Network Algorithm
    Bu, Yun-zhe
    Xiao, Yi-lei
    Li, Ya-jun
    Meng, Ling-guang
    APPLIED GEOPHYSICS, 2024,
  • [22] 3D convolutional neural network for object recognition: a review
    Rahul Dev Singh
    Ajay Mittal
    Rajesh K. Bhatia
    Multimedia Tools and Applications, 2019, 78 : 15951 - 15995
  • [23] Simultaneous Space Object Recognition and Pose Estimation by Convolutional Neural Network
    Afshar, Roya
    Chu, Zhongyi
    Lu, Shuai
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 490 - 495
  • [24] Remote Sensing Image Object Recognition Based on Convolutional Neural Network
    Zhen, Yumei
    Liu, Huanyu
    Li, Junbao
    Hu, Cong
    Pan, Jeng-Shyang
    PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 814 - 817
  • [25] Object recognition through scattering media using convolutional neural network
    Wu, Yulin
    Yan, Huimin
    14TH NATIONAL CONFERENCE ON LASER TECHNOLOGY AND OPTOELECTRONICS (LTO 2019), 2019, 11170
  • [26] Convolutional neural network applied for object recognition in a warehouse of an electric company
    Martinez Piratelo, Paulo Henrique
    de Azeredo, Rodrigo Negri
    Yamao, Eduardo Massashi
    Maidl, Gabriel
    de Jesus, Laercio Pereira
    Penteado Neto, Renato de Arruda
    Coelho, Leandro dos Santos
    Leandro, Gideon Villar
    2021 14TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRY APPLICATIONS (INDUSCON), 2021, : 293 - 299
  • [27] 3D convolutional neural network for object recognition: a review
    Singh, Rahul Dev
    Mittal, Ajay
    Bhatia, Rajesh K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 15951 - 15995
  • [28] Image Object and Scene Recognition Based on Improved Convolutional Neural Network
    Li, Guoyan
    Wang, Fei
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (05) : 925 - 937
  • [29] Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network
    Md Zahangir Alom
    Chris Yakopcic
    Mst. Shamima Nasrin
    Tarek M. Taha
    Vijayan K. Asari
    Journal of Digital Imaging, 2019, 32 : 605 - 617
  • [30] Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network
    Alom, Md Zahangir
    Yakopcic, Chris
    Nasrin, Shamima
    Taha, Tarek M.
    Asari, Vijayan K.
    JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) : 605 - 617