Feature amplification capsule network for complex images

被引:1
作者
Ayidzoe, Mighty Abra [1 ,2 ]
Yu, Yongbin [1 ]
Mensah, Patrick Kwabena [2 ]
Cai, Jingye [1 ]
Kwabena, Adu [1 ]
Tashi, Nyima [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China
[2] Univ Energy & Nat Resources, Dept Comp Sci & Informat, Sunyani, Ghana
[3] Tibet Univ, Sch Informat Sci & Technol, Lhasa, Peoples R China
基金
中国国家自然科学基金;
关键词
Capsule network; convolutional neural network; squash function; feature amplification;
D O I
10.3233/JIFS-202080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Availability of massive amounts of data is a key contributing factor that influences the performance of deep learning models. Convolutional Neural Networks for instance, require large amounts of data in different variations to enable them generalize well to viewpoints. However, in health and other application domains, data generation and processing tasks are time-consuming and requires annotation by experts. Capsule Network (CapsNet) have been proposed to curtail the limitations of Convolutional Neural Networks (CNNs). Due to the problem of crowding, capsule Networks perform badly on complex and real-life images such as CIFAR 10 and some medical images. In this study, a variant of a capsule network with a new algorithm referred to as the amplifier and a new squash function termed exponential squash is proposed. The amplifier is implemented in the encoder network to improve the texture of the images and has the ability to assign low relevance to irrelevant features and high relevance to vital features. The exponential squash function reduces the coupling strength of unrelated capsules in the lower and upper capsule layer. The proposed algorithm was evaluated on four datasets; CIFAR 10, fashion-MNIST, eye disease dataset and ODIR dataset achieving accuracies of 84.56% 93.76%, 89.02% and 87.27% respectively. This work sheds light on the possibility of applying CapsNet on complex real-world tasks. The proposed model can serve as an intelligent tool to aid medical personnel to diagnose eye disease and apply the necessary treatments.
引用
收藏
页码:10955 / 10968
页数:14
相关论文
共 27 条
  • [1] [Anonymous], 2018, IEEE SIGNAL PROCESS
  • [2] Bing S., 2017, Capsule Network Performance on Complex Data, V10707, P1
  • [3] Cao S., 2019, E2 CAPSULE NEURAL NE, V00, P1
  • [4] Multi-Lane Capsule Network for Classifying Images With Complex Background
    Chang, Siwei
    Liu, Jin
    [J]. IEEE ACCESS, 2020, 8 : 79876 - 79886
  • [5] Deborshi G., 2019, APPL CAPSULE NETWORK
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] Hinton G. E., 2018, INT C LEARN REPR, P1, DOI DOI 10.2514/1.562
  • [8] Hinton GE, 2011, LECT NOTES COMPUT SC, V6791, P44, DOI 10.1007/978-3-642-21735-7_6
  • [9] Kondwani, 2020, EYE DIS DAT EYE DIS
  • [10] Krizhevsky A., 2009, Masters Thesis