Gabor capsule network with preprocessing blocks for the recognition of complex images

被引:25
作者
Abra Ayidzoe, Mighty [1 ,2 ]
Yu, Yongbin [1 ]
Mensah, Patrick Kwabena [2 ]
Cai, Jingye [1 ]
Adu, Kwabena [1 ]
Tang, Yifan [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China
[2] Univ Energy & Nat Resources, Dept Comp Sci & Informat, Sunyani, Ghana
基金
中国国家自然科学基金;
关键词
Capsule network; Gabor filter; Convolutional neural network;
D O I
10.1007/s00138-021-01221-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Capsule network (CapsNet) is a novel concept demonstrating the importance of learning spatial hierarchical relationship between features for the effective recognition of images. However, the baseline capsule network is not suitable for the recognition of complex images leading to its poor performance on such images. This limitation can partially be attributed to the inability of CapsNets to extract important features from the input images as well as the attempt to account for every object in the image including background objects. To address these problems, we propose a variant of a capsule network that is less complex yet robust with strong feature extraction capabilities. The model uses the advantages of Gabor filter and custom preprocessing block to learn the structure and semantic information in the image. This enhances the extraction of only important features, resulting in improved activation diagrams that enable meaningful hierarchical information to be learned. Experimental results show that the proposed model can achieve 85.24%, 68.17%, 94.78% and 91.50% test accuracies on complex images such as CIFAR 10, CIFAR 100, fashion-MNIST and kvasir-dataset-v2 datasets, respectively. The performance of the proposed model is comparable to that of the state-of-the-art models on the five datasets with a relatively small number of parameters.
引用
收藏
页数:16
相关论文
共 36 条
[1]  
Ahmed K, 2019, ADV NEUR IN, V32
[2]  
[Anonymous], P ADV NEUR INF PROC
[3]  
[Anonymous], 2016, P 29 IEEE C COMPUTER
[4]  
Asperti A., 2018, Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies, P199, DOI DOI 10.5220/0006730901990205
[5]  
Buades A, 2005, AVSS 2005: ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, P70
[6]  
Cao S., 2019, E2 CAPSULE NEURAL NE, P1
[7]   Multi-Lane Capsule Network for Classifying Images With Complex Background [J].
Chang, Siwei ;
Liu, Jin .
IEEE ACCESS, 2020, 8 :79876-79886
[8]   Gastrointestinal Tract Diseases Detection with Deep Attention Neural Network [J].
Chang, Yuan ;
Huang, Zixuan ;
Chen, Weizhao ;
Shen, Qiwei .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :2568-2572
[9]  
Coughlan Gillian, 2018, Dement. neuropsychol., V12, P85
[10]  
Deborshi G., 2019, Application of Capsule Networks for Image Classification on Complex Datasets