Capsule networks for computer vision applications: a comprehensive review

被引:0
作者
Seema Choudhary
Sumeet Saurav
Ravi Saini
Sanjay Singh
机构
[1] Academy of Scientific and Innovative Research (AcSIR),
[2] CSIR-Central Electronics Engineering Research Institute (CSIR-CEERI),undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Deep learning; Convolutional neural network; Capsule networks; Routing-by-agreement;
D O I
暂无
中图分类号
学科分类号
摘要
Convolutional neural networks (CNNs) have achieved human-level performance in various computer vision tasks, such as image classification, object detection & segmentation, etc. However, efficient CNN training requires a large amount of annotated data. Also, the CNNs, without explicit data augmentation, are bad at handling rotation and scale invariance. Besides, these neural networks do not learn the important spatial correlations between simple and complex objects in images. Recently, researchers introduced Capsule Network (CapsNet) to overcome the limitations of CNNs. CapsNet uses vector activation functions where the vectors’ length and orientation represent the entities’ existence and properties. Recent advances in the routing algorithms of CapsNets have increased their usefulness in solving complex computer vision problems. One can gauge their importance from numerous recently published articles in top-rank conferences and journals. Also, researchers have published a few review articles that discuss the structural and implementation details of CapsNets. This review focuses on the applications of CapsNet in computer vision. We first present a brief note on CNNs and their limitations, followed by basic structural and implementation details of CapsNet, including routing algorithms. Subsequently, the study investigates details of CapsNet variants that have evolved in recent years and their applications in different computer vision tasks. Finally, the paper presents a short commentary on the advantages, disadvantages, and limitations of CapsNet and outlines future research directions in the area of CapsNet.
引用
收藏
页码:21799 / 21826
页数:27
相关论文
共 302 条
[31]  
Salvetti F(2019)Hyperspectral classification based on lightweight 3-d-cnn with transfer learning IEEE Trans Geosci Remote Sens 57 5813-931
[32]  
Chiaberge M(2017)Spectral-spatial residual network for hyperspectral image classification: A 3-d deep learning framework IEEE Trans Geosci Remote Sens 56 847-32147
[33]  
Hahn T(2018)Capsule networks for hyperspectral image classification IEEE Trans Geosci Remote Sens 57 2145-176599
[34]  
Pyeon M(2018)Generative adversarial networks for hyperspectral image classification IEEE Trans Geosci Remote Sens 56 5046-71363
[35]  
Kim G(1995)Support-vector networks Mach Learn 20 273-359
[36]  
Cheng X(2020)Capsule networks for object segmentation using virtual world dataset Sensors & Transducers 244 20-32
[37]  
He J(2011)The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans Med Phys 38 915-186
[38]  
He J(2021)Residual-capsule networks with threshold convolution for segmentation of wheat plantation rows in UAV images Multimed Tools Appl 80 32131-74
[39]  
Xu H(2019)Hyperspectral image classification with pre-activation residual attention network IEEE Access 7 176587-94
[40]  
Do Rosario VM(2020)Small sample classification of hyperspectral remote sensing images based on sequential joint deeping learning model IEEE Access 8 71353-14