Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications

被引:1
作者
Nemani, Praneeth [1 ]
Vadali, Venkata Surya Sundar [2 ]
Medi, Prathistith Raj [3 ]
Marisetty, Ashish [3 ]
Vollala, Satyanarayana [2 ]
Kumar, Santosh [2 ]
机构
[1] Univ Colorado Boulder, Coll Engn & Appl Sci, Boulder, CO 80309 USA
[2] IIIT Naya Raipur, Dept Comp Sci & Engn, Uparwara, India
[3] IIIT Naya Raipur, Dept Data Sci & Artificial Intelligence, Uparwara, India
关键词
Segmentation; CNNs; Transformers; Generative AI; Hybrid architectures; Dataset; GI-Tract; ENDOSCOPIC RESECTION; FEATURE-EXTRACTION; U-NET; SEGMENTATION; DEEP; CHALLENGES; POLYPS; NETWORKS;
D O I
10.1016/j.imavis.2024.105068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This review paper presents an in-depth exploration of gastrointestinal (GI) tract image analysis, particularly emphasizing organ and polyp segmentation. It addresses the inherent challenges posed by the GI tract's complex anatomy and diverse pathologies, which complicate accurate image analysis. Central to this review is the examination of hybrid computational models that integrate convolutional neural networks (CNNs) and Transformers. This synergy enhances the accuracy of segmenting intricate structures in GI tract imaging, marking a significant advancement in the field. A notable contribution of this review is the systematic categorization and analysis of the latest methodologies in organ and polyp segmentation. It provides a comprehensive overview of various techniques, highlighting their strengths and limitations in addressing the specifications of GI tract imaging. This survey serves as a valuable reference for researchers, outlining current practices and offering insights for future innovations. The review also underscores the critical role of extensive and diverse datasets in advancing GI tract image analysis. It stresses the need for high-quality datasets to effectively train and evaluate emerging models, addressing the broad spectrum of GI tract conditions. Moreover, the review delves into the burgeoning area of Generative AI, exploring its potential to enrich datasets and enhance segmentation models. Future developments in GI tract segmentation will focus on refining hybrid CNN-Transformer models and creating larger, more diverse datasets for better model training. Specialized focus on specific segmentation challenges, like polyp and organ segmentation, is anticipated. The field will explore Generative AI applications for innovative segmentation approaches. Collaborative efforts between technologists and clinicians will enhance practical clinical integration and applicability.
引用
收藏
页数:14
相关论文
共 126 条
[41]   PPNet: Pyramid pooling based network for polyp segmentation [J].
Hu, Keli ;
Chen, Wenping ;
Sun, YuanZe ;
Hu, Xiaozhao ;
Zhou, Qianwei ;
Zheng, Zirui .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 160
[42]  
Huang C.H., 2021, arXiv
[43]   Wireless capsule endoscopy [J].
Iddan, G ;
Meron, G ;
Glukhovsky, A ;
Swain, P .
NATURE, 2000, 405 (6785) :417-417
[44]   Endoscopic resection of large pedunculated colorectal polyps using a detachable snare [J].
Iishi, H ;
Tatsuta, M ;
Narahara, H ;
Iseki, K ;
Sakai, N .
GASTROINTESTINAL ENDOSCOPY, 1996, 44 (05) :594-597
[45]   A Comprehensive Study on Colorectal Polyp Segmentation With ResUNet plus plus , Conditional Random Field and Test-Time Augmentation [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Johansen, Dag ;
de Lange, Thomas ;
Johansen, Havard D. ;
Halvorsen, Pal ;
Riegler, Michael A. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) :2029-2040
[46]   DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation [J].
Jha, Debesh ;
Riegler, Michael A. ;
Johansen, Dag ;
Halvorsen, Pal ;
Johansen, Havard D. .
2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, :558-564
[47]   Kvasir-SEG: A Segmented Polyp Dataset [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Riegler, Michael A. ;
Halvorsen, Pal ;
de Lange, Thomas ;
Johansen, Dag ;
Johansen, Havard D. .
MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 :451-462
[48]   ResUNet plus plus : An Advanced Architecture for Medical Image Segmentation [J].
Jha, Debesh ;
Smedsrud, Pia H. ;
Riegler, Michael A. ;
Johansen, Dag ;
de Lange, Thomas ;
Halvorsen, Pal ;
Johansen, Havard D. .
2019 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2019), 2019, :225-230
[49]  
Jia X, 2017, I S BIOMED IMAGING, P179, DOI 10.1109/ISBI.2017.7950496
[50]  
Jia X, 2016, IEEE ENG MED BIO, P639, DOI 10.1109/EMBC.2016.7590783