Colorectal cancer detection with enhanced precision using a hybrid supervised and unsupervised learning approach

被引:0
作者
Raju, Akella S. Narasimha [1 ]
Venkatesh, K. [2 ]
Gatla, Ranjith Kumar [3 ]
Konakalla, Eswara Prasad [4 ]
Eid, Marwa M. [5 ]
Titova, Nataliia [6 ]
Ghoneim, Sherif S. M. [7 ]
Ghaly, Ramy N. R. [8 ,9 ]
机构
[1] Inst Aeronaut Engn, Dept Comp Sci & Engn Data Sci, Hyderabad 500043, Telangana, India
[2] SRM Inst Sci & Technol, Sch Comp, Dept Networking & Commun, Chennai 603203, Tamil Nadu, India
[3] Inst Aeronaut Engn, Dept Comp Sci & Engn Data Sci, Hyderabad 500043, Telangana, India
[4] BV Raju Coll, Dept Phys & Elect, Garagaparru Rd, Kovvada 534202, Andhra Prades, India
[5] Taif Univ, Coll Appl Med Sci, Taif 21944, Saudi Arabia
[6] Natl Univ Odesa Polytech, Biomed Engn Dept, UA-65044 Odesa, Ukraine
[7] Taif Univ, Coll Engn, Dept Elect Engn, Taif 21944, Saudi Arabia
[8] Mataria Tech Coll, Minist Higher Educ, Cairo 11718, Egypt
[9] Chitkara Univ, Chitkara Ctr Res & Dev, Solan 174103, Himachal Prades, India
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Colorectal cancer; Integrated CNNs; Transformers; Support vector machines; K-Means clustering; RECOGNITION; COLONOSCOPY; NETWORK; MODEL; CNN;
D O I
10.1038/s41598-025-86590-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The current work introduces the hybrid ensemble framework for the detection and segmentation of colorectal cancer. This framework will incorporate both supervised classification and unsupervised clustering methods to present more understandable and accurate diagnostic results. The method entails several steps with CNN models: ADa-22 and AD-22, transformer networks, and an SVM classifier, all inbuilt. The CVC ClinicDB dataset supports this process, containing 1650 colonoscopy images classified as polyps or non-polyps. The best performance in the ensembles was done by the AD-22 + Transformer + SVM model, with an AUC of 0.99, a training accuracy of 99.50%, and a testing accuracy of 99.00%. This group also saw a high accuracy of 97.50% for Polyps and 99.30% for Non-Polyps, together with a recall of 97.80% for Polyps and 98.90% for Non-Polyps, hence performing very well in identifying both cancerous and healthy regions. The framework proposed here uses K-means clustering in combination with the visualisation of bounding boxes, thereby improving segmentation and yielding a silhouette score of 0.73 with the best cluster configuration. It discusses how to combine feature interpretation challenges into medical imaging for accurate localization and precise segmentation of malignant regions. A good balance between performance and generalization shall be done by hyperparameter optimization-heavy learning rates; dropout rates and overfitting shall be suppressed effectively. The hybrid schema of this work treats the deficiencies of the previous approaches, such as incorporating CNN-based effective feature extraction, Transformer networks for developing attention mechanisms, and finally the fine decision boundary of the support vector machine. Further, we refine this process via unsupervised clustering for the purpose of enhancing the visualisation of such a procedure. Such a holistic framework, hence, further boosts classification and segmentation results by generating understandable outcomes for more rigorous benchmarking of detecting colorectal cancer and with higher reality towards clinical application feasibility.
引用
收藏
页数:37
相关论文
共 71 条
[1]  
Americal Cancer Society, ACS. Online
[2]  
[Anonymous], 2020, IARC Hand Books for Cancer revention, V17
[3]  
[Anonymous], 2022, Colorectal Cancer Facts & Figures 2020-2022
[4]  
[Anonymous], 2021, Cancer Statistics of Japan
[5]  
[Anonymous], 2022, Colorectal Cancer: Risk Factors and Prevention
[6]  
[Anonymous], 2016, Comprehensive guide to ultralytics YOLOv5
[7]  
Ashwath Balraj, 2015, About us
[8]  
Asif S., 2024, Multimedia Tool Appl, P1, DOI [10.1007/s11042-024-19489-x, DOI 10.1007/S11042-024-19489-X]
[9]   Advancements and Prospects of Machine Learning in Medical Diagnostics: Unveiling the Future of Diagnostic Precision [J].
Asif, Sohaib ;
Wenhui, Yi ;
ur-Rehman, Saif- ;
Ul-ain, Qurrat- ;
Amjad, Kamran ;
Yueyang, Yi ;
Jinhai, Si ;
Awais, Muhammad .
ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2025, 32 (02) :853-883
[10]   Feature Selection for Colon Cancer Detection Using K-Means Clustering and Modified Harmony Search Algorithm [J].
Bae, Jin Hee ;
Kim, Minwoo ;
Lim, J. S. ;
Geem, Zong Woo .
MATHEMATICS, 2021, 9 (05)