Evaluation of Methods for Detection and Semantic Segmentation of the Anterior Capsulotomy in Cataract Surgery Video

被引:2
作者
Zeng, Zixue [1 ]
Giap, Binh Duong [2 ]
Kahana, Ethan [3 ]
Lustre, Jefferson [4 ]
Mahmoud, Ossama [5 ]
Mian, Shahzad, I [2 ]
Tannen, Bradford [2 ]
Nallasamy, Nambi [2 ,6 ,7 ]
机构
[1] Univ Michigan, Sch Publ Hlth, Ann Arbor, MI USA
[2] Univ Michigan, Kellogg Eye Ctr, Dept Ophthalmol & Visual Sci, Ann Arbor, MI USA
[3] Univ Michigan, Dept Comp Sci, Ann Arbor, MI USA
[4] Univ Michigan, Sch Med, Ann Arbor, MI USA
[5] Wayne State Univ, Sch Med, Detroit, MI USA
[6] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI USA
[7] Univ Michigan, Kellogg Eye Ctr, 1000 Wall St, Ann Arbor, MI 48105 USA
关键词
cataract surgery; capsulotomy; deep learning; image segmentation; image classification;
D O I
10.2147/OPTH.S453073
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Background: The capsulorhexis is one of the most important and challenging maneuvers in cataract surgery. Automated analysis of the anterior capsulotomy could aid surgical training through the provision of objective feedback and guidance to trainees. Purpose: To develop and evaluate a deep learning -based system for the automated identification and semantic segmentation of the anterior capsulotomy in cataract surgery video. Methods: In this study, we established a BigCat-Capsulotomy dataset comprising 1556 video frames extracted from 190 recorded cataract surgery videos for developing and validating the capsulotomy recognition system. The proposed system involves three primary stages: video preprocessing, capsulotomy video frame classification, and capsulotomy segmentation. To thoroughly evaluate its efficacy, we examined the performance of a total of eight deep learning -based classification models and eleven segmentation models, assessing both accuracy and time consumption. Furthermore, we delved into the factors influencing system performance by deploying it across various surgical phases. Results: The ResNet-152 model employed in the classification step of the proposed capsulotomy recognition system attained strong performance with an overall Dice coefficient of 92.21%. Similarly, the UNet model with the DenseNet-169 backbone emerged as the most effective segmentation model among those investigated, achieving an overall Dice coefficient of 92.12%. Moreover, the time consumption of the system was low at 103.37 milliseconds per frame, facilitating its application in real-time scenarios. Phase -wise analysis indicated that the Phacoemulsification phase (nuclear disassembly) was the most challenging to segment (Dice coefficient of 86.02%). Conclusion: The experimental results showed that the proposed system is highly effective in intraoperative capsulotomy recognition during cataract surgery and demonstrates both high accuracy and real-time capabilities. This system holds significant potential for applications in surgical performance analysis, education, and intraoperative guidance systems.
引用
收藏
页码:647 / 657
页数:11
相关论文
共 24 条
[1]   Anterior Capsulotomy Integrity after Femtosecond Laser-Assisted Cataract Surgery [J].
Abell, Robin G. ;
Davies, Peter E. J. ;
Phelan, David ;
Goemann, Karsten ;
McPherson, Zachary E. ;
Vote, Brendan J. .
OPHTHALMOLOGY, 2014, 121 (01) :17-24
[2]  
Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)
[3]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]   Tensor-based Feature Extraction for Pupil Recognition in Cataract Surgery [J].
Giap, Binh Duong ;
Srinivasan, Karthik ;
Mahmoud, Ossama ;
Mian, Shahzad Ihsan ;
Tannen, Bradford Laurence ;
Nallasamy, Nambi .
2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
[6]   DEVELOPMENT, ADVANTAGES, AND METHODS OF THE CONTINUOUS CIRCULAR CAPSULORHEXIS TECHNIQUE [J].
GIMBEL, HV ;
NEUHANN, T .
JOURNAL OF CATARACT AND REFRACTIVE SURGERY, 1990, 16 (01) :31-37
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[9]   The Effect of ND:YAG Laser Posterior Capsulotomy Size on Refraction, Intraocular Pressure, and Macular Thickness [J].
Karahan, Eyyup ;
Tuncer, Ibrahim ;
Zengin, Mehmet Ozgur .
JOURNAL OF OPHTHALMOLOGY, 2014, 2014
[10]   Intraocular Lens Tilt and Decentration Measured By Scheimpflug Camera Following Manual or Femtosecond Laser-created Continuous Circular Capsulotomy [J].
Kranitz, Kinga ;
Mihaltz, Kata ;
Sandor, Gabor L. ;
Takacs, Agnes ;
Knorz, Michael C. ;
Nagy, Zoltan Z. .
JOURNAL OF REFRACTIVE SURGERY, 2012, 28 (04) :259-263