Beyond PhacoTrainer: Deep Learning for Enhanced Trabecular Meshwork Detection in MIGS Videos

被引:0
|
作者
Kara, Su [1 ]
Yang, Michael [1 ]
Yeh, Hsu-Hang [2 ]
Sen, Simmi [1 ]
Hwang, Hannah H. [3 ]
Wang, Sophia Y. [1 ]
机构
[1] Stanford Univ, Dept Ophthalmol, Palo Alto, CA USA
[2] Natl Taiwan Univ, Dept Ophthalmol, Taipei, Taiwan
[3] Cornell Univ, Weill Cornell Sch Med, New York, NY USA
来源
关键词
deep learning; trabecular meshwork (TM); transfer learning; minimally invasive glaucoma surgery (MIGS); surgical video analysis;
D O I
10.1167/tvst.13.9.5
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Purpose: The purpose of this study was to develop deep learning models for surgical video analysis, capable of identifying minimally invasive glaucoma surgery (MIGS) and locating the trabecular meshwork (TM). Methods: For classification of surgical steps, we had 313 video files (265 for cataract surgery and 48 for MIGS procedures), and for TM segmentation, we had 1743 frames (1110 for TM and 633 for no TM). We used transfer learning to update classification model pretrained to recognize standard cataract surgical steps, enabling it to also identify MIGS procedures. For TM localization, we developed three different models: U-Net, Y-Net, and Cascaded. Segmentation accuracy for TM was measured by calculating the average pixel error between the predicted and ground truth TM locations. Results: Using transfer learning, we developed a model which achieved 87% accuracy for MIGS frame classification, with area under the receiver operating characteristic curve (AUROC) of 0.99. This model maintained a 79% accuracy for identifying 14 standard cataract surgery steps. The overall micro-averaged AUROC was 0.98. The U-Net model excelled in TM segmentation with an Intersection over union (IoU) score of 0.9988 and an average pixel error of 1.47. Conclusions: Building on prior work developing computer vision models for cataract surgical video, we developed models that recognize MIGS procedures and precisely localize the TM with superior performance. Our work demonstrates the potential of transfer learning for extending our computer vision models to new surgeries without the need for extensive additional data collection. Translational Relevance: Computer vision models in surgical videos can underpin the development of systems offering automated feedback for trainees, improving surgical training and patient care.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] PhacoTrainer: Deep Learning for Activity Recognition in Cataract Surgical Videos
    Yeh, Hsu-Hang
    Jain, Anjal
    Fox, Olivia
    Wang, Sophia
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)
  • [2] PhacoTrainer: Deep Learning for Cataract Surgical Videos to Track Surgical Tools
    Yeh, Hsu-Hang
    Jain, Anjal M.
    Fox, Olivia
    Sebov, Kostya
    Wang, Sophia Y.
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2023, 12 (03):
  • [3] PhacoTrainer: Deep Learning for Cataract Surgical Videos to Track Surgical Tools
    Yeh, Hsu-Hang
    Jain, Anjal M.
    Jallow, Mariama
    Sebov, Kostya
    Wang, Sophia Y.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2022, 63 (07)
  • [4] PhacoTrainer: A Multicenter Study of Deep Learning for Activity Recognition in Cataract Surgical Videos
    Yeh, Hsu-Hang
    Jain, Anjal M.
    Fox, Olivia
    Wang, Sophia Y.
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2021, 10 (13):
  • [5] Accurate Identification of the Trabecular Meshwork under Gonioscopic View in Real Time Using Deep Learning
    Lin, Ken Y.
    Urban, Gregor
    Yang, Michael C.
    Lee, Lung-Chi
    Lu, Da-Wen
    Alward, Wallace L. M.
    Baldi, Pierre
    OPHTHALMOLOGY, 2022, 129 (09) : 970 - 970
  • [6] Accurate Identification of the Trabecular Meshwork under Gonioscopic View in Real Time Using Deep Learning
    Lin, Ken Y.
    Urban, Gregor
    Yang, Michael C.
    Lee, Lung-Chi
    Lu, Da-Wen
    Alward, Wallace L. M.
    Baldi, Pierre
    OPHTHALMOLOGY GLAUCOMA, 2022, 5 (04): : 402 - 412
  • [7] Deep Learning based Face Liveness Detection in Videos
    Akbulut, Yaman
    Sengur, Abdulkadir
    Budak, Umit
    Ekici, Sami
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [8] Abnormal behavior detection in videos using deep learning
    Jun Wang
    Limin Xia
    Cluster Computing, 2019, 22 : 9229 - 9239
  • [9] Abnormal behavior detection in videos using deep learning
    Wang, Jun
    Xia, Limin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 4): : S9229 - S9239
  • [10] Application of Deep Learning for Weapons Detection in Surveillance Videos
    Hashmi, Tufail Sajjad Shah
    Ul Haq, Nazeef
    Fraz, Muhammad Moazam
    Shahzad, Muhammad
    2021 INTERNATIONAL CONFERENCE ON DIGITAL FUTURES AND TRANSFORMATIVE TECHNOLOGIES (ICODT2), 2021,