Incomplete Multimodal Learning for Visual Acuity Prediction After Cataract Surgery Using Masked Self-Attention

被引:3
|
作者
Zhou, Qian [1 ]
Zou, Hua [1 ]
Jiang, Haifeng [2 ]
Wang, Yong [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Aier Eye Hosp, Wuhan, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII | 2023年 / 14226卷
关键词
Incomplete Multimodal Learning; Visual Acuity; Prediction; Self-Attention;
D O I
10.1007/978-3-031-43990-2_69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the primary treatment option for cataracts, it is estimated that millions of cataract surgeries are performed each year globally. Predicting the Best Corrected Visual Acuity (BCVA) in cataract patients is crucial before surgeries to avoid medical disputes. However, accurate prediction remains a challenge in clinical practice. Traditional methods based on patient characteristics and surgical parameters have limited accuracy and often underestimate postoperative visual acuity. In this paper, we propose a novel framework for predicting visual acuity after cataract surgery using masked self-attention. Especially different from existing methods, which are based on monomodal data, our proposed method takes preoperative images and patient demographic data as input to leverage multimodal information. Furthermore, we expand our method to a more complex and challenging clinical scenario, i.e., the incomplete multimodal data. Firstly, we apply efficient Transformers to extract modality-specific features. Then, an attentional fusion network is utilized to fuse the multimodal information. To address the modality-missing problem, an attention mask mechanism is proposed to improve the robustness. We evaluate our method on a collected dataset of 1960 patients who underwent cataract surgery and compare its performance with other state-of-the-art approaches. The results show that our proposed method outperforms other methods and achieves a mean absolute error of 0.122 logMAR. The percentages of the prediction errors within +/- 0.10 logMAR are 94.3%. Besides, extensive experiments are conducted to investigate the effectiveness of each component in predicting visual acuity. Codes will be available at https://github.com/liyiersan/MSA.
引用
收藏
页码:735 / 744
页数:10
相关论文
共 50 条
  • [1] Prediction of Visual Acuity After Cataract Surgery by Deep Learning Methods Using Clinical Information and Color Fundus Photography
    Yang, Che-Ning
    Hsieh, Yi-Ting
    Yeh, Hsu-Hang
    Chu, Hsiao-Sang
    Wu, Jo-Hsuan
    Chen, Wei-Li
    CURRENT EYE RESEARCH, 2025, 50 (03) : 276 - 281
  • [2] Masked face recognition with convolutional visual self-attention network
    Ge, Yiming
    Liu, Hui
    Du, Junzhao
    Li, Zehua
    Wei, Yuheng
    NEUROCOMPUTING, 2023, 518 : 496 - 506
  • [3] AttendAffectNet-Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-Attention
    Thao, Ha Thi Phuong
    Balamurali, B. T.
    Roig, Gemma
    Herremans, Dorien
    SENSORS, 2021, 21 (24)
  • [4] Predictors of visual acuity improvement after phacoemulsification cataract surgery
    AlRyalat, Saif Aldeen
    Atieh, Duha
    AlHabashneh, Ayed
    Hassouneh, Mariam
    Toukan, Rama
    Alawamleh, Renad
    Alshammari, Taher
    Abu-Ameerh, Mohammed
    FRONTIERS IN MEDICINE, 2022, 9
  • [5] Visual acuity after cataract surgery in patients with optic neuropathies
    Aggarwal, Sahil
    Knight, Darren K.
    Shumway, Caleb L.
    Wade, Matthew
    Crow, Robert W.
    ACTA OPHTHALMOLOGICA, 2019, 97 (04) : E514 - E518
  • [6] An Optical Coherence Tomography-Based Deep Learning Algorithm for Visual Acuity Prediction of Highly Myopic Eyes After Cataract Surgery
    Wei, Ling
    He, Wenwen
    Wang, Jinrui
    Zhang, Keke
    Du, Yu
    Qi, Jiao
    Meng, Jiaqi
    Qiu, Xiaodi
    Cai, Lei
    Fan, Qi
    Zhao, Zhennan
    Tang, Yating
    Ni, Shuang
    Guo, Haike
    Song, Yunxiao
    He, Xixi
    Ding, Dayong
    Lu, Yi
    Zhu, Xiangjia
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
  • [7] Predicting Esophageal Fistula Risks Using a Multimodal Self-attention Network
    Guan, Yulu
    Cui, Hui
    Xu, Yiyue
    Jin, Qiangguo
    Feng, Tian
    Tu, Huawei
    Xuan, Ping
    Li, Wanlong
    Wang, Linlin
    Duh, Been-Lirn
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 : 721 - 730
  • [8] The automatic 'Freiburg Visual Acuity Test' used before and after cataract surgery
    Rover, J
    Bornkessel, HC
    Nisius, A
    Bach, M
    KLINISCHE MONATSBLATTER FUR AUGENHEILKUNDE, 1996, 209 (05) : 315 - 316
  • [9] Gain in visual acuity after cataract surgery improves postural stability and mobility
    Durmus, B.
    Emre, S.
    Cankaya, C.
    Baysal, O.
    Altay, Z.
    BRATISLAVA MEDICAL JOURNAL-BRATISLAVSKE LEKARSKE LISTY, 2011, 112 (12): : 701 - 705
  • [10] A Self-Attention Integrated Learning Model for Landing Gear Performance Prediction
    Lin, Lin
    Tong, Changsheng
    Guo, Feng
    Fu, Song
    Lv, Yancheng
    He, Wenhui
    SENSORS, 2023, 23 (13)