Incomplete Multimodal Learning for Visual Acuity Prediction After Cataract Surgery Using Masked Self-Attention

被引：3

作者：

Zhou, Qian ^{[1
]}

Zou, Hua ^{[1
]}

Jiang, Haifeng ^{[2
]}

Wang, Yong ^{[2
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] Wuhan Univ, Aier Eye Hosp, Wuhan, Peoples R China

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII | 2023年 / 14226卷

关键词：

Incomplete Multimodal Learning; Visual Acuity; Prediction; Self-Attention;

D O I：

10.1007/978-3-031-43990-2_69

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the primary treatment option for cataracts, it is estimated that millions of cataract surgeries are performed each year globally. Predicting the Best Corrected Visual Acuity (BCVA) in cataract patients is crucial before surgeries to avoid medical disputes. However, accurate prediction remains a challenge in clinical practice. Traditional methods based on patient characteristics and surgical parameters have limited accuracy and often underestimate postoperative visual acuity. In this paper, we propose a novel framework for predicting visual acuity after cataract surgery using masked self-attention. Especially different from existing methods, which are based on monomodal data, our proposed method takes preoperative images and patient demographic data as input to leverage multimodal information. Furthermore, we expand our method to a more complex and challenging clinical scenario, i.e., the incomplete multimodal data. Firstly, we apply efficient Transformers to extract modality-specific features. Then, an attentional fusion network is utilized to fuse the multimodal information. To address the modality-missing problem, an attention mask mechanism is proposed to improve the robustness. We evaluate our method on a collected dataset of 1960 patients who underwent cataract surgery and compare its performance with other state-of-the-art approaches. The results show that our proposed method outperforms other methods and achieves a mean absolute error of 0.122 logMAR. The percentages of the prediction errors within +/- 0.10 logMAR are 94.3%. Besides, extensive experiments are conducted to investigate the effectiveness of each component in predicting visual acuity. Codes will be available at https://github.com/liyiersan/MSA.

引用

页码：735 / 744

页数：10

共 50 条

[1] Prediction of Visual Acuity After Cataract Surgery by Deep Learning Methods Using Clinical Information and Color Fundus Photography
Yang, Che-Ning
Hsieh, Yi-Ting
Yeh, Hsu-Hang
Chu, Hsiao-Sang
Wu, Jo-Hsuan
Chen, Wei-Li
CURRENT EYE RESEARCH, 2025, 50 (03) : 276 - 281
[2] Masked face recognition with convolutional visual self-attention network
Ge, Yiming
Liu, Hui
Du, Junzhao
Li, Zehua
Wei, Yuheng
NEUROCOMPUTING, 2023, 518 : 496 - 506
[3] AttendAffectNet-Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-Attention
Thao, Ha Thi Phuong
Balamurali, B. T.
Roig, Gemma
Herremans, Dorien
SENSORS, 2021, 21 (24)
[4] Predictors of visual acuity improvement after phacoemulsification cataract surgery
AlRyalat, Saif Aldeen
Atieh, Duha
AlHabashneh, Ayed
Hassouneh, Mariam
Toukan, Rama
Alawamleh, Renad
Alshammari, Taher
Abu-Ameerh, Mohammed
FRONTIERS IN MEDICINE, 2022, 9
[5] Visual acuity after cataract surgery in patients with optic neuropathies
Aggarwal, Sahil
Knight, Darren K.
Shumway, Caleb L.
Wade, Matthew
Crow, Robert W.
ACTA OPHTHALMOLOGICA, 2019, 97 (04) : E514 - E518
[6] An Optical Coherence Tomography-Based Deep Learning Algorithm for Visual Acuity Prediction of Highly Myopic Eyes After Cataract Surgery
Wei, Ling
He, Wenwen
Wang, Jinrui
Zhang, Keke
Du, Yu
Qi, Jiao
Meng, Jiaqi
Qiu, Xiaodi
Cai, Lei
Fan, Qi
Zhao, Zhennan
Tang, Yating
Ni, Shuang
Guo, Haike
Song, Yunxiao
He, Xixi
Ding, Dayong
Lu, Yi
Zhu, Xiangjia
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
[7] Predicting Esophageal Fistula Risks Using a Multimodal Self-attention Network
Guan, Yulu
Cui, Hui
Xu, Yiyue
Jin, Qiangguo
Feng, Tian
Tu, Huawei
Xuan, Ping
Li, Wanlong
Wang, Linlin
Duh, Been-Lirn
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 : 721 - 730
[8] The automatic 'Freiburg Visual Acuity Test' used before and after cataract surgery
Rover, J
Bornkessel, HC
Nisius, A
Bach, M
KLINISCHE MONATSBLATTER FUR AUGENHEILKUNDE, 1996, 209 (05) : 315 - 316
[9] Gain in visual acuity after cataract surgery improves postural stability and mobility
Durmus, B.
Emre, S.
Cankaya, C.
Baysal, O.
Altay, Z.
BRATISLAVA MEDICAL JOURNAL-BRATISLAVSKE LEKARSKE LISTY, 2011, 112 (12): : 701 - 705
[10] A Self-Attention Integrated Learning Model for Landing Gear Performance Prediction
Lin, Lin
Tong, Changsheng
Guo, Feng
Fu, Song
Lv, Yancheng
He, Wenhui
SENSORS, 2023, 23 (13)

← 1 2 3 4 5 →