The Effectiveness of Self-supervised Pre-training for Multi-modal Endometriosis Classification

Cited by: 2
Authors
Butler, David [1 ]
Wang, Hu [1 ]
Zhang, Yuan [1 ]
To, Minh-Son [2 ]
Condous, George [3 ]
Leonardi, Mathew [4 ]
Knox, Steven [5 ]
Avery, Jodie [6 ]
Hull, M. Louise [6 ]
Carneiro, Gustavo [7 ]
Affiliations
[1] Univ Adelaide, Australian Inst Machine Learning, Adelaide, Australia
[2] Flinders Univ S Australia, Flinders Hlth & Med Res Inst, Adelaide, Australia
[3] Omnigynaecare, Sydney, Australia
[4] McMaster Univ, Hamilton, ON, Canada
[5] Benson Radiol, Adelaide, SA, Australia
[6] Univ Adelaide, Robinson Res Inst, Adelaide, Australia
[7] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford, England
Source
2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC | 2023
Keywords
Self-supervision; Multi-modal learning; MRI; Endometriosis
DOI
10.1109/EMBC40787.2023.10340504
Chinese Library Classification
TP18 (Artificial Intelligence Theory)
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Endometriosis is a debilitating condition affecting 5% to 10% of women worldwide, where early detection and treatment are the best tools to manage the condition. Early detection can be done via surgery, but multi-modal medical imaging is preferable given its simpler and faster process. However, imaging-based endometriosis diagnosis is challenging because 1) there are few capable clinicians; and 2) the condition is characterised by small lesions unconfined to a specific location. These two issues challenge the development of endometriosis classifiers, as the training datasets tend to be small and contain difficult samples, which leads to overfitting. Hence, it is important to consider generalisation techniques to mitigate this problem, particularly self-supervised pre-training methods, which have shown outstanding results in computer vision and natural language processing applications. The main goal of this paper is to study the effectiveness of modern self-supervised pre-training techniques to overcome the two issues mentioned above for the classification of endometriosis from multi-modal imaging data. We also introduce a new masked image modelling self-supervised pre-training method that works with 3D multi-modal medical imaging. Furthermore, to the best of our knowledge, this paper presents the first endometriosis classifier, fine-tuned from the pre-trained model above, which works with multi-modal (i.e., T1 and T2) magnetic resonance imaging (MRI) data. Our results show that self-supervised pre-training improves endometriosis classification by as much as 31% compared with classifiers trained from scratch.
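The masked image modelling idea mentioned in the abstract can be illustrated with a minimal NumPy sketch: randomly hide cubic patches of a multi-channel 3D volume (e.g., T1 and T2 stacked as channels) and train a model to reconstruct only the hidden voxels. This is a generic MAE-style sketch under assumed settings (patch size, mask ratio, zero-filling), not the paper's actual method or hyperparameters.

```python
import numpy as np

def mask_patches(volume, patch=4, mask_ratio=0.75, rng=None):
    """Randomly mask cubic patches of a (C, D, H, W) volume.

    Assumes each spatial dimension is divisible by `patch`.
    The same spatial mask is applied to every modality channel.
    Returns the masked volume and the voxel-level boolean mask.
    """
    rng = rng or np.random.default_rng(0)
    c, d, h, w = volume.shape
    gd, gh, gw = d // patch, h // patch, w // patch
    n_patches = gd * gh * gw
    n_mask = int(round(n_patches * mask_ratio))
    # Choose which patches to hide.
    chosen = rng.choice(n_patches, size=n_mask, replace=False)
    patch_mask = np.zeros(n_patches, dtype=bool)
    patch_mask[chosen] = True
    patch_mask = patch_mask.reshape(gd, gh, gw)
    # Upsample the patch-level mask to voxel resolution.
    voxel_mask = np.kron(patch_mask, np.ones((patch, patch, patch), dtype=bool))
    masked = volume.copy()
    masked[:, voxel_mask] = 0.0  # zero-fill masked voxels in every modality
    return masked, voxel_mask

def reconstruction_loss(pred, target, voxel_mask):
    """MSE computed only on masked voxels, as in MAE-style pre-training."""
    diff = (pred - target)[:, voxel_mask]
    return float(np.mean(diff ** 2))
```

After pre-training with such a reconstruction objective, the encoder would be fine-tuned on the downstream classification task; restricting the loss to masked voxels is what forces the model to infer anatomy from context rather than copy the input.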
Pages: 5
Related Papers
50 items in total
  • [11] Self-supervised Pre-training for Nuclei Segmentation
    Haq, Mohammad Minhazul
    Huang, Junzhou
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 303 - 313
  • [12] Multi-Modal Contrastive Pre-training for Recommendation
    Liu, Zhuang
    Ma, Yunpu
    Schubert, Matthias
    Ouyang, Yuanxin
    Xiong, Zhang
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 99 - 108
  • [13] A SELF-SUPERVISED PRE-TRAINING FRAMEWORK FOR VISION-BASED SEIZURE CLASSIFICATION
    Hou, Jen-Cheng
    McGonigal, Aileen
    Bartolomei, Fabrice
    Thonnat, Monique
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1151 - 1155
  • [14] Self-supervised pre-training improves fundus image classification for diabetic retinopathy
    Lee, Joohyung
    Lee, Eung-Joo
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2022, 2022, 12102
  • [15] Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs
    Tao, Ruijie
    Lee, Kong Aik
    Das, Rohan Kumar
    Hautamaki, Ville
    Li, Haizhou
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1706 - 1719
  • [16] MULTI-MODAL PRE-TRAINING FOR AUTOMATED SPEECH RECOGNITION
    Chan, David M.
    Ghosh, Shalini
    Chakrabarty, Debmalya
    Hoffmeister, Bjorn
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 246 - 250
  • [17] Multi-label remote sensing classification with self-supervised gated multi-modal transformers
    Liu, Na
    Yuan, Ye
    Wu, Guodong
    Zhang, Sai
    Leng, Jie
    Wan, Lihong
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2024, 18
  • [18] Object Adaptive Self-Supervised Dense Visual Pre-Training
    Zhang, Yu
    Zhang, Tao
    Zhu, Hongyuan
    Chen, Zihan
    Mi, Siya
    Peng, Xi
    Geng, Xin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2228 - 2240
  • [19] UniVIP: A Unified Framework for Self-Supervised Visual Pre-training
    Li, Zhaowen
    Zhu, Yousong
    Yang, Fan
    Li, Wei
    Zhao, Chaoyang
    Chen, Yingying
    Chen, Zhiyang
    Xie, Jiahao
    Wu, Liwei
    Zhao, Rui
    Tang, Ming
    Wang, Jinqiao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14607 - 14616
  • [20] Representation Recovering for Self-Supervised Pre-training on Medical Images
    Yan, Xiangyi
    Naushad, Junayed
    Sun, Shanlin
    Han, Kun
    Tang, Hao
    Kong, Deying
    Ma, Haoyu
    You, Chenyu
    Xie, Xiaohui
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2684 - 2694