Improving breast cancer diagnostics with deep learning for MRI

被引:79
作者
Witowski, Jan [1 ,2 ]
Heacock, Laura [1 ]
Reig, Beatriu [1 ]
Kang, Stella K. [1 ,3 ]
Lewin, Alana [1 ]
Pysarenko, Kristine [1 ]
Patel, Shalin [1 ]
Samreen, Naziya [1 ]
Rudnicki, Wojciech [4 ]
Luczynska, Elzbieta [4 ]
Popiela, Tadeusz [5 ]
Moy, Linda [1 ,2 ,6 ,7 ]
Geras, Krzysztof J. [1 ,2 ,6 ,7 ,8 ,9 ]
机构
[1] New York Univ, Dept Radiol, Grossman Sch Med, New York, NY 10016 USA
[2] New York Univ, Ctr Adv Innovat & Res, New York, NY 10016 USA
[3] New York Univ, Dept Populat Hlth, Grossman Sch Med, New York, NY 10016 USA
[4] Jagiellonian Univ, Electradiol Dept, Med Coll, PL-31126 Krakow, Poland
[5] Jagiellonian Univ, Chair Radiol, Med Coll, PL-31501 Krakow, Poland
[6] New York Univ, Vilcek Inst Grad Biomed Sci, Grossman Sch Med, New York, NY 10016 USA
[7] New York Univ Langone Hlth, Perlmutter Canc Ctr, New York, NY 10016 USA
[8] New York Univ, Ctr Data Sci, New York, NY 10011 USA
[9] New York Univ, Courant Inst Math Sci, Dept Comp Sci, New York, NY 10012 USA
基金
美国国家卫生研究院;
关键词
MULTIREADER; ACCURACY; CURVES; SYSTEM; RISK;
D O I
10.1126/scitranslmed.abo4802
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has a high sensitivity in detecting breast cancer but often leads to unnecessary biopsies and patient workup. We used a deep learning (DL) system to improve the overall accuracy of breast cancer diagnosis and personalize management of patients undergoing DCE-MRI. On the internal test set (n = 3936 exams), our system achieved an area under the receiver operating characteristic curve (AUROC) of 0.92 (95% CI: 0.92 to 0.93). In a retrospective reader study, there was no statistically significant difference (P = 0.19) between five board-certified breast radiologists and the DL system (mean Delta AUROC, +0.04 in favor of the DL system). Radiologists' performance improved when their predictions were averaged with DL's predictions [mean Delta AUPRC (area under the precision-recall curve), +0.07]. We demonstrated the generalizability of the DL system using multiple datasets from Poland and the United States. An additional reader study on a Polish dataset showed that the DL system was as robust to distribution shift as radiologists. In subgroup analysis, we observed consistent results across different cancer subtypes and patient demographics. Using decision curve analysis, we showed that the DL system can reduce unnecessary biopsies in the range of clinically relevant risk thresholds. This would lead to avoiding biopsies yielding benign results in up to 20% of all patients with BI-RADS category 4 lesions. Last, we performed an error analysis, investigating situations where DL predictions were mostly incorrect. This exploratory work creates a foundation for deployment and prospective analysis of DL-based models for breast MRI.
引用
收藏
页数:13
相关论文
共 65 条
[41]   Prospective assessment of breast cancer risk from multimodal multiview ultrasound images via clinically applicable deep learning [J].
Qian, Xuejun ;
Pei, Jing ;
Zheng, Hui ;
Xie, Xinxin ;
Yan, Lin ;
Zhang, Hao ;
Han, Chunguang ;
Gao, Xiang ;
Zhang, Hanqi ;
Zheng, Weiwei ;
Sun, Qiang ;
Lu, Lu ;
Shung, K. Kirk .
NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) :522-+
[42]  
Raghu M, 2019, ADV NEUR IN, V32
[43]  
Randolph JJ., 2005, JOENS LEARN INSTR S
[44]   Radiomics and deep learning methods in expanding the use of screening breast MRI [J].
Reig, Beatriu .
EUROPEAN RADIOLOGY, 2021, 31 (08) :5863-5865
[45]   pROC: an open-source package for R and S plus to analyze and compare ROC curves [J].
Robin, Xavier ;
Turck, Natacha ;
Hainard, Alexandre ;
Tiberti, Natalia ;
Lisacek, Frederique ;
Sanchez, Jean-Charles ;
Mueller, Markus .
BMC BIOINFORMATICS, 2011, 12
[46]   Stand-Alone Artificial Intelligence for Breast Cancer Detection in Mammography: Comparison With 101 Radiologists [J].
Rodriguez-Ruiz, Alejandro ;
Lang, Kristina ;
Gubern-Merida, Albert ;
Broeders, Mireille ;
Gennaro, Gisella ;
Clauser, Paola ;
Helbich, Thomas H. ;
Chevalier, Margarita ;
Tan, Tao ;
Mertelmeier, Thomas ;
Wallis, Matthew G. ;
Andersson, Ingvar ;
Zackrisson, Sophia ;
Mann, Ritse M. ;
Sechopoulos, Ioannis .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2019, 111 (09) :916-922
[47]  
Saha Ashirbani, 2021, TCIA, DOI 10.7937/TCIA.E3SV-RE93
[48]   A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 DCE-MRI features [J].
Saha, Ashirbani ;
Harowicz, Michael R. ;
Grimm, Lars J. ;
Kim, Connie E. ;
Ghate, Sujata V. ;
Walsh, Ruth ;
Mazurowski, Maciej A. .
BRITISH JOURNAL OF CANCER, 2018, 119 (04) :508-516
[49]   Artificial intelligence system reduces false-positive findings in the interpretation of breast ultrasound exams [J].
Shen, Yiqiu ;
Shamout, Farah E. ;
Oliver, Jamie R. ;
Witowski, Jan ;
Kannan, Kawshik ;
Park, Jungkyu ;
Wu, Nan ;
Huddleston, Connor ;
Wolfson, Stacey ;
Millet, Alexandra ;
Ehrenpreis, Robin ;
Awal, Divya ;
Tyma, Cathy ;
Samreen, Naziya ;
Gao, Yiming ;
Chhor, Chloe ;
Gandhi, Stacey ;
Lee, Cindy ;
Kumari-Subaiya, Sheila ;
Leonard, Cindy ;
Mohammed, Reyhan ;
Moczulski, Christopher ;
Altabet, Jaime ;
Babb, James ;
Lewin, Alana ;
Reig, Beatriu ;
Moy, Linda ;
Heacock, Laura ;
Geras, Krzysztof J. .
NATURE COMMUNICATIONS, 2021, 12 (01)
[50]   A survey on Image Data Augmentation for Deep Learning [J].
Shorten, Connor ;
Khoshgoftaar, Taghi M. .
JOURNAL OF BIG DATA, 2019, 6 (01)