Reader bias in breast cancer screening related to cancer prevalence and artificial intelligence decision support-a reader study

被引:14
作者
Al-Bazzaz, Hanen [1 ]
Janicijevic, Marina [1 ]
Strand, Fredrik [2 ,3 ]
机构
[1] Malarsjukhuset Eskilstuna, Kungsvagen 42, S-63349 Eskilstuna, Sweden
[2] Karolinska Inst, Dept Oncol Pathol, L2 03,Karolinska Vagen 8, S-17164 Solna, Sweden
[3] Karolinska Univ Hosp, Breast Radiol, Med Diagnost Karolinska, NB1 03,Gavlegatan 55, S-17176 Stockholm, Sweden
关键词
Breast; Cancer screening; Mammography; Artificial intelligence; Bias;
D O I
10.1007/s00330-023-10514-5
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectivesThe aim of our study was to examine how breast radiologists would be affected by high cancer prevalence and the use of artificial intelligence (AI) for decision support.Materials and methodThis reader study was based on selection of screening mammograms, including the original radiologist assessment, acquired in 2010 to 2013 at the Karolinska University Hospital, with a ratio of 1:1 cancer versus healthy based on a 2-year follow-up. A commercial AI system generated an exam-level positive or negative read, and image markers. Double-reading and consensus discussions were first performed without AI and later with AI, with a 6-week wash-out period in between. The chi-squared test was used to test for differences in contingency tables.ResultsMammograms of 758 women were included, half with cancer and half healthy. 52% were 40-55 years; 48% were 56-75 years. In the original non-enriched screening setting, the sensitivity was 61% (232/379) at specificity 98% (323/379). In the reader study, the sensitivity without and with AI was 81% (307/379) and 75% (284/379) respectively (p<0.001). The specificity without and with AI was 67% (255/379) and 86% (326/379) respectively (p<0.001). The tendency to change assessment from positive to negative based on erroneous AI information differed between readers and was affected by type and number of image signs of malignancy.ConclusionBreast radiologists reading a list with high cancer prevalence performed at considerably higher sensitivity and lower specificity than the original screen-readers. Adding AI information, calibrated to a screening setting, decreased sensitivity and increased specificity.Clinical relevance statementRadiologist screening mammography assessments will be biased towards higher sensitivity and lower specificity by high-risk triaging and nudged towards the sensitivity and specificity setting of AI reads. After AI implementation in clinical practice, there is reason to carefully follow screening metrics to ensure the impact is desired.
引用
收藏
页码:5415 / 5424
页数:10
相关论文
共 16 条
[1]   Automation Bias in Breast AI [J].
Baltzer, Pascal A. T. .
RADIOLOGY, 2023, 307 (04)
[2]   Artificial intelligence for breast cancer detection in screening mammography in Sweden: a prospective, population-based, paired-reader, non-inferiority study [J].
Dembrower, Karin ;
Crippa, Alessio ;
Colon, Eugenia ;
Eklund, Martin .
LANCET DIGITAL HEALTH, 2023, 5 (10) :e703-e711
[3]   Implications for downstream workload based on calibrating an artificial intelligence detection algorithm by standalone-reader or combined-reader sensitivity matching [J].
Dembrower, Karin ;
Salim, Mattie ;
Eklund, Martin ;
Lindholm, Peter ;
Strand, Fredrik .
JOURNAL OF MEDICAL IMAGING, 2023, 10
[4]  
Dembrower K, 2020, LANCET DIGIT HEALTH, V2, pE468, DOI 10.1016/S2589-7500(20)30185-0
[5]   A Multi-million Mammography Image Dataset and Population-Based Screening Cohort for the Training and Evaluation of Deep Neural Networks-the Cohort of Screen-Aged Women (CSAW) [J].
Dembrower, Karin ;
Lindholm, Peter ;
Strand, Fredrik .
JOURNAL OF DIGITAL IMAGING, 2020, 33 (02) :408-413
[6]   Automation Bias in Mammography: The Impact of Artificial Intelligence BI-RADS Suggestions on Reader Performance [J].
Dratsch, Thomas ;
Chen, Xue ;
Mehrizi, Mohammad Rezazade ;
Kloeckner, Roman ;
Maehringer-Kunz, Aline ;
Puesken, Michael ;
Baessler, Bettina ;
Sauer, Stephanie ;
Maintz, David ;
Santos, Daniel Pinto dos .
RADIOLOGY, 2023, 307 (04)
[7]   Breast imaging: Beyond the detection [J].
Galati, Francesca ;
Moffa, Giuliana ;
Pediconi, Federica .
EUROPEAN JOURNAL OF RADIOLOGY, 2022, 146
[8]   Can artificial intelligence reduce the interval cancer rate in mammography screening? [J].
Lang, Kristina ;
Hofvind, Solveig ;
Rodriguez-Ruiz, Alejandro ;
Andersson, Ingvar .
EUROPEAN RADIOLOGY, 2021, 31 (08) :5940-5947
[9]   Artificial Intelligence Evaluation of 122 969 Mammography Examinations from a Population-based Screening Program [J].
Larsen, Marthe ;
Aglen, Camilla F. ;
Lee, Christoph, I ;
Hoff, Solveig R. ;
Lund-Hanssen, Hakon ;
Lang, Kristina ;
Nygard, Jan F. ;
Ursin, Giske ;
Hofvind, Solveig .
RADIOLOGY, 2022, 303 (03) :502-511
[10]   An Artificial Intelligence-based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload [J].
Lauritzen, Andreas D. ;
Rodriguez-Ruiz, Alejandro ;
Von Euler-Chelpin, My Catarina ;
Lynge, Elsebeth ;
Vejborg, Ilse ;
Nielsen, Mads ;
Karssemeijer, Nico ;
Lillholm, Martin .
RADIOLOGY, 2022, 304 (01) :41-49