Reliable quality assurance of X-ray mammography scanner by evaluation the standard mammography phantom image using an interpretable deep learning model

被引:6
作者
Oh, Jang-Hoon [1 ]
Kim, Hyug-Gi [2 ]
Lee, Kyung Mi [2 ]
Ryu, Chang-Woo [3 ]
机构
[1] Kyung Hee Univ, Grad Sch, Dept Biomed Sci & Technol, 23 Kyungheedae Ro, Seoul 02447, South Korea
[2] Kyung Hee Univ, Coll Med, Dept Radiol, Kyung Hee Univ Hosp, 23 Kyungheedae Ro, Seoul 02447, South Korea
[3] Kyung Hee Univ, Kyung Hee Univ Hosp Gangdong, Dept Radiol, Coll Med, 892 Dongnam Ro, Seoul 05278, South Korea
基金
新加坡国家研究基金会;
关键词
Artificial intelligence; Deep learning; Interpretability; Explainability; Mammography; Phantom; Quality Control; ACCREDITATION PHANTOM;
D O I
10.1016/j.ejrad.2022.110369
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objective: Mammography is the initial examination to detect breast cancer symptoms, and quality control of mammography devices is crucial to maintain accurate diagnosis and to safeguard against degradation of performance. The objective of this study was to assist radiologists in mammography phantom image evaluation by developing and validating an interpretable deep learning model capable of objectively evaluating the quality of standard phantom images for mammography.Materials and Methods: A total of 2,208 mammography phantom images were collected for periodic accreditation of the scanner from 1,755 institutions. The dataset was randomly split into training (1,808 images) and testing (400 images) datasets with subgroups (76 images) for the multi-reader study. To develop an interpretable model that contains two deep learning networks in series, five processing steps were performed: mammography phantom detection, phantom object detection, post-processing, score evaluation, and a report with a comment about ambiguous results. Results: For phantom detection, the accuracy and mean intersection over union (mIOU) were 1.00 and 0.938 in the test dataset, respectively. During phantom object detection, a total of 6,369 out of 6,400 objects were detected as the correct object class, and the accuracy and mIOU were 0.995 and 0.813, respectively. The predicted score for each object showed a consensus of 97.40% excluding ambiguous points and 59.10% for ambiguous points of the groups.Conclusions: The interpretable deep learning model using large-scale data from multiple centers shows high performance and reasonable object scoring, successfully validating the reliability and feasibility of mammography phantom image quality management.
引用
收藏
页数:10
相关论文
共 16 条
  • [1] Deep Learning to Distinguish Recalled but Benign Mammography Images in Breast Cancer Screening
    Aboutalib, Sarah S.
    Mohamed, Aly A.
    Berg, Wendie A.
    Zuley, Margarita L.
    Sumkin, Jules H.
    Wu, Shandong
    [J]. CLINICAL CANCER RESEARCH, 2018, 24 (23) : 5902 - 5909
  • [2] SUBJECTIVE EVALUATIONS OF MAMMOGRAPHIC ACCREDITATION PHANTOM IMAGES BY 3 OBSERVER GROUPS
    BROOKS, KW
    TRUEBLOOD, JH
    KEARFOTT, KJ
    [J]. INVESTIGATIVE RADIOLOGY, 1994, 29 (01) : 42 - 47
  • [3] Eric A., 2018, DIGITAL MAMMOGRAPHY
  • [4] Automated deep learning design for medical image classification by health-care professionals with no coding experience: a feasibility study
    Faes, Livia
    Wagner, Siegfried K.
    Fu, Dun Jack
    Liu, Xiaoxuan
    Korot, Edward
    Ledsam, Joseph R.
    Back, Trevor
    Chopra, Reena
    Pontikos, Nikolas
    Kern, Christoph
    Moraes, Gabriella
    Schmid, Martin K.
    Sim, Dawn
    Balaskas, Konstantinos
    Bachmann, Lucas M.
    Denniston, Alastair K.
    Keane, Pearse A.
    [J]. LANCET DIGITAL HEALTH, 2019, 1 (05): : E232 - E242
  • [5] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [6] Extensions of the External Validation for Checking Learned Model Interpretability and Generalizability
    Ho, Sung Yang
    Phua, Kimberly
    Wong, Limsoon
    Bin Goh, Wilson Wen
    [J]. PATTERNS, 2020, 1 (08):
  • [7] How good is the ACR accreditation phantom for assessing image quality in digital mammography?'
    Huda, W
    Sajewicz, AM
    Ogden, KM
    Scalzetti, EM
    Dance, DR
    [J]. ACADEMIC RADIOLOGY, 2002, 9 (07) : 764 - 772
  • [8] Improvement diagnostic accuracy of sinusitis recognition in paranasal sinus X-ray using multiple deep learning models
    Kim, Hyug-Gi
    Lee, Kyung Mi
    Kim, Eui Jong
    Lee, Jin San
    [J]. QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2019, 9 (06) : 942 - 951
  • [9] Benefits and Harms of Breast Cancer Screening A Systematic Review
    Myers, Evan R.
    Moorman, Patricia
    Gierisch, Jennifer M.
    Havrilesky, Laura J.
    Grimm, Lars J.
    Ghate, Sujata
    Davidson, Brittany
    Mongtomery, Ranee Chatterjee
    Crowley, Matthew J.
    McCrory, Douglas C.
    Kendrick, Amy
    Sanders, Gillian D.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 314 (15): : 1615 - 1634
  • [10] Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist
    Norgeot, Beau
    Quer, Giorgio
    Beaulieu-Jones, Brett K.
    Torkamani, Ali
    Dias, Raquel
    Gianfrancesco, Milena
    Arnaout, Rima
    Kohane, Isaac S.
    Saria, Suchi
    Topol, Eric
    Obermeyer, Ziad
    Yu, Bin
    Butte, Atul J.
    [J]. NATURE MEDICINE, 2020, 26 (09) : 1320 - 1324