Technical Adequacy of Fully Automated Artificial Intelligence Body Composition Tools: Assessment in a Heterogeneous Sample of External CT Examinations

被引:12
作者
Budovec, Joseph J. [1 ]
Mautz, Alan [2 ]
机构
[1] Med Ctr Radiologists, Norfolk, VA 23507 USA
[2] Northern Light AR Gould Hosp, Presque Isle, ME USA
关键词
artificial intelligence; automated; body composition; CT; deep learning;
D O I
10.2214/AJR.22.28745
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BACKGROUND. Clinically usable artificial intelligence (AI) tools analyzing imaging studies should be robust to expected variations in study parameters. OBJECTIVE. The purposes of this study were to assess the technical adequacy of a set of automated AI abdominal CT body composition tools in a heterogeneous sample of external CT examinations performed outside of the authors’ hospital system and to explore possible causes of tool failure. METHODS. This retrospective study included 8949 patients (4256 men, 4693 women; mean age, 55.5 ± 15.9 years) who underwent 11,699 abdominal CT examinations performed at 777 unique external institutions with 83 unique scanner models from six manufacturers with images subsequently transferred to the local PACS for clinical purposes. Three independent automated AI tools were deployed to assess body composition (bone attenuation, amount and attenuation of muscle, amount of visceral and subcutaneous fat). One axial series per examination was evaluated. Technical adequacy was defined as tool output values within empirically derived reference ranges. Failures (i.e., tool output outside of reference range) were reviewed to identify possible causes. RESULTS. All three tools were technically adequate in 11,431 of 11,699 (97.7%) examinations. At least one tool failed in 268 (2.3%) of the examinations. Individual adequacy rates were 97.8% for the bone tool, 99.1% for the muscle tool, and 98.9% for the fat tool. A single type of image processing error (anisometry error, due to incorrect DICOM header voxel dimension information) accounted for 81 of 92 (88.0%) examinations in which all three tools failed, and all three tools failed whenever this error occurred. Anisometry error was the most common specific cause of failure of all tools (bone, 31.6%; muscle, 81.0%; fat, 62.8%). A total of 79 of 81 (97.5%) anisometry errors occurred on scanners from a single manufacturer; 80 of 81 (98.8%) occurred on the same scanner model. No cause of failure was identified for 59.4% of failures of the bone tool, 16.0% of failures of the muscle tool, or 34.9% of failures of the fat tool. CONCLUSION. The automated AI body composition tools had high technical adequacy rates in a heterogeneous sample of external CT examinations, supporting the generalizability of the tools and their potential for broad use. CLINICAL IMPACT. Certain causes of AI tool failure related to technical factors may be largely preventable through use of proper acquisition and reconstruction protocols. © 2023 American Roentgen Ray Society. All rights reserved.
引用
收藏
页码:134 / 134
页数:1
相关论文
共 35 条
  • [1] Welcome to the ICD-10 code for sarcopenia
    Anker, Stefan D.
    Morley, John E.
    von Haehling, Stephan
    [J]. JOURNAL OF CACHEXIA SARCOPENIA AND MUSCLE, 2016, 7 (05) : 512 - 514
  • [2] Slowing the Increase in the Population Dose Resulting from CT Scans
    Brenner, D. J.
    [J]. RADIATION RESEARCH, 2010, 174 (06) : 809 - 815
  • [3] Opportunistic screening for osteoporosis on routine computed tomography? An external validation study
    Buckens, Constantinus F.
    Dijkhuis, Gawein
    de Keizer, Bart
    Verhaar, Harald J.
    de Jong, Pim A.
    [J]. EUROPEAN RADIOLOGY, 2015, 25 (07) : 2074 - 2079
  • [4] A Machine Learning Algorithm to Estimate Sarcopenia on Abdominal CT
    Burns, Joseph E.
    Yao, Jianhua
    Chalhoub, Didier
    Chen, Joseph J.
    Summers, Ronald M.
    [J]. ACADEMIC RADIOLOGY, 2020, 27 (03) : 311 - 320
  • [5] Unreported Vertebral Body Compression Fractures at Abdominal Multidetector CT
    Carberry, George A.
    Pooler, B. Dustin
    Binkley, Neil
    Lauder, Travis B.
    Bruce, Richard J.
    Pickhardt, Perry J.
    [J]. RADIOLOGY, 2013, 268 (01) : 120 - 126
  • [6] Machine Learning in Medicine
    Deo, Rahul C.
    [J]. CIRCULATION, 2015, 132 (20) : 1920 - 1930
  • [7] Algorithm Aversion: People Erroneously Avoid Algorithms After Seeing Them Err
    Dietvorst, Berkeley J.
    Simmons, Joseph P.
    Massey, Cade
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2015, 144 (01) : 114 - 126
  • [8] Toward Generalizability in the Deployment of Artificial Intelligence in Radiology: Role of Computation Stress Testing to Overcome Underspecification
    Eche, Thomas
    Schwartz, Lawrence H.
    Mokrane, Fatima-Zohra
    Dercle, Laurent
    [J]. RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2021, 3 (06)
  • [9] Variation in Attenuation in L1 Trabecular Bone at Different Tube Voltages: Caution Is Warranted When Screening for Osteoporosis With the Use of Opportunistic CT
    Garner, Hillary W.
    Paturzo, Michelle M.
    Gaudier, Gabriela
    Pickhardt, Perry J.
    Wessell, Daniel E.
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2017, 208 (01) : 165 - 170
  • [10] Automated assessment of longitudinal biomarker changes at abdominal CT: correlation with subsequent cardiovascular events in an asymptomatic adult screening cohort
    Graffy, Peter M.
    Summers, Ronald M.
    Perez, Alberto A.
    Sandfort, Veit
    Zea, Ryan
    Pickhardt, Perry J.
    [J]. ABDOMINAL RADIOLOGY, 2021, 46 (06) : 2976 - 2984