Inter-Rater Reliability of Thyroid Ultrasound Risk Criteria: A Systematic Review and Meta-Analysis

Cited by: 3
Authors
Staibano, Phillip [1]
Ham, Jennifer [1]
Chen, Jennifer [1]
Zhang, Han [1]
Gupta, Michael K. [1]
Affiliations
[1] McMaster Univ, Dept Surg, Div Otolaryngol Head & Neck Surg, 1280 Main St West, Hamilton, ON L8S 4L8, Canada
Keywords
thyroid nodules; ultrasound; inter-rater reliability; systematic review; interobserver variability; diagnostic performance; TI-RADS; American College; sonographic features; nodules; agreement; validation; reproducibility; intraobserver
DOI
10.1002/lary.30347
Chinese Library Classification number
R-3 [Medical research methods]; R3 [Basic medicine]
Discipline classification code
1001
Abstract
Objective: The most commonly employed diagnostic criteria for identifying thyroid nodules include the Thyroid Imaging Reporting and Data System (TI-RADS) and the American Thyroid Association (ATA) guidelines. The purpose of this systematic review and meta-analysis is to determine the inter-rater reliability of thyroid ultrasound criteria.

Methods: We searched MEDLINE (Ovid), EMBASE (Ovid), and Web of Science for full-text articles published from January 2005 to June 2022. We included full-text primary research articles that used TI-RADS and/or ATA guidelines to evaluate thyroid nodules in adults. Included studies had to report inter-rater reliability using any validated metric. The Quality Appraisal for Reliability Studies (QAREL) tool was used to assess study quality. We planned a random-effects meta-analysis, in addition to covariate and publication bias analyses. This study was performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines and was registered before it was conducted (International Prospective Register of Systematic Reviews, PROSPERO: CRD42021275072).

Results: Of the 951 articles identified via the database search, 35 met the eligibility criteria. All studies were observational. The most commonly used criteria were the American College of Radiology (ACR) TI-RADS and/or ATA criteria, and the majority of studies reported Kappa statistics. For ACR TI-RADS, the pooled Kappa was 0.51 (95% confidence interval [CI]: 0.42, 0.57; n = 7), while for ATA, the pooled Kappa was 0.52 (95% CI: 0.37, 0.67; n = 3). Because of the small number of studies, covariate and publication bias analyses were not performed.

Conclusion: Ultrasound criteria demonstrate moderate inter-rater reliability, but these findings are limited by poor study quality and a lack of standardization.

Laryngoscope, 2022
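The abstract reports pooled Kappa values from a random-effects meta-analysis but does not say which estimator was used. As a rough illustration only, the sketch below pools per-study estimates with the DerSimonian-Laird method, which is one common choice and is an assumption here, not the authors' confirmed procedure. The function name `pool_random_effects`, the decision to pool on the raw Kappa scale, and the input numbers are all hypothetical and are not taken from the review.

```python
import math


def pool_random_effects(estimates, std_errors, z=1.96):
    """Pool study estimates with the DerSimonian-Laird random-effects method.

    estimates  -- per-study effect estimates (e.g. Kappa values)
    std_errors -- their standard errors, in the same order
    Returns (pooled estimate, (ci_low, ci_high), tau2).
    """
    k = len(estimates)
    if k < 2 or k != len(std_errors):
        raise ValueError("need at least two studies with matching standard errors")

    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1.0 / se ** 2 for se in std_errors]
    sum_w = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, estimates)) / sum_w

    # Cochran's Q and the method-of-moments between-study variance tau^2.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, estimates))
    c = sum_w - sum(wi ** 2 for wi in w) / sum_w
    tau2 = max(0.0, (q - (k - 1)) / c)

    # Random-effects weights add tau^2 to each within-study variance.
    w_star = [1.0 / (se ** 2 + tau2) for se in std_errors]
    sum_w_star = sum(w_star)
    pooled = sum(wi * yi for wi, yi in zip(w_star, estimates)) / sum_w_star
    se_pooled = math.sqrt(1.0 / sum_w_star)

    return pooled, (pooled - z * se_pooled, pooled + z * se_pooled), tau2


if __name__ == "__main__":
    # Hypothetical Kappa values and standard errors, for illustration only.
    kappas = [0.45, 0.58, 0.50]
    ses = [0.06, 0.08, 0.05]
    pooled, ci, tau2 = pool_random_effects(kappas, ses)
    print(f"pooled={pooled:.3f}, 95% CI=({ci[0]:.3f}, {ci[1]:.3f}), tau2={tau2:.4f}")
```

When the studies agree closely, the estimated between-study variance is truncated at zero and the result matches a fixed-effect pooled estimate. With only a handful of studies, as in the ATA subgroup (n = 3), this variance is poorly estimated, so the interval should be read with caution.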
Pages: 485-493
Number of pages: 9