Inter-Rater Reliability of Thyroid Ultrasound Risk Criteria: A Systematic Review and Meta-Analysis

被引:3
|
作者
Staibano, Phillip [1 ]
Ham, Jennifer [1 ]
Chen, Jennifer [1 ]
Zhang, Han [1 ]
Gupta, Michael K. [1 ]
机构
[1] McMaster Univ, Dept Surg, Div Otolaryngol Head & Neck Surg, 1280 Main St West, Hamilton, ON L8S 4L8, Canada
关键词
thyroid nodules; ultrasound; inter-rater reliability; systematic review; INTEROBSERVER VARIABILITY; DIAGNOSTIC PERFORMANCE; TI-RADS; AMERICAN-COLLEGE; SONOGRAPHIC FEATURES; NODULES; AGREEMENT; VALIDATION; REPRODUCIBILITY; INTRAOBSERVER;
D O I
10.1002/lary.30347
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Objective The most commonly employed diagnostic criteria for identifying thyroid nodules include Thyroid Imaging and Reporting Data System (TI-RADS) and American Thyroid Association (ATA) guidelines. The purpose of this systematic review and meta-analysis is to determine the inter-rater reliability of thyroid ultrasound criteria. Methods We performed a library search of MEDLINE (Ovid), EMBASE (Ovid), and Web of Science for full-text articles published from January 2005 to June 2022. We included full-text primary research articles that used TI-RADS and/or ATA guidelines to evaluate thyroid nodules in adults. These included studies must have calculated inter-rater reliability using any validated metric. The Quality Appraisal for Reliability Studies (QAREL) was used to assess study quality. We planned for a random-effects meta-analysis, in addition to covariate and publication bias analyses. This study was performed in accordance with Preferred Reporting Items for a Systematic Review and Meta-analysis guidelines and registered prior to conduction (International prospective register of systematic reviews-PROSPERO: CRD42021275072). Results Of the 951 articles identified via the database search, 35 met eligibility criteria. All studies were observational. The most commonly utilized criteria were ACR Thyroid Imaging and Reporting Data System (TI-RADS) and/or ATA criteria, while the majority of studies employed Kappa statistics. For ACR TI-RADS, the pooled Kappa was 0.51 (95% confidence interval [CI]: 0.42, 0.57; n = 7) while for ATA, the pooled Kappa was 0.52 (95% CI: 0.37, 0.67; n = 3). Due to the small number of studies, covariate or publication bias analyses were not performed. Conclusion Ultrasound criteria demonstrate moderate inter-rater reliability, but these findings are impacted by poor study quality and a lack of standardization. Laryngoscope, 2022
引用
收藏
页码:485 / 493
页数:9
相关论文
共 50 条
  • [1] Inter-rater reliability in performance status assessment between clinicians and patients: a systematic review and meta-analysis
    Chow, Ronald
    Zimmermann, Camilla
    Bruera, Eduardo
    Temel, Jennifer
    Im, James
    Lock, Michael
    BMJ SUPPORTIVE & PALLIATIVE CARE, 2020, 10 (02) : 129 - 135
  • [2] Inter-rater reliability in performance status assessment among healthcare professionals: an updated systematic review and meta-analysis
    Chow, Ronald
    Bruera, Eduardo
    Temel, Jennifer S.
    Krishnan, Monica
    Im, James
    Lock, Michael
    SUPPORTIVE CARE IN CANCER, 2020, 28 (05) : 2071 - 2078
  • [3] Inter-rater reliability in performance status assessment among healthcare professionals: an updated systematic review and meta-analysis
    Ronald Chow
    Eduardo Bruera
    Jennifer S. Temel
    Monica Krishnan
    James Im
    Michael Lock
    Supportive Care in Cancer, 2020, 28 : 2071 - 2078
  • [4] Evaluation of the departmental inter-rater reliability when scoring thyroid nodules according to the British Thyroid Association Ultrasound-classification model: Is there significant disagreement?
    Rtam, Nabil
    ULTRASOUND, 2024, 32 (02) : 76 - 84
  • [5] A Systematic Review and Meta-analysis on Ultrasound Detection of Thyroid Cancer in China
    Niu, Jindong
    Chen, Hongyan
    Peng, Juan
    Yuan, Hui
    IRANIAN RED CRESCENT MEDICAL JOURNAL, 2023, 25 (10)
  • [6] The inter-rater reliability of observing aggression: A systematic literature review
    Lampe, Kore G.
    Mulder, Eva A.
    Colins, Olivier F.
    Vermeiren, Robert R. J. M.
    AGGRESSION AND VIOLENT BEHAVIOR, 2017, 37 : 12 - 25
  • [7] Inter-rater reliability of the Subjective Global Assessment: A systematic literature review
    Steenson, Jessica
    Vivanti, Angela
    Isenring, Elizabeth
    NUTRITION, 2013, 29 (01) : 350 - 352
  • [8] Inter-rater reliability of trunk muscle morphometric analysis
    Valentin, Stephanie
    Yeates, Tobey DeMott
    Licka, Theresia
    Elliott, James
    JOURNAL OF BACK AND MUSCULOSKELETAL REHABILITATION, 2015, 28 (01) : 181 - 190
  • [9] Malignancy Risk of Thyroid Nodules That Are Not Classifiable by the American Thyroid Association Ultrasound Risk Stratification System: A Systematic Review and Meta-Analysis
    Kwon, Daniel
    Kulich, Marta
    Mack, Wendy J.
    Monedero, Rodrigo Martinez
    Joyo, Eri
    Angell, Trevor E.
    THYROID, 2023, 33 (05) : 593 - 602
  • [10] Inter-Rater Reliability between Structured and Non-Structured Interviews Is Fair in Schizophrenia and Bipolar Disorders-A Systematic Review and Meta-Analysis
    Rocha Neto, Helio
    Moreira, Ana Lucia R.
    Hosken, Lucas
    Langfus, Joshua A.
    Cavalcanti, Maria Tavares
    Youngstrom, Eric Arden
    Telles-Correia, Diogo
    DIAGNOSTICS, 2023, 13 (03)