Inter-Rater Reliability of Quantifying Pleural B-Lines Using Multiple Counting Methods

被引:74
|
作者
Anderson, Kenton L. [1 ,2 ]
Fields, J. Matthew [1 ,3 ]
Panebianco, Nova L. [1 ]
Jenq, Katherine Y. [4 ]
Marin, Jennifer [5 ]
Dean, Anthony J. [1 ]
机构
[1] Hosp Univ Penn, Philadelphia, PA 19104 USA
[2] San Antonio Mil Med Ctr, Dept Emergency Med, Ft Sam Houston, TX 78234 USA
[3] Thomas Jefferson Univ Hosp, Philadelphia, PA 19107 USA
[4] Penn Hosp, Philadelphia, PA 19107 USA
[5] Childrens Hosp Pittsburgh, Pittsburgh, PA 15213 USA
关键词
B-lines; inter-rater reliability; lung sonography; pleural sonography; pulmonary edema; EXTRAVASCULAR LUNG WATER; COMET-TAIL ARTIFACT; ALVEOLAR-INTERSTITIAL SYNDROME; PULMONARY-EDEMA; HEART-FAILURE; ULTRASOUND SIGN; DIAGNOSIS; SONOGRAPHY; ULTRASONOGRAPHY; PRESSURE;
D O I
10.7863/jum.2013.32.1.115
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Objectives. Sonographic B-lines are a sign of increased extravascular lung water. Several techniques for quantifying B-lines within individual rib spaces have been described, as well as different methods for "scoring" the cumulative B-line counts over the entire thorax. The interobserver reliability of these methods is unknown. This study examined 3 methods of quantifying B-lines for inter-rater reliability. Methods-Videotaped pleural assessments of adult patients presenting to the emergency department with dyspnea and suspected acute heart failure were reviewed by 3 blinded pairs of emergency physicians. Each pair performed B-line counts within single rib spaces using 1 of the following 3 predetermined methods: 1, individual B-lines are counted over an entire respiratory cycle; 2, as per method 1, but confluent B-lines are counted as multiple based on the percentage of the rib space they occupy; and 3, as per method 2, but the count is made at the moment when the most B-lines are seen, not over an entire respiratory cycle. A single-measures interclass correlation coefficient was used to assess inter-rater reliability for the 3 definitions of B-line counts. Results-A total of 456 video clips were reviewed. The interclass correlation coefficients (95% confidence intervals) for methods 1, 2, and 3 were 0.84 (0.81-0.87), 0.87 (0.85-0.90), and 0.89 (0.87-0.91), respectively. The difference between methods 1 and 3 was significant (P =.003). Conclusions-All methods of B-line quantification showed substantial inter-rater agreement. Method 3 is more reliable than method 1. There were no other significant differences between the methods. We recommend the use of method 3 because it is technically simpler to perform and more reliable than method 1.
引用
收藏
页码:115 / 120
页数:6
相关论文
共 27 条
  • [21] Inter-rater reliability of functional MRI data quality control assessments: A standardised protocol and practical guide using pyfMRIqc
    Williams, Brendan
    Hedger, Nicholas
    McNabb, Carolyn B.
    Rossetti, Gabriella M. K.
    Christakou, Anastasia
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [22] Test-Retest, Inter-Rater and Intra-Rater Reliability for Spatiotemporal Gait Parameters Using SANE (an eaSy gAit aNalysis systEm) as Measuring Instrument
    Chaparro-Rico, Betsy D. M.
    Cafolla, Daniele
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [23] Inter-Rater Reliability and Agreement Among Mass-Casualty Incident Algorithms Using a Pediatric Trauma Dataset: A Pilot Study
    Fisher, Emilia H.
    Claudius, Ilene
    Kaji, Amy H.
    Shaban, Alaa
    McGlynn, Nicholas
    Cicero, Mark X.
    Santillanes, Genevieve
    Gausche-Hill, Marianne
    Chang, Todd P.
    Donofrio-Odmann, J. Joelle
    PREHOSPITAL AND DISASTER MEDICINE, 2022, 37 (03) : 306 - 313
  • [24] Checkbox grading of large-scale mathematics exams with multiple assessors: Field study on assessors' inter-rater reliability, time investment and usage experience
    Moons, Filip
    Vandervieren, Ellen
    Colpaert, Jozef
    STUDIES IN EDUCATIONAL EVALUATION, 2025, 85
  • [25] The determination of appropriate coefficient indices for inter-rater reliability: Using classroom observation instruments as fidelity measures in large-scale randomized research
    Tong, Fuhui
    Tang, Shifang
    Irby, Beverly J.
    Lara-Alecio, Rafael
    Guerrero, Cindy
    INTERNATIONAL JOURNAL OF EDUCATIONAL RESEARCH, 2020, 99
  • [26] Inter-rater and intra-rater reliability of belt-stabilized hand-held dynamometer using make test in lower extremity muscle strength assessment among healthy adults
    Espino, Reil Vinard S.
    Gonzalez-Suarez, Consuelo
    Ingram, Lewis A.
    Gomez, Ivan Neil B.
    Agarao, Kris Anthony T.
    Flores, Jazzmine Gale
    Bulseco, Nicole
    INTERNATIONAL JOURNAL OF OSTEOPATHIC MEDICINE, 2025, 56
  • [27] Assessment of medical student clinical reasoning by "lay" vs physician raters: inter-rater reliability using a scoring guide in a multidisciplinary objective structured clinical examination
    Berger, Alexandra J.
    Gillespie, Colleen C.
    Tewksbury, Linda R.
    Overstreet, Ivey M.
    Tsai, Ming C.
    Kalet, Adina L.
    Ogilvie, Jennifer B.
    AMERICAN JOURNAL OF SURGERY, 2012, 203 (01) : 81 - 86