Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy

被引:2
作者
Cadieux, Genevieve [1 ,2 ]
Tamblyn, Robyn [2 ,3 ]
Buckeridge, David L. [2 ,4 ]
Dendukuri, Nandini [2 ]
机构
[1] Univ Toronto, Dalla Lana Sch Publ Hlth, 155 Coll St,6th Floor, Toronto, ON M5T 3M7, Canada
[2] McGill Univ, Dept Epidemiol Biostat & Occupat Hlth, Montreal, PQ, Canada
[3] McGill Univ, Direct Sante Publ Montreal, Montreal, PQ, Canada
[4] McGill Univ, Dept Med, Montreal, PQ, Canada
关键词
verification bias; validation studies; diagnostic groups; healthcare utilization data; stratified sampling; surveillance; MEDICARE CLAIMS DATA; ADMINISTRATIVE DATABASES; OUTCOMES RESEARCH; SYNDROMIC SURVEILLANCE; QUALITY IMPROVEMENT; PHYSICIAN CLAIMS; RISK-FACTORS; ACCURACY; CODES; INFORMATION;
D O I
10.1097/MLR.0000000000000324
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a "learning health system." Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence. Methods: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated. Application: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness. Conclusions: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity.
引用
收藏
页码:E59 / E67
页数:9
相关论文
共 38 条
  • [1] Towards a Unified Taxonomy of Health Indicators: Academic Health Centers and Communities Working Together to Improve Population Health
    Aguilar-Gaxiola, Sergio
    Ahmed, Syed
    Franco, Zeno
    Kissack, Anne
    Gabriel, Davera
    Hurd, Thelma
    Ziegahn, Linda
    Bates, Nancy J.
    Calhoun, Karen
    Carter-Edwards, Lori
    Corbie-Smith, Giselle
    Eder, Milton Mickey
    Ferrans, Carol
    Hacker, Karen
    Rumala, Bernice B.
    Strelnick, A. Hal
    Wallerstein, Nina
    [J]. ACADEMIC MEDICINE, 2014, 89 (04) : 564 - 572
  • [2] [Anonymous], 2003, DEF DIS ASS CRIT BIO
  • [3] ASSESSMENT OF DIAGNOSTIC-TESTS WHEN DISEASE VERIFICATION IS SUBJECT TO SELECTION BIAS
    BEGG, CB
    GREENES, RA
    [J]. BIOMETRICS, 1983, 39 (01) : 207 - 215
  • [4] Identifying patient preoperative risk factors and postoperative adverse events in administrative databases: Results from the Department of Veterans Affairs National Surgical Quality Improvement Program
    Best, WR
    Khuri, SF
    Phelan, M
    Hur, K
    Henderson, WG
    Demakis, JG
    Daley, J
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2002, 194 (03) : 257 - 266
  • [5] Evaluation of ICD-9 codes for syndromic surveillance in the electronic surveillance system for the early notification of community-based epidemics
    Betancourt, Jose A.
    Hakre, Shilpa
    Polyak, Christina S.
    Pavlin, Julie A.
    [J]. MILITARY MEDICINE, 2007, 172 (04) : 346 - 352
  • [6] Accuracy of ICD-9-CM codes for identifying cardiovascular and stroke risk factors
    Birman-Deych, E
    Waterman, AD
    Yan, Y
    Nilasena, DS
    Radford, MJ
    Gage, BF
    [J]. MEDICAL CARE, 2005, 43 (05) : 480 - 485
  • [7] The "Meaningful Use" Regulation for Electronic Health Records
    Blumenthal, David
    Tavenner, Marilyn
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2010, 363 (06) : 501 - 504
  • [8] Accuracy of syndrome definitions based on diagnoses in physician claims
    Cadieux, Genevieve
    Buckeridge, David L.
    Jacques, Andre
    Libman, Michael
    Dendukuri, Nandini
    Tamblyn, Robyn
    [J]. BMC PUBLIC HEALTH, 2011, 11
  • [9] Telehealth Ontario Detection of Gastrointestinal Illness Outbreaks
    Caudle, Jaelyn M.
    van Dijk, Adam
    Rolland, Elizabeth
    Moore, Kieran M.
    [J]. CANADIAN JOURNAL OF PUBLIC HEALTH-REVUE CANADIENNE DE SANTE PUBLIQUE, 2009, 100 (04): : 253 - 256
  • [10] Generating a reliable reference standard set for syndromic case classification
    Chapman, WW
    Dowling, JN
    Wagner, MM
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (06) : 618 - 629