Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects

被引:59
作者
Schneeweiss, Sebastian [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, 1 Brigham Circle, Boston, MA 02120 USA
[2] Harvard Med Sch, 1 Brigham Circle, Boston, MA 02120 USA
基金
美国国家卫生研究院;
关键词
high-dimensional data; confounding (epidemiology); health care databases; real-world data; confounding adjustment; propensity scores; automation; causal conclusions; artificial intelligence; machine learning; DIMENSIONAL PROPENSITY SCORE; DISEASE RISK SCORES; MARGINAL STRUCTURAL MODELS; CONFOUNDING ADJUSTMENT; VARIABLE SELECTION; COMPARATIVE SAFETY; GASTROINTESTINAL TOXICITY; LOGISTIC-REGRESSION; MATCHED COHORT; DRUG;
D O I
10.2147/CLEP.S166545
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Decision makers in health care increasingly rely on nonrandomized database analyses to assess the effectiveness, safety, and value of medical products. Health care data scientists use data-adaptive approaches that automatically optimize confounding control to study causal treatment effects. This article summarizes relevant experiences and extensions. Methods: The literature was reviewed on the uses of high-dimensional propensity score (HDPS) and related approaches for health care database analyses, including methodological articles on their performance and improvement. Articles were grouped into applications, comparative performance studies, and statistical simulation experiments. Results: The HDPS algorithm has been referenced frequently with a variety of clinical applications and data sources from around the world. The appeal of HDPS for database research rests in 1) its superior performance in situations of unobserved confounding through proxy adjustment, 2) its predictable efficiency in extracting confounding information from a given data source, 3) its ability to automate estimation of causal treatment effects to the extent achievable in a given data source, and 4) its independence of data source and coding system. Extensions of the HDPS approach have focused on improving variable selection when exposure is sparse, using free text information and time-varying confounding adjustment. Conclusion: Semiautomated and optimized confounding adjustment in health care database analyses has proven successful across a wide range of settings. Machine-learning extensions further automate its use in estimating causal treatment effects across a range of data scenarios.
引用
收藏
页码:771 / 788
页数:18
相关论文
共 120 条
[1]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[2]  
[Anonymous], 1995, BAYESIAN DATA ANAL, DOI [DOI 10.1201/9780429258411, 10.1201/9780429258411]
[3]  
[Anonymous], 2001, Econometric Analysis of Cross-Section and Panel Data
[4]  
Arana A., 2004, PHARMACOEPIDEMIOLOGI, V13, pS330
[5]   Use of disease risk scores in pharmacoepidemiologic studies [J].
Arbogast, Patrick G. ;
Ray, Wayne A. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2009, 18 (01) :67-80
[6]   Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2009, 28 (25) :3083-3107
[7]   The use of pioglitazone and the risk of bladder cancer in people with type 2 diabetes: nested case-control study [J].
Azoulay, Laurent ;
Yin, Hui ;
Filion, Kristian B. ;
Assayag, Jonathan ;
Majdan, Agnieszka ;
Pollak, Michael N. ;
Suissa, Samy .
BMJ-BRITISH MEDICAL JOURNAL, 2012, 344
[8]   Comparison of upper gastrointestinal toxicity of rofecoxib and naproxen in patients with rheumatoid arthritis. [J].
Bombardier, C ;
Laine, L ;
Reicin, A ;
Shapiro, D ;
Burgos-Vargas, R ;
Davis, B ;
Day, R ;
Ferraz, MB ;
Hawkey, CJ ;
Hochberg, MC ;
Kvien, TK ;
Schnitzer, TJ ;
Weaver, A .
NEW ENGLAND JOURNAL OF MEDICINE, 2000, 343 (21) :1520-1528
[9]   Variable selection for propensity score models [J].
Brookhart, M. Alan ;
Schneeweiss, Sebastian ;
Rothman, Kenneth J. ;
Glynn, Robert J. ;
Avorn, Jerry ;
Sturmer, Til .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) :1149-1156
[10]   Instrumental variable methods in comparative safety and effectiveness research [J].
Brookhart, M. Alan ;
Rassen, Jeremy A. ;
Schneeweiss, Sebastian .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2010, 19 (06) :537-554