Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects

被引:59
作者
Schneeweiss, Sebastian [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, 1 Brigham Circle, Boston, MA 02120 USA
[2] Harvard Med Sch, 1 Brigham Circle, Boston, MA 02120 USA
基金
美国国家卫生研究院;
关键词
high-dimensional data; confounding (epidemiology); health care databases; real-world data; confounding adjustment; propensity scores; automation; causal conclusions; artificial intelligence; machine learning; DIMENSIONAL PROPENSITY SCORE; DISEASE RISK SCORES; MARGINAL STRUCTURAL MODELS; CONFOUNDING ADJUSTMENT; VARIABLE SELECTION; COMPARATIVE SAFETY; GASTROINTESTINAL TOXICITY; LOGISTIC-REGRESSION; MATCHED COHORT; DRUG;
D O I
10.2147/CLEP.S166545
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Decision makers in health care increasingly rely on nonrandomized database analyses to assess the effectiveness, safety, and value of medical products. Health care data scientists use data-adaptive approaches that automatically optimize confounding control to study causal treatment effects. This article summarizes relevant experiences and extensions. Methods: The literature was reviewed on the uses of high-dimensional propensity score (HDPS) and related approaches for health care database analyses, including methodological articles on their performance and improvement. Articles were grouped into applications, comparative performance studies, and statistical simulation experiments. Results: The HDPS algorithm has been referenced frequently with a variety of clinical applications and data sources from around the world. The appeal of HDPS for database research rests in 1) its superior performance in situations of unobserved confounding through proxy adjustment, 2) its predictable efficiency in extracting confounding information from a given data source, 3) its ability to automate estimation of causal treatment effects to the extent achievable in a given data source, and 4) its independence of data source and coding system. Extensions of the HDPS approach have focused on improving variable selection when exposure is sparse, using free text information and time-varying confounding adjustment. Conclusion: Semiautomated and optimized confounding adjustment in health care database analyses has proven successful across a wide range of settings. Machine-learning extensions further automate its use in estimating causal treatment effects across a range of data scenarios.
引用
收藏
页码:771 / 788
页数:18
相关论文
共 120 条
[21]   The Potential of High-Dimensional Propensity Scores in Health Services Research: An Exemplary Study on the Quality of Care for Elective Percutaneous Coronary Interventions [J].
Enders, Dirk ;
Ohlmeier, Christoph ;
Garbe, Edeltraut .
HEALTH SERVICES RESEARCH, 2018, 53 (01) :197-213
[22]   Supplementary data collection with case-cohort analysis to address potential confounding in a cohort study of thromboembolism in oral contraceptive initiators matched on claims-based propensity scores [J].
Eng, P. Mona ;
Seeger, John D. ;
Loughlin, Jeanne ;
Clifford, C. Robin ;
Mentor, Sherry ;
Walker, Alexander M. .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2008, 17 (03) :297-305
[23]   A Multicenter Observational Study of Incretin-based Drugs and Heart Failure [J].
Filion, Kristian B. ;
Azoulay, Laurent ;
Platt, Robert W. ;
Dahl, Matthew ;
Dormuth, Colin R. ;
Clemens, Kristin K. ;
Hu, Nianping ;
Paterson, J. Michael ;
Targownik, Laura ;
Turin, Tanvir C. ;
Udell, Jacob A. ;
Ernst, Pierre .
NEW ENGLAND JOURNAL OF MEDICINE, 2016, 374 (12) :1145-1154
[24]   Proton pump inhibitors and the risk of hospitalisation for community-acquired pneumonia: replicated cohort studies with meta-analysis [J].
Filion, Kristian B. ;
Chateau, Dan ;
Targownik, Laura E. ;
Gershon, Andrea ;
Durand, Madeleine ;
Tamim, Hala ;
Teare, Gary F. ;
Ravani, Pietro ;
Ernst, Pierre ;
Dormuth, Colin R. .
GUT, 2014, 63 (04) :552-558
[25]   When and How Can Real World Data Analyses Substitute for Randomized Controlled Trials? [J].
Franklin, Jessica M. ;
Schneeweiss, Sebastian .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2017, 102 (06) :924-933
[26]   Comparing the performance of propensity score methods in healthcare database studies with rare outcomes [J].
Franklin, Jessica M. ;
Eddings, Wesley ;
Austin, Peter C. ;
Stuart, Elizabeth A. ;
Schneeweiss, Sebastian .
STATISTICS IN MEDICINE, 2017, 36 (12) :1946-1963
[27]   Regularized Regression Versus the High-Dimensional Propensity Score for Confounding Adjustment in Secondary Database Analyses [J].
Franklin, Jessica M. ;
Eddings, Wesley ;
Glynn, Robert J. ;
Schneeweiss, Sebastian .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2015, 182 (07) :651-659
[28]   Metrics for covariate balance in cohort studies of causal effects [J].
Franklin, Jessica M. ;
Rassen, Jeremy A. ;
Ackermann, Diana ;
Bartels, Dorothee B. ;
Schneeweiss, Sebastian .
STATISTICS IN MEDICINE, 2014, 33 (10) :1685-1699
[29]   Plasmode simulation for the evaluation of pharmacoepidemiologic methods in complex healthcare databases [J].
Franklin, Jessica M. ;
Schneeweiss, Sebastian ;
Polinski, Jennifer M. ;
Rassen, Jeremy A. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 72 :219-226
[30]   Toward a science of learning systems: a research agenda for the high-functioning Learning Health System [J].
Friedman, Charles ;
Rubin, Joshua ;
Brown, Jeffrey ;
Buntin, Melinda ;
Corn, Milton ;
Etheredge, Lynn ;
Gunter, Carl ;
Musen, Mark ;
Platt, Richard ;
Stead, William ;
Sullivan, Kevin ;
Van Houweling, Douglas .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2015, 22 (01) :43-50