Multiple Self-Controlled Case Series for Large-Scale Longitudinal Observational Databases

被引:26
作者
Simpson, Shawn E. [1 ]
Madigan, David [1 ]
Zorych, Ivan [1 ]
Schuemie, Martijn J. [2 ]
Ryan, Patrick B. [3 ]
Suchard, Marc A. [4 ,5 ,6 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Erasmus Univ, Dept Med Informat, Med Ctr Rotterdam, NL-3000 DR Rotterdam, Netherlands
[3] Johnson & Johnson Pharmaceut Res & Dev, Titusville, NJ USA
[4] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA 90095 USA
[5] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
[6] Univ Calif Los Angeles, UCLA Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Big Data; Conditional Poisson regression; Cyclic coordinate descent; Drug safety; Postmarketing surveillance; Regularized regression; Self-controlled case series; Statistical computing; REGRESSION; SAFETY; RISK; MODEL; DEPENDENCE; SELECTION; VACCINE; DESIGN;
D O I
10.1111/biom.12078
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Characterization of relationships between time-varying drug exposures and adverse events (AEs) related to health outcomes represents the primary objective in postmarketing drug safety surveillance. Such surveillance increasingly utilizes large-scale longitudinal observational databases (LODs), containing time-stamped patient-level medical information including periods of drug exposure and dates of diagnoses for millions of patients. Statistical methods for LODs must confront computational challenges related to the scale of the data, and must also address confounding and other biases that can undermine efforts to estimate effect sizes. Methods that compare on-drug with off-drug periods within patient offer specific advantages over between patient analysis on both counts. To accomplish these aims, we extend the self-controlled case series (SCCS) for LODs. SCCS implicitly controls for fixed multiplicative baseline covariates since each individual acts as their own control. In addition, only exposed cases are required for the analysis, which is computationally advantageous. The standard SCCS approach is usually used to assess single drugs and therefore estimates marginal associations between individual drugs and particular AEs. Such analyses ignore confounding drugs and interactions and have the potential to give misleading results. In order to avoid these difficulties, we propose a regularized multiple SCCS approach that incorporates potentially thousands or more of time-varying confounders such as other drugs. The approach successfully handles the high dimensionality and can provide a sparse solution via an L1 regularizer. We present details of the model and the associated optimization procedure, as well as results of empirical investigations.
引用
收藏
页码:893 / 902
页数:10
相关论文
共 29 条
[1]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[2]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[3]   Exposure to antipsychotics and risk of stroke: self controlled case series study [J].
Douglas, Ian J. ;
Smeeth, Liam .
BRITISH MEDICAL JOURNAL, 2008, 337 (7670) :616-618
[4]   Semiparametric analysis of case series data [J].
Farrington, C. P. ;
Whitaker, H. J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2006, 55 :553-580
[5]  
Farrington CP, 2010, J R STAT SOC C-APPL, V59, P457
[6]   RELATIVE INCIDENCE ESTIMATION FROM CASE SERIES FOR VACCINE SAFETY EVALUATION [J].
FARRINGTON, CP .
BIOMETRICS, 1995, 51 (01) :228-235
[7]  
Fram D.M., 2003, Proceedings of the ninth acm sigkdd international conference on knowledge discovery and data mining, P359, DOI DOI 10.1145/956750.956792
[8]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[9]   Large-scale Bayesian logistic regression for text categorization [J].
Genkin, Alexander ;
Lewis, David D. ;
Madigan, David .
TECHNOMETRICS, 2007, 49 (03) :291-304
[10]   Post-marketing assessment of the safety of strontium ranelate; a novel case-only approach to the early detection of adverse drug reactions [J].
Grosso, Anthony ;
Douglas, Ian ;
Hingorani, Aroon ;
MacAllister, Raymond ;
Smeeth, Liam .
BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2008, 66 (05) :689-694