Identifying Drugs Inducing Prematurity by Mining Claims Data with High-Dimensional Confounder Score Strategies

被引:3
|
作者
Demailly, Romain [1 ,2 ]
Escolano, Sylvie [1 ]
Haramburu, Francoise [3 ]
Tubert-Bitter, Pascale [1 ]
Ahmed, Ismail [1 ]
机构
[1] Univ Paris Sud, Univ Paris Saclay, High Dimens Biostat Drug Safety & Genom, UVSQ,Inserm,CESP, Villejuif, France
[2] Lille Catholic Univ, Lille Catholic Hosp, Obstet Dept, Lille, France
[3] Univ Bordeaux, Ctr Pharmacovigilance, CHU Bordeaux, UMR 1219, Bordeaux, France
关键词
HEALTH-CARE RECORDS; FOR-GESTATIONAL-AGE; DISEASE RISK SCORES; PRETERM BIRTH; PREGNANCY OUTCOMES; SUPER LEARNER; EXPOSURE; SAFETY; WOMEN; PERFORMANCE;
D O I
10.1007/s40264-020-00916-5
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background Pregnant women are largely exposed to medications. However, knowledge is lacking about their effects on pregnancy and the fetus. Objective This study sought to evaluate the potential of high-dimensional propensity scores and high-dimensional disease risk scores for automated signal detection in pregnant women from medico-administrative databases in the context of drug-induced prematurity. Methods We used healthcare claims and hospitalization discharges of a 1/97th representative sample of the French population. We tested the association between prematurity and drug exposure during the trimester before delivery, for all drugs prescribed to at least five pregnancies. We compared different strategies (1) for building the two scores, including two machine-learning methods and (2) to account for these scores in the final logistic regression models: adjustment, weighting, and matching. We also proposed a new signal detection criterion derived from these scores: the p value relative decrease. Evaluation was performed by assessing the relevance of the signals using a literature review and clinical expertise. Results Screening 400 drugs from a cohort of 57,407 pregnancies, we observed that choosing between the two machine-learning methods had little impact on the generated signals. Score adjustment performed better than weighting and matching. Using the p value relative decrease efficiently filtered out spurious signals while maintaining a number of relevant signals similar to score adjustment. Most of the relevant signals belonged to the psychotropic class with benzodiazepines, antidepressants, and antipsychotics. Conclusions Mining complex healthcare databases with statistical methods from the high-dimensional inference field may improve signal detection in pregnant women.
引用
收藏
页码:549 / 559
页数:11
相关论文
共 50 条
  • [1] Identifying Drugs Inducing Prematurity by Mining Claims Data with High-Dimensional Confounder Score Strategies
    Romain Demailly
    Sylvie Escolano
    Françoise Haramburu
    Pascale Tubert-Bitter
    Ismaïl Ahmed
    Drug Safety, 2020, 43 : 549 - 559
  • [2] Mining high-dimensional administrative claims data to predict early hospital readmissions
    He, Danning
    Mathews, Simon C.
    Kalloo, Anthony N.
    Hutfless, Susan
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) : 272 - 279
  • [3] On the role of marginal confounder prevalence - implications for the high-dimensional propensity score algorithm
    Schuster, Tibor
    Pang, Menglan
    Platt, Robert W.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2015, 24 (09) : 1004 - 1007
  • [4] Visualization and data mining of high-dimensional data
    Inselberg, A
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 60 (1-2) : 147 - 159
  • [5] Data Mining for High-Dimensional Measurement Systems
    Mikut, Ralf
    TM-TECHNISCHES MESSEN, 2010, 77 (10) : 524 - 529
  • [6] Implementing high-dimensional propensity score principles to improve confounder adjustment inUKelectronic health records
    Tazare, John
    Smeeth, Liam
    Evans, Stephen J. W.
    Williamson, Elizabeth
    Douglas, Ian J.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 (11) : 1373 - 1381
  • [7] High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data
    Schneeweiss, Sebastian
    Rassen, Jeremy A.
    Glynn, Robert J.
    Avorn, Jerry
    Mogun, Helen
    Brookhart, M. Alan
    EPIDEMIOLOGY, 2009, 20 (04) : 512 - 522
  • [8] An efficient clustering method of data mining for high-dimensional data
    Chang, JW
    Kang, HM
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 273 - 278
  • [9] Identifying a Minimal Class of Models for High-dimensional Data
    Nevo, Daniel
    Ritov, Ya'acov
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [10] A COMPARISON OF HIGH-DIMENSIONAL PROPENSITY SCORE AND TRADITIONAL PROPENSITY SCORE MATCHING METHODS USING COMMERCIAL HEALTH CARE CLAIMS DATA
    Faccone, J.
    Wang, Y.
    VALUE IN HEALTH, 2019, 22 : S317 - S318