Fast and optimal algorithm for case-control matching using registry data: application on the antibiotics use of colorectal cancer patients

被引:20
|
作者
Mamouris, Pavlos [1 ]
Nassiri, Vahid [2 ]
Molenberghs, Geert [3 ,4 ]
van den Akker, Marjan [1 ,5 ,6 ]
van der Meer, Joep [1 ]
Vaes, Bert [1 ]
机构
[1] Katholieke Univ Leuven, Dept Publ Hlth & Primary Care, Kapucijnenvoer 33,J Bldg, B-3000 Leuven, Belgium
[2] Open Analyt NV, Antwerp, Belgium
[3] Univ Leuven, KU Leuven, I BioStat, Leuven, Belgium
[4] Hasselt Univ, I BioStat, Diepenbeek, Belgium
[5] Maastricht Univ, Dept Family Med, Care & Publ Hlth Res Inst, Maastricht, Netherlands
[6] Goethe Univ, Inst Gen Practice, Frankfurt, Germany
关键词
Case-control; Optimal matching; Comorbidity index; Colorectal cancer; GENERAL-PRACTICE; RISK; COHORT; BIAS;
D O I
10.1186/s12874-021-01256-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background In case-control studies most algorithms allow the controls to be sampled several times, which is not always optimal. If many controls are available and adjustment for several covariates is necessary, matching without replacement might increase statistical efficiency. Comparing similar units when having observational data is of utter importance, since confounding and selection bias is present. The aim was twofold, firstly to create a method that accommodates the option that a control is not resampled, and second, to display several scenarios that identify changes of Odds Ratios (ORs) while increasing the balance of the matched sample. Methods The algorithm was derived in an iterative way starting from the pre-processing steps to derive the data until its application in a study to investigate the risk of antibiotics on colorectal cancer in the INTEGO registry (Flanders, Belgium). Different scenarios were developed to investigate the fluctuation of ORs using the combination of exact and varying variables with or without replacement of controls. To achieve balance in the population, we introduced the Comorbidity Index (CI) variable, which is the sum of chronic diseases as a means to have comparable units for drawing valid associations. Results This algorithm is fast and optimal. We simulated data and demonstrated that the run-time of matching even with millions of patients is minimal. Optimal, since the closest controls is always captured (using the appropriate ordering and by creating some auxiliary variables), and in the scenario that a case has only one control, we assure that this control will be matched to this case, thus maximizing the cases to be used in the analysis. In total, 72 different scenarios were displayed indicating the fluctuation of ORs, and revealing patterns, especially a drop when balancing the population. Conclusions We created an optimal and computationally efficient algorithm to derive a matched case-control sample with and without replacement of controls. The code and the functions are publicly available as an open source in an R package. Finally, we emphasize the importance of displaying several scenarios and assess the difference of ORs while using an index to balance population in observational data.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Fast and optimal algorithm for case-control matching using registry data: application on the antibiotics use of colorectal cancer patients
    Pavlos Mamouris
    Vahid Nassiri
    Geert Molenberghs
    Marjan van den Akker
    Joep van der Meer
    Bert Vaes
    BMC Medical Research Methodology, 21
  • [2] Frequent Use of Antibiotics Is Associated with Colorectal Cancer Risk: Results of a Nested Case-Control Study
    Dik, Vincent K.
    van Oijen, Martijn G. H.
    Smeets, Hugo M.
    Siersema, Peter D.
    DIGESTIVE DISEASES AND SCIENCES, 2016, 61 (01) : 255 - 264
  • [3] Beta blocker use and colorectal cancer risk Population-based case-control study
    Jansen, Lina
    Below, Janina
    Chang-Claude, Jenny
    Brenner, Hermann
    Hoffmeister, Michael
    CANCER, 2012, 118 (16) : 3911 - 3919
  • [4] Emulating a target trial in case-control designs: an application to statins and colorectal cancer
    Dickerman, Barbra A.
    Garcia-Albeniz, Xabier
    Logan, Roger W.
    Denaxas, Spiros
    Hernan, Miguel A.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (05) : 1637 - 1646
  • [5] Identification of patients with non-metastatic colorectal cancer in primary care: a case-control study
    Ewing, Marcela
    Naredi, Peter
    Zhang, Chenyang
    Mansson, Jorgen
    BRITISH JOURNAL OF GENERAL PRACTICE, 2016, 66 (653) : E880 - E886
  • [6] On the equivalence of posterior inference based on retrospective and prospective likelihoods: application to a case-control study of colorectal cancer
    Ghosh, M.
    Song, J.
    Forster, J. J.
    Mitra, R.
    Mukherjee, B.
    STATISTICS IN MEDICINE, 2012, 31 (20) : 2196 - 2208
  • [7] Statin use and risk of endometrial cancer: a nationwide registry-based case-control study
    Sperling, Cecilie D.
    Verdoodt, Freija
    Friis, Soren
    Dehlendorff, Christian
    Kjaer, Susanne K.
    ACTA OBSTETRICIA ET GYNECOLOGICA SCANDINAVICA, 2017, 96 (02) : 144 - 149
  • [8] Examining the association between cigarette smoking and colorectal cancer using historical case-control data
    Peppone, Luke J.
    Hyland, Andrew
    Moysich, Kirsten B.
    Reid, Mary E.
    Piazza, Kenneth M.
    Purnell, Jason Q.
    Mustian, Karen M.
    Morrow, Gary R.
    CANCER EPIDEMIOLOGY, 2009, 33 (3-4) : 182 - 188
  • [9] Association between colorectal cancer and zolpidem use in a case-control study
    Lai, Shih-Wei
    Lin, Cheng-Li
    Liao, Kuan-Fu
    MEDICINE, 2019, 98 (48)
  • [10] Antidepressant use and colorectal cancer risk: a Danish population-based case-control study
    Cronin-Fenton, D. P.
    Riis, A. H.
    Lash, T. L.
    Dalton, S. O.
    Friis, S.
    Robertson, D.
    Sorensen, H. T.
    BRITISH JOURNAL OF CANCER, 2011, 104 (01) : 188 - 192