Fast and optimal algorithm for case-control matching using registry data: application on the antibiotics use of colorectal cancer patients

Cited by: 20
Authors
Mamouris, Pavlos [1]
Nassiri, Vahid [2]
Molenberghs, Geert [3,4]
van den Akker, Marjan [1,5,6]
van der Meer, Joep [1]
Vaes, Bert [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Publ Hlth & Primary Care, Kapucijnenvoer 33,J Bldg, B-3000 Leuven, Belgium
[2] Open Analyt NV, Antwerp, Belgium
[3] Univ Leuven, KU Leuven, I BioStat, Leuven, Belgium
[4] Hasselt Univ, I BioStat, Diepenbeek, Belgium
[5] Maastricht Univ, Dept Family Med, Care & Publ Hlth Res Inst, Maastricht, Netherlands
[6] Goethe Univ, Inst Gen Practice, Frankfurt, Germany
Keywords
Case-control; Optimal matching; Comorbidity index; Colorectal cancer; GENERAL-PRACTICE; RISK; COHORT; BIAS
DOI
10.1186/s12874-021-01256-3
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Background In case-control studies, most algorithms allow controls to be sampled several times, which is not always optimal. If many controls are available and adjustment for several covariates is necessary, matching without replacement might increase statistical efficiency. Comparing similar units is of utmost importance in observational data, since confounding and selection bias are present. The aim was twofold: first, to create a method that accommodates the option of not resampling a control, and second, to display several scenarios that identify changes in odds ratios (ORs) as the balance of the matched sample increases.
Methods The algorithm was derived iteratively, from the pre-processing steps that prepare the data through to its application in a study investigating the association between antibiotic use and the risk of colorectal cancer in the INTEGO registry (Flanders, Belgium). Different scenarios were developed to investigate the fluctuation of ORs using combinations of exact and varying variables, with or without replacement of controls. To achieve balance in the population, we introduced the Comorbidity Index (CI) variable, the sum of chronic diseases, as a means of obtaining comparable units for drawing valid associations.
Results The algorithm is fast and optimal. It is fast because, on simulated data, the run-time of matching is minimal even with millions of patients. It is optimal because the closest control is always captured (using the appropriate ordering and some auxiliary variables), and when a case has only one control, we ensure that this control is matched to that case, thus maximizing the number of cases used in the analysis. In total, 72 different scenarios were displayed, indicating the fluctuation of ORs and revealing patterns, especially a drop when balancing the population.
Conclusions We created an optimal and computationally efficient algorithm to derive a matched case-control sample with and without replacement of controls. The code and the functions are publicly available as open source in an R package. Finally, we emphasize the importance of displaying several scenarios and assessing the difference in ORs while using an index to balance the population in observational data.
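
The matching idea described in the abstract (exact variables, a varying variable, optional reuse of controls, and priority for cases with a single eligible control) can be illustrated with a small sketch. This is not the authors' R implementation: it is a simple greedy approximation in Python, and the function name `match_cases`, the `caliper` parameter, the record fields, and the toy data are all hypothetical choices made for illustration.

```python
def match_cases(cases, controls, exact_keys, vary_key, caliper, replace=False):
    """Greedy 1:1 matching sketch (illustrative only, not the published algorithm).

    A control is eligible for a case when it agrees on every exact variable
    and its varying variable lies within `caliper` of the case's value.
    """

    def eligible(case):
        return [
            c for c in controls
            if all(c[k] == case[k] for k in exact_keys)
            and abs(c[vary_key] - case[vary_key]) <= caliper
        ]

    # Process the cases with the fewest eligible controls first, so a case
    # with a single candidate is not left unmatched by an earlier assignment.
    ordered = sorted(cases, key=lambda case: len(eligible(case)))

    used = set()   # ids of controls already assigned (matching without replacement)
    matches = {}   # case id -> matched control id
    for case in ordered:
        candidates = [
            c for c in eligible(case) if replace or c["id"] not in used
        ]
        if not candidates:
            continue  # this case stays unmatched
        # Pick the closest control on the varying variable; ties broken by id.
        best = min(
            candidates,
            key=lambda c: (abs(c[vary_key] - case[vary_key]), c["id"]),
        )
        matches[case["id"]] = best["id"]
        used.add(best["id"])
    return matches


# Hypothetical toy data: sex is matched exactly, age within 2 years.
cases = [
    {"id": "A", "sex": "F", "age": 60},
    {"id": "B", "sex": "F", "age": 62},
]
controls = [
    {"id": "c1", "sex": "F", "age": 61},
    {"id": "c2", "sex": "F", "age": 58},
]

print(match_cases(cases, controls, ["sex"], "age", caliper=2))
# {'B': 'c1', 'A': 'c2'}
print(match_cases(cases, controls, ["sex"], "age", caliper=2, replace=True))
# {'B': 'c1', 'A': 'c1'}
```

In the toy data, case B has only one eligible control (c1). Handling B first keeps both cases in the matched sample; matching A first to its closest control would have consumed c1 and left B unmatched. A greedy pass like this does not guarantee a globally optimal assignment in general.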
Pages: 9