A genetic algorithm for simulating correlated binary data from biomedical research

被引:8
|
作者
Kruppa, Jochen [1 ]
Lepenies, Bernd [2 ,3 ]
Jung, Klaus [1 ,3 ]
机构
[1] Univ Vet Med Hannover, Inst Anim Breeding & Genet, Bunteweg 17p, D-30559 Hannover, Germany
[2] Univ Vet Med Hannover, Immunol Unit, Hannover, Germany
[3] Univ Vet Med Hannover, Res Ctr Emerging Infect & Zoonoses RIZ, Hannover, Germany
关键词
Correlated binary data; Genetic algorithm; High-dimensional data; Random number generation; Computer simulation; DISTRIBUTIONS; ASSOCIATION; VARIABLES; MODELS;
D O I
10.1016/j.compbiomed.2017.10.023
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated binary data arise in a large variety of biomedical research. In order to evaluate methods for their analysis, computer simulations of such data are often required. Existing methods can often not cover the full range of possible correlations between the variables or are not available as implemented software. We propose a genetic algorithm that approaches the desired correlation structure under a given marginal distribution. The procedure generates a large representative matrix from which the probabilities of individual observations can be derived or from which samples can be drawn directly. Our genetic algorithm is evaluated under different specified marginal frequencies and correlation structures, and is compared against two existing approaches. The evaluation checks the speed and precision of the approach as well as its suitability for generating also high-dimensional data. In an example of high-throughput glycan array data, we demonstrate the usability of our approach to simulate the power of global test procedures. An implementation of our own and two other methods were added to the R package `RepeatedHighDim'. The presented algorithm is not restricted to certain correlation structures. In contrast to existing methods it is also evaluated for high-dimensional data.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [31] Goodness-of-fit tests for correlated paired binary data
    Tang, Man-Lai
    Pei, Yan-Bo
    Wong, Weng-Kee
    Li, Jia-Liang
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2012, 21 (04) : 331 - 345
  • [32] Wave steepness retrieved from scatterometer data in a genetic algorithm
    过杰
    何宜军
    Journal of Oceanology and Limnology, 2012, (02) : 336 - 341
  • [33] Wave steepness retrieved from scatterometer data in a genetic algorithm
    Guo Jie
    He Yijun
    CHINESE JOURNAL OF OCEANOLOGY AND LIMNOLOGY, 2012, 30 (02) : 336 - 341
  • [34] Wave steepness retrieved from scatterometer data in a genetic algorithm
    Jie Guo
    Yijun He
    Chinese Journal of Oceanology and Limnology, 2012, 30 : 336 - 341
  • [35] A Binary Morphology-Based Clustering Algorithm Directed by Genetic Algorithm
    Pedrino, E. C.
    Nicoletti, M. C.
    Saito, J. H.
    Cura, L. M. V.
    Roda, V. O.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 409 - 414
  • [36] A New Genetic Algorithm Based Technique for Biomedical Image Enhancement
    Khatkar, Kirti
    Kumar, Dinesh
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2018, 26 (04): : 1725 - 1749
  • [37] MOLECULAR RECOGNITION USING A BINARY GENETIC SEARCH ALGORITHM
    PAYNE, AWR
    GLEN, RC
    JOURNAL OF MOLECULAR GRAPHICS, 1993, 11 (02): : 74 - &
  • [38] Design of binary rotation invariant filters with genetic algorithm
    Singher, L
    Ersoy, OK
    Miles, GE
    ALGORITHMS, DEVICES, AND SYSTEMS FOR OPTICAL INFORMATION PROCESSING, 1998, 3466 : 74 - 79
  • [39] A Genetic Algorithm for Constructing Compact Binary Decision Trees
    Cha, Sung-Hyuk
    Tappert, Charles
    JOURNAL OF PATTERN RECOGNITION RESEARCH, 2009, 4 (01): : 1 - 13
  • [40] A Processor for Genetic Algorithm based on Redundant Binary Number
    Aoshima, Masanao
    Kanasugi, Akinori
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 581 - 586