A genetic algorithm for simulating correlated binary data from biomedical research

被引:8
|
作者
Kruppa, Jochen [1 ]
Lepenies, Bernd [2 ,3 ]
Jung, Klaus [1 ,3 ]
机构
[1] Univ Vet Med Hannover, Inst Anim Breeding & Genet, Bunteweg 17p, D-30559 Hannover, Germany
[2] Univ Vet Med Hannover, Immunol Unit, Hannover, Germany
[3] Univ Vet Med Hannover, Res Ctr Emerging Infect & Zoonoses RIZ, Hannover, Germany
关键词
Correlated binary data; Genetic algorithm; High-dimensional data; Random number generation; Computer simulation; DISTRIBUTIONS; ASSOCIATION; VARIABLES; MODELS;
D O I
10.1016/j.compbiomed.2017.10.023
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Correlated binary data arise in a large variety of biomedical research. In order to evaluate methods for their analysis, computer simulations of such data are often required. Existing methods can often not cover the full range of possible correlations between the variables or are not available as implemented software. We propose a genetic algorithm that approaches the desired correlation structure under a given marginal distribution. The procedure generates a large representative matrix from which the probabilities of individual observations can be derived or from which samples can be drawn directly. Our genetic algorithm is evaluated under different specified marginal frequencies and correlation structures, and is compared against two existing approaches. The evaluation checks the speed and precision of the approach as well as its suitability for generating also high-dimensional data. In an example of high-throughput glycan array data, we demonstrate the usability of our approach to simulate the power of global test procedures. An implementation of our own and two other methods were added to the R package `RepeatedHighDim'. The presented algorithm is not restricted to certain correlation structures. In contrast to existing methods it is also evaluated for high-dimensional data.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [41] GENETIC ALGORITHM FOR BINARY AND FUNCTIONAL DECISION DIAGRAMS OPTIMIZATION
    Stojkovic, Suzana
    Velickovic, Darko
    Moraga, Claudio
    FACTA UNIVERSITATIS-SERIES ELECTRONICS AND ENERGETICS, 2018, 31 (02) : 169 - 187
  • [42] An Empirical Study of Univariate and Genetic Algorithm-Based Feature Selection in Binary Classification with Microarray Data
    Lecocke, Michael
    Hess, Kenneth
    CANCER INFORMATICS, 2006, 2 : 313 - 327
  • [43] Research on Structure Based on the Adaptive a sort of the Amending Genetic Algorithm
    Zhang, Kehong
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1787 - 1791
  • [44] Improved genetic algorithm for optimization of binary phase holograms
    Nguyen, TA
    An, JW
    Choi, JK
    Kim, N
    EMERGING OPTOELECTRONIC APPLICATIONS, 2004, 5363 : 183 - 191
  • [45] Image Encryption Using Genetic Algorithm and Binary Patterns
    Afarin, Roza
    Mozaffari, Saeed
    2013 10TH INTERNATIONAL ISC CONFERENCE ON INFORMATION SECURITY AND CRYPTOLOGY (ISCISC), 2013,
  • [46] Research on Routing Selection Algorithm Based on Genetic Algorithm
    Gao, Guohong
    Zhang, Baojian
    Li, Xueyong
    Lv, Jinna
    INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT II, 2011, 135 : 353 - 358
  • [47] Research on Genetic Algorithm and Data Information based on Combined Framework for Nonlinear Functions Optimization
    Ji, Zhigang
    Li, Zhenyu
    Ji, Zhiqiang
    PEEA 2011, 2011, 23
  • [48] Shrinkage estimation analysis of correlated binary data with a diverging number of parameters
    XU PeiRong
    FU WenJiang
    ZHU LiXing
    Science China(Mathematics), 2013, 56 (02) : 359 - 377
  • [49] Research on Anycast Routing Algorithm Based on Genetic Algorithm
    Chun, Zhu
    Min, Jin
    CISST'09: PROCEEDINGS OF THE 3RD WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, SIGNAL AND TELECOMMUNICATIONS, 2009, : 135 - 139
  • [50] Shrinkage estimation analysis of correlated binary data with a diverging number of parameters
    Xu PeiRong
    Fu WenJiang
    Zhu LiXing
    SCIENCE CHINA-MATHEMATICS, 2013, 56 (02) : 359 - 377