BLUPmrMLM: A Fast mrMLM Algorithm in Genome-wide Association Studies

被引:2
|
作者
Li, Hong-Fu [1 ]
Wang, Jing-Tian [1 ]
Zhao, Qiong [1 ]
Zhang, Yuan-Ming [1 ]
机构
[1] Huazhong Agr Univ, Coll Plant Sci & Technol, Wuhan 430070, Peoples R China
基金
中国国家自然科学基金;
关键词
Genome-wide association study; BLUP; Multilocus model; mrMLM; Large-scale dataset; MIXED-MODEL ANALYSIS; VARIABLE SELECTION; MISSING HERITABILITY; VARIANCE-COMPONENTS; EMPIRICAL BAYES; LIKELIHOOD; INTEGRATION; REGRESSION;
D O I
10.1093/gpbjnl/qzae020
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Multilocus genome-wide association study has become the state-of-the-art tool for dissecting the genetic architecture of complex and multiomic traits. However, most existing multilocus methods require relatively long computational time when analyzing large datasets. To address this issue, in this study, we proposed a fast mrMLM method, namely, best linear unbiased prediction multilocus random-SNP-effect mixed linear model (BLUPmrMLM). First, genome-wide single-marker scanning in mrMLM was replaced by vectorized Wald tests based on the best linear unbiased prediction (BLUP) values of marker effects and their variances in BLUPmrMLM. Then, adaptive best subset selection (ABESS) was used to identify potentially associated markers on each chromosome to reduce computational time when estimating marker effects via empirical Bayes. Finally, shared memory and parallel computing schemes were used to reduce the computational time. In simulation studies, BLUPmrMLM outperformed GEMMA, EMMAX, mrMLM, and FarmCPU as well as the control method (BLUPmrMLM with ABESS removed), in terms of computational time, power, accuracy for estimating quantitative trait nucleotide positions and effects, false positive rate, false discovery rate, false negative rate, and F1 score. In the reanalysis of two large rice datasets, BLUPmrMLM significantly reduced the computational time and identified more previously reported genes, compared with the aforementioned methods. This study provides an excellent multilocus model method for the analysis of large-scale and multiomic datasets. The software mrMLM v5.1 is available at BioCode (https://ngdc.cncb.ac.cn/biocode/tool/BT007388) or GitHub (https://github.com/YuanmingZhang65/mrMLM).
引用
收藏
页数:13
相关论文
共 50 条
  • [1] mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association Studies
    YaWen Zhang
    Cox Lwaka Tamba
    YangJun Wen
    Pei Li
    WenLong Ren
    YuanLi Ni
    Jun Gao
    YuanMing Zhang
    Genomics,Proteomics & Bioinformatics, 2020, (04) : 481 - 487
  • [2] mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association Studies
    Ya-Wen Zhang
    Cox Lwaka Tamba
    Yang-Jun Wen
    Pei Li
    Wen-Long Ren
    Yuan-Li Ni
    Jun Gao
    Yuan-Ming Zhang
    Genomics,Proteomics & Bioinformatics, 2020, 18 (04) : 481 - 487
  • [3] mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association Studies
    Zhang, Ya-Wen
    Tamba, Cox Lwaka
    Wen, Yang-Jun
    Li, Pei
    Ren, Wen-Long
    Ni, Yuan-Li
    Gao, Jun
    Zhang, Yuan-Ming
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2020, 18 (04) : 481 - 487
  • [4] A fast algorithm for Bayesian multi-locus model in genome-wide association studies
    Weiwei Duan
    Yang Zhao
    Yongyue Wei
    Sheng Yang
    Jianling Bai
    Sipeng Shen
    Mulong Du
    Lihong Huang
    Zhibin Hu
    Feng Chen
    Molecular Genetics and Genomics, 2017, 292 : 923 - 934
  • [5] A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES
    Li, Jiahan
    Zhong, Wei
    Li, Runze
    Wu, Rongling
    ANNALS OF APPLIED STATISTICS, 2014, 8 (04): : 2292 - 2318
  • [6] A fast algorithm for Bayesian multi-locus model in genome-wide association studies
    Duan, Weiwei
    Zhao, Yang
    Wei, Yongyue
    Yang, Sheng
    Bai, Jianling
    Shen, Sipeng
    Du, Mulong
    Huang, Lihong
    Hu, Zhibin
    Chen, Feng
    MOLECULAR GENETICS AND GENOMICS, 2017, 292 (04) : 923 - 934
  • [7] Fast pairwise IBD association testing in genome-wide association studies
    Han, Buhm
    Kang, Eun Yong
    Raychaudhuri, Soumya
    de Bakker, Paul I. W.
    Eskin, Eleazar
    BIOINFORMATICS, 2014, 30 (02) : 206 - 213
  • [8] FaST Linear Mixed Models for Genome-Wide Association Studies
    Lippert, Christoph
    Listgarten, Jennifer
    Liu, Ying
    Kadie, Carl M.
    Davidson, Robert I.
    Heckerman, David
    GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 149 - 149
  • [9] FaST linear mixed models for genome-wide association studies
    Christoph Lippert
    Jennifer Listgarten
    Ying Liu
    Carl M Kadie
    Robert I Davidson
    David Heckerman
    Nature Methods, 2011, 8 : 833 - 835
  • [10] FaST linear mixed models for genome-wide association studies
    Lippert, Christoph
    Listgarten, Jennifer
    Liu, Ying
    Kadie, Carl M.
    Davidson, Robert I.
    Heckerman, David
    NATURE METHODS, 2011, 8 (10) : 833 - U94