Spatial rank-based multifactor dimensionality reduction to detect gene-gene interactions for multivariate phenotypes

被引:3
作者
Park, Mira [1 ]
Jeong, Hoe-Bin [2 ]
Lee, Jong-Hyun [2 ]
Park, Taesung [3 ]
机构
[1] Eulji Univ, Dept Prevent Med, Daejeon 34824, South Korea
[2] Korea Univ, Dept Stat, Seoul 02841, South Korea
[3] Seoul Natl Univ, Dept Stat, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Fuzzy clustering; Gene-gene interaction; Multifactor dimensionality reduction; Spatial rank statistic; GENOME-WIDE ASSOCIATION; DATA DEPTH; ENVIRONMENT INTERACTIONS; TESTS; STATISTICS; EPISTASIS;
D O I
10.1186/s12859-021-04395-y
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Identifying interaction effects between genes is one of the main tasks of genome-wide association studies aiming to shed light on the biological mechanisms underlying complex diseases. Multifactor dimensionality reduction (MDR) is a popular approach for detecting gene-gene interactions that has been extended in various forms to handle binary and continuous phenotypes. However, only few multivariate MDR methods are available for multiple related phenotypes. Current approaches use Hotelling's T-2 statistic to evaluate interaction models, but it is well known that Hotelling's T-2 statistic is highly sensitive to heavily skewed distributions and outliers. Results We propose a robust approach based on nonparametric statistics such as spatial signs and ranks. The new multivariate rank-based MDR (MR-MDR) is mainly suitable for analyzing multiple continuous phenotypes and is less sensitive to skewed distributions and outliers. MR-MDR utilizes fuzzy k-means clustering and classifies multi-locus genotypes into two groups. Then, MR-MDR calculates a spatial rank-sum statistic as an evaluation measure and selects the best interaction model with the largest statistic. Our novel idea lies in adopting nonparametric statistics as an evaluation measure for robust inference. We adopt tenfold cross-validation to avoid overfitting. Intensive simulation studies were conducted to compare the performance of MR-MDR with current methods. Application of MR-MDR to a real dataset from a Korean genome-wide association study demonstrated that it successfully identified genetic interactions associated with four phenotypes related to kidney function. The R code for conducting MR-MDR is available at . Conclusions Intensive simulation studies comparing MR-MDR with several current methods showed that the performance of MR-MDR was outstanding for skewed distributions. Additionally, for symmetric distributions, MR-MDR showed comparable power. Therefore, we conclude that MR-MDR is a useful multivariate non-parametric approach that can be used regardless of the phenotype distribution, the correlations between phenotypes, and sample size.
引用
收藏
页数:21
相关论文
共 50 条
[21]   Multiobjective differential evolution-based multifactor dimensionality reduction for detecting gene-gene interactions [J].
Yang, Cheng-Hong ;
Chuang, Li-Yeh ;
Lin, Yu-Da .
SCIENTIFIC REPORTS, 2017, 7
[22]   An efficiency analysis of high-order combinations of gene-gene interactions using multifactor-dimensionality reduction [J].
Yang, Cheng-Hong ;
Lin, Yu-Da ;
Yang, Cheng-San ;
Chuang, Li-Yeh .
BMC GENOMICS, 2015, 16
[23]   Identifying gene-gene interactions that are highly associated with Body Mass Index using Quantitative Multifactor Dimensionality Reduction (QMDR) [J].
Rishika De ;
Shefali S. Verma ;
Fotios Drenos ;
Emily R. Holzinger ;
Michael V. Holmes ;
Molly A. Hall ;
David R. Crosslin ;
David S. Carrell ;
Hakon Hakonarson ;
Gail Jarvik ;
Eric Larson ;
Jennifer A. Pacheco ;
Laura J. Rasmussen-Torvik ;
Carrie B. Moore ;
Folkert W. Asselbergs ;
Jason H. Moore ;
Marylyn D. Ritchie ;
Brendan J. Keating ;
Diane Gilbert-Diamond .
BioData Mining, 8
[24]   Gene-gene interaction analysis for quantitative trait using cluster-based multifactor dimensionality reduction method [J].
Lee, Youjung ;
Kim, Hyein ;
Park, Taesung ;
Park, Mira .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (01) :1-11
[25]   Multiple-Criteria Decision Analysis-Based Multifactor Dimensionality Reduction for Detecting Gene-Gene Interactions [J].
Yang, Cheng-Hong ;
Lin, Yu-Da ;
Chuang, Li-Yeh .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (01) :416-426
[26]   A novel survival multifactor dimensionality reduction method for detecting gene-gene interactions with application to bladder cancer prognosis [J].
Gui, Jiang ;
Moore, Jason H. ;
Kelsey, Karl T. ;
Marsit, Carmen J. ;
Karagas, Margaret R. ;
Andrew, Angeline S. .
HUMAN GENETICS, 2011, 129 (01) :101-110
[27]   Multifactor dimensionality reduction: An analysis strategy for modelling and detecting gene-gene interactions in human genetics and pharmacogenomics studies [J].
Motsinger A.A. ;
Ritchie M.D. .
Human Genomics, 2 (5) :318-328
[28]   New evaluation measures for multifactor dimensionality reduction classifiers in gene-gene interaction analysis [J].
Namkung, Junghyun ;
Kim, Kyunga ;
Yi, Sungon ;
Chung, Wonil ;
Kwon, Min-Seok ;
Park, Taesung .
BIOINFORMATICS, 2009, 25 (03) :338-345
[29]   Cluster-based multifactor dimensionality reduction method to identify gene-gene interactions for quantitative traits in genome-wide studies [J].
Lee, Yoojung ;
Kim, Hyein ;
Park, Taesung ;
Park, Mira .
2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, :1772-1776
[30]   A Simple and Computationally Efficient Approach to Multifactor Dimensionality Reduction Analysis of Gene-Gene Interactions for Quantitative Traits [J].
Gui, Jiang ;
Moore, Jason H. ;
Williams, Scott M. ;
Andrews, Peter ;
Hillege, Hans L. ;
van der Harst, Pim ;
Navis, Gerjan ;
Van Gilst, Wiek H. ;
Asselbergs, Folkert W. ;
Gilbert-Diamond, Diane .
PLOS ONE, 2013, 8 (06)