Using expression quantitative trait loci data and graph-embedded neural networks to uncover genotype-phenotype interactions

Cited by: 1
Authors
Guo, Xinpeng [1, 2]
Han, Jinyu [3]
Song, Yafei [2]
Yin, Zhilei [1]
Liu, Shuaichen [4]
Shang, Xuequn [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Xian, Peoples R China
[2] Air Force Engn Univ, Sch Air & Missile Def, Xian, Peoples R China
[3] Changan Univ, Sch Econ & Management, Xian, Peoples R China
[4] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
eQTL; expression quantitative trait loci; graph-embedded deep neural network; genotype-phenotype; SNP; gene; INTEGRATION; GWAS;
DOI
10.3389/fgene.2022.921775
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Motivation: A central goal of current biology is to establish a complete functional link between the genotype and phenotype, the so-called genotype-phenotype map. With the continuous development of high-throughput technology and the decline in sequencing costs, multi-omics analysis has become more widely employed. While this gives us new opportunities to uncover the correlation mechanisms between single-nucleotide polymorphisms (SNPs), genes, and phenotypes, multi-omics still faces certain challenges, specifically: 1) when the sample size is large enough, the number of omics types is often not large enough to meet the requirements of multi-omics analysis; 2) each omics' internal correlations are often unclear, such as the correlation between genes in genomics; 3) when analyzing a large number of traits (p), the sample size (n) is often much smaller than p (n << p), which hinders the application of machine learning methods in the classification of disease outcomes.
Results: To solve these issues with multi-omics and build a robust classification model, we propose a graph-embedded deep neural network (G-EDNN) based on expression quantitative trait loci (eQTL) data, which achieves sparse connectivity between network layers to prevent overfitting. The correlation within each omics is also considered so that the model more closely resembles biological reality. To verify the capabilities of this method, we conducted experimental analysis using the GSE28127 and GSE95496 data sets from the Gene Expression Omnibus (GEO) database, tested various neural network architectures, and used prior data for feature selection and graph embedding. The results show that the proposed method achieves high classification accuracy and easy-to-interpret feature selection. This method represents an extended application of genotype-phenotype association analysis in deep learning networks.
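The abstract does not give implementation details, so the following is only a minimal sketch of what sparse, graph-guided connectivity between layers could look like: a binary mask built from SNP-gene pairs zeroes out every weight that does not correspond to a known link. The function names (`build_mask`, `masked_layer`), the toy dimensions, and the `eqtl_pairs` list are all hypothetical and are not taken from the paper.

```python
import numpy as np


def build_mask(n_snps, n_genes, eqtl_pairs):
    """Binary (n_snps x n_genes) mask: 1 where a SNP-gene link is assumed."""
    mask = np.zeros((n_snps, n_genes))
    for snp_idx, gene_idx in eqtl_pairs:
        mask[snp_idx, gene_idx] = 1.0
    return mask


def masked_layer(x, weights, mask, bias):
    """Forward pass of a sparsely connected layer: masked weights, then ReLU."""
    return np.maximum(0.0, x @ (weights * mask) + bias)


rng = np.random.default_rng(0)
n_snps, n_genes = 4, 3

# Hypothetical SNP -> gene links (illustrative only, not real eQTL data).
eqtl_pairs = [(0, 0), (1, 0), (2, 1), (3, 2)]

mask = build_mask(n_snps, n_genes, eqtl_pairs)
weights = rng.normal(size=(n_snps, n_genes))
bias = np.zeros(n_genes)

x = rng.normal(size=(2, n_snps))  # two toy samples, one column per SNP
hidden = masked_layer(x, weights, mask, bias)
```

With such a mask, only 4 of the 12 possible weights are active in this toy layer, which is the kind of parameter reduction that can help when n << p. Each hidden unit also depends only on its linked SNPs, which is one way a model of this type could keep feature selection interpretable.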
Pages: 10