Co-evolution based machine-learning for predicting functional interactions between human genes

被引:15
|
作者
Stupp, Doron [1 ]
Sharon, Elad [1 ]
Bloch, Idit [1 ]
Zitnik, Marinka [2 ]
Zuk, Or [3 ]
Tabach, Yuval [1 ]
机构
[1] Hebrew Univ Jerusalem, Inst Med Res Israel Canada, Dept Dev Biol & Canc Res, IL-9112001 Jerusalem, Israel
[2] Harvard Univ, Dept Biomed Informat, Boston, MA 02115 USA
[3] Hebrew Univ Jerusalem, Dept Stat & Data Sci, IL-9190501 Jerusalem, Israel
基金
以色列科学基金会;
关键词
GENOME ANALYSIS; IDENTIFICATION; PROTEIN; VISUALIZATION; DISCOVERY; EXPANSION; COMPONENT; DATABASE; COMPLEX;
D O I
10.1038/s41467-021-26792-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Over the next decade, more than a million eukaryotic species are expected to be fully sequenced. This has the potential to improve our understanding of genotype and phenotype crosstalk, gene function and interactions, and answer evolutionary questions. Here, we develop a machine-learning approach for utilizing phylogenetic profiles across 1154 eukaryotic species. This method integrates co-evolution across eukaryotic clades to predict functional interactions between human genes and the context for these interactions. We benchmark our approach showing a 14% performance increase (auROC) compared to previous methods. Using this approach, we predict functional annotations for less studied genes. We focus on DNA repair and verify that 9 of the top 50 predicted genes have been identified elsewhere, with others previously prioritized by high-throughput screens. Overall, our approach enables better annotation of function and functional interactions and facilitates the understanding of evolutionary processes underlying co-evolution. The manuscript is accompanied by a webserver available at: https://mlpp.cs.huji.ac.il.
引用
收藏
页数:14
相关论文
共 33 条
  • [21] Screening of genes co-associated with osteoporosis and chronic HBV infection based on bioinformatics analysis and machine learning
    Yang, Jia
    Yang, Weiguang
    Hu, Yue
    Tong, Linjian
    Liu, Rui
    Liu, Lice
    Jiang, Bei
    Sun, Zhiming
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [22] Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
    Qi, Yanjun
    Tastan, Oznur
    Carbonell, Jaime G.
    Klein-Seetharaman, Judith
    Weston, Jason
    BIOINFORMATICS, 2010, 26 (18) : i645 - i652
  • [23] The Potential of Co-Evolution and Interactions of Gut Bacteria-Phages in Bamboo-Eating Pandas: Insights from Dietary Preference-Based Metagenomic Analysis
    Zhang, Mingyue
    Zhou, Yanan
    Cui, Xinyuan
    Zhu, Lifeng
    MICROORGANISMS, 2024, 12 (04)
  • [24] Machine learning-based prognostic model of lactylation-related genes for predicting prognosis and immune infiltration in patients with lung adenocarcinoma
    Gao, Mingjun
    Wang, Mengmeng
    Zhou, Siding
    Hou, Jiaqi
    He, Wenbo
    Shu, Yusheng
    Wang, Xiaolin
    CANCER CELL INTERNATIONAL, 2024, 24 (01)
  • [25] A Machine Learning Method for Differentiating and Predicting Human-Infective Coronavirus Based on Physicochemical Features and Composition of the Spike Protein
    Chao, Wang
    Quan, Zou
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (05) : 815 - 823
  • [26] Insight into the Relationships Between Chemical, Protein and Functional Variables in the PBP/GOBP Family in Moths Based on Machine Learning
    Lopez-Cortes, Xaviera A.
    Lara, Gabriel
    Fernandez, Nicolas
    Manriquez-Troncoso, Jose M.
    Venthur, Herbert
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2025, 26 (05)
  • [27] A machine learning approach for genome-wide prediction of morbid and druggable human genes based on systems-level data
    Costa, Pedro R.
    Acencio, Marcio L.
    Lemke, Ney
    BMC GENOMICS, 2010, 11
  • [28] GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome
    Li, Fuyi
    Li, Chen
    Wang, Mingjun
    Webb, Geoffrey I.
    Zhang, Yang
    Whisstock, James C.
    Song, Jiangning
    BIOINFORMATICS, 2015, 31 (09) : 1411 - 1419
  • [29] Recognition of 3′-end L1, Alu, processed pseudogenes, and mRNA stem-loops in the human genome using sequence-based and structure-based machine-learning models
    Shein, Alexander
    Zaikin, Anton
    Poptsova, Maria
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [30] Nonlinear relationship between urban form and transport CO2 emissions: Evidence from Chinese cities based on machine learning
    Li, Linna
    Deng, Zilin
    Huang, Xiaoyan
    JOURNAL OF GEOGRAPHICAL SCIENCES, 2024, 34 (08) : 1558 - 1588