Heterogeneous network embedding enabling accurate disease association predictions

被引:16
|
作者
Xiong, Yun [1 ,2 ]
Guo, Mengjie [1 ,2 ]
Ruan, Lu [1 ,2 ]
Kong, Xiangnan [3 ]
Tang, Chunlei [4 ]
Zhu, Yangyong [1 ,2 ]
Wang, Wei [5 ]
机构
[1] Fudan Univ, Shanghai Key Lab Data Sci, Sch Comp Sci, Shanghai, Peoples R China
[2] Fudan Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Worcester Polytech Inst, Dept Comp Sci, Worcester, MA 01609 USA
[4] Harvard Med Sch, Brigham & Womens Hosp, Boston, MA 02115 USA
[5] Univ Calif Los Angeles, Dept Comp Sci, Scalable Analyt Inst ScAi, Los Angeles, CA 90024 USA
基金
美国国家科学基金会; 中国国家自然科学基金; 美国国家卫生研究院;
关键词
Network embedding; Heterogeneous network; Disease association prediction; INFORMATION; SIMILARITY; VALIDATION; HETESIM; GENES;
D O I
10.1186/s12920-019-0623-3
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background It is significant to identificate complex biological mechanisms of various diseases in biomedical research. Recently, the growing generation of tremendous amount of data in genomics, epigenomics, metagenomics, proteomics, metabolomics, nutriomics, etc., has resulted in the rise of systematic biological means of exploring complex diseases. However, the disparity between the production of the multiple data and our capability of analyzing data has been broaden gradually. Furthermore, we observe that networks can represent many of the above-mentioned data, and founded on the vector representations learned by network embedding methods, entities which are in close proximity but at present do not actually possess direct links are very likely to be related, therefore they are promising candidate subjects for biological investigation. Results We incorporate six public biological databases to construct a heterogeneous biological network containing three categories of entities (i.e., genes, diseases, miRNAs) and multiple types of edges (i.e., the known relationships). To tackle the inherent heterogeneity, we develop a heterogeneous network embedding model for mapping the network into a low dimensional vector space in which the relationships between entities are preserved well. And in order to assess the effectiveness of our method, we conduct gene-disease as well as miRNA-disease associations predictions, results of which show the superiority of our novel method over several state-of-the-arts. Furthermore, many associations predicted by our method are verified in the latest real-world dataset. Conclusions We propose a novel heterogeneous network embedding method which can adequately take advantage of the abundant contextual information and structures of heterogeneous network. Moreover, we illustrate the performance of the proposed method on directing studies in biology, which can assist in identifying new hypotheses in biological investigation.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Heterogeneous network embedding enabling accurate disease association predictions
    Yun Xiong
    Mengjie Guo
    Lu Ruan
    Xiangnan Kong
    Chunlei Tang
    Yangyong Zhu
    Wei Wang
    BMC Medical Genomics, 12
  • [2] Predicting Disease-related Associations by Heterogeneous Network Embedding
    Xiong, Yun
    Ruan, Lu
    Guo, Mengjie
    Tang, Chunlei
    Kong, Xiangnan
    Zhu, Yangyong
    Wang, Wei
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 548 - 555
  • [3] Complex Disease Genes Identification Using a Heterogeneous Network Embedding Approach
    Ghasemi, Mahdieh
    Rahgozar, Maseud
    Kavousi, Kaveh
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 875 - 882
  • [4] Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
    Ming He
    Chen Huang
    Bo Liu
    Yadong Wang
    Junyi Li
    BMC Bioinformatics, 22
  • [5] Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
    He, Ming
    Huang, Chen
    Liu, Bo
    Wang, Yadong
    Li, Junyi
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [6] HerGePred: Heterogeneous Network Embedding Representation for Disease Gene Prediction
    Yang, Kuo
    Wang, Ruyu
    Liu, Guangming
    Shu, Zixin
    Wang, Ning
    Zhang, Runshun
    Yu, Jian
    Chen, Jianxin
    Li, Xiaodong
    Zhou, Xuezhong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (04) : 1805 - 1815
  • [7] HetNERec: Heterogeneous network embedding based recommendation
    Zhao, Zhongying
    Zhang, Xuejian
    Zhou, Hui
    Li, Chao
    Gong, Maoguo
    Wang, Yongqing
    KNOWLEDGE-BASED SYSTEMS, 2020, 204
  • [8] Multi-view Heterogeneous Network Embedding
    Du, Ouxia
    Zhang, Yujia
    Li, Xinyue
    Zhu, Junyi
    Zheng, Tanghu
    Li, Ya
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, 2022, 13369 : 3 - 15
  • [9] DapBCH: a disease association prediction model Based on Cross-species and Heterogeneous graph embedding
    Shi, Wanqi
    Feng, Hailin
    Li, Jian
    Liu, Tongcun
    Liu, Zhe
    FRONTIERS IN GENETICS, 2023, 14
  • [10] Structure-aware attributed heterogeneous network embedding
    Hao Wei
    Gang Xiong
    Qiang Wei
    Weiquan Cao
    Xin Li
    Knowledge and Information Systems, 2023, 65 : 1769 - 1785