Heterogeneous network embedding enabling accurate disease association predictions

被引:18
作者
Xiong, Yun [1 ,2 ]
Guo, Mengjie [1 ,2 ]
Ruan, Lu [1 ,2 ]
Kong, Xiangnan [3 ]
Tang, Chunlei [4 ]
Zhu, Yangyong [1 ,2 ]
Wang, Wei [5 ]
机构
[1] Fudan Univ, Shanghai Key Lab Data Sci, Sch Comp Sci, Shanghai, Peoples R China
[2] Fudan Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Worcester Polytech Inst, Dept Comp Sci, Worcester, MA 01609 USA
[4] Harvard Med Sch, Brigham & Womens Hosp, Boston, MA 02115 USA
[5] Univ Calif Los Angeles, Dept Comp Sci, Scalable Analyt Inst ScAi, Los Angeles, CA 90024 USA
基金
美国国家科学基金会; 美国国家卫生研究院; 中国国家自然科学基金;
关键词
Network embedding; Heterogeneous network; Disease association prediction; INFORMATION; SIMILARITY; VALIDATION; HETESIM; GENES;
D O I
10.1186/s12920-019-0623-3
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background It is significant to identificate complex biological mechanisms of various diseases in biomedical research. Recently, the growing generation of tremendous amount of data in genomics, epigenomics, metagenomics, proteomics, metabolomics, nutriomics, etc., has resulted in the rise of systematic biological means of exploring complex diseases. However, the disparity between the production of the multiple data and our capability of analyzing data has been broaden gradually. Furthermore, we observe that networks can represent many of the above-mentioned data, and founded on the vector representations learned by network embedding methods, entities which are in close proximity but at present do not actually possess direct links are very likely to be related, therefore they are promising candidate subjects for biological investigation. Results We incorporate six public biological databases to construct a heterogeneous biological network containing three categories of entities (i.e., genes, diseases, miRNAs) and multiple types of edges (i.e., the known relationships). To tackle the inherent heterogeneity, we develop a heterogeneous network embedding model for mapping the network into a low dimensional vector space in which the relationships between entities are preserved well. And in order to assess the effectiveness of our method, we conduct gene-disease as well as miRNA-disease associations predictions, results of which show the superiority of our novel method over several state-of-the-arts. Furthermore, many associations predicted by our method are verified in the latest real-world dataset. Conclusions We propose a novel heterogeneous network embedding method which can adequately take advantage of the abundant contextual information and structures of heterogeneous network. Moreover, we illustrate the performance of the proposed method on directing studies in biology, which can assist in identifying new hypotheses in biological investigation.
引用
收藏
页数:17
相关论文
共 50 条
[21]   Heterogeneous Information Network Embedding for Recommendation [J].
Shi, Chuan ;
Hu, Binbin ;
Zhao, Wayne Xin ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) :357-370
[22]   A hierarchical fused fuzzy deep neural network with heterogeneous network embedding for recommendation [J].
Pham, Phu ;
Nguyen, Loan T. T. ;
Nguyen, Ngoc Thanh ;
Kozma, Robert ;
Vo, Bay .
INFORMATION SCIENCES, 2023, 620 :105-124
[23]   Deep heterogeneous network embedding based on Siamese Neural Networks [J].
Zhang, Chen ;
Tang, Zhouhua ;
Yu, Bin ;
Xie, Yu ;
Pan, Ke .
NEUROCOMPUTING, 2020, 388 :1-11
[24]   SAAED: Embedding and Deep Learning Enhance Accurate Prediction of Association Between circRNA and Disease [J].
Liu, Qingyu ;
Yu, Junjie ;
Cai, Yanning ;
Zhang, Guishan ;
Dai, Xianhua .
FRONTIERS IN GENETICS, 2022, 13
[25]   Explanations for Network Embedding-Based Link Predictions [J].
Kang, Bo ;
Lijffijt, Jefrey ;
De Bie, Tijl .
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021, PT I, 2021, 1524 :473-488
[26]   Network Sampling Using k-hop Random Walks for Heterogeneous Network Embedding [J].
Anil, Akash ;
Singhal, Shubham ;
Jain, Piyush ;
Singh, Sanasam Ranbir ;
Ladhar, Ajay ;
Singh, Sandeep ;
Chugh, Uppinder .
PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, :354-357
[27]   Signed Heterogeneous Network Embedding in Social Media [J].
Rizi, Fatemeh Salehi ;
Granitzer, Michael .
PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, :1877-1880
[28]   Embedding Heterogeneous Information Network in Hyperbolic Spaces [J].
Zhang, Yiding ;
Wang, Xiao ;
Liu, Nian ;
Shi, Chuan .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
[29]   Heterogeneous Information Network Embedding for Mention Recommendation [J].
Yi, Feng ;
Jiang, Bo ;
Wu, Jianjun .
IEEE ACCESS, 2020, 8 :91394-91404
[30]   Fast Attributed Multiplex Heterogeneous Network Embedding [J].
Liu, Zhijun ;
Huang, Chao ;
Yu, Yanwei ;
Fan, Baode ;
Dong, Junyu .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :995-1004