A heterogeneous information network learning model with neighborhood-level structural representation for predicting lncRNA-miRNA interactions

被引:20
作者
Zhao, Bo-Wei [1 ]
Su, Xiao-Rui [2 ]
Yang, Yue [2 ]
Li, Dong-Xu [2 ]
Li, Guo-Dong [2 ]
Hu, Peng-Wei [2 ]
Luo, Xin [1 ]
Hu, Lun [2 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Sch Software, Chongqing 400715, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
基金
中国国家自然科学基金;
关键词
Network structural representation; Heterogeneous information networks; Biological and network representations; LMIs; NONCODING RNAS; DATABASE; CANCER; GENES;
D O I
10.1016/j.csbj.2024.06.032
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) are closely related to the treatment of human diseases. Traditional biological experiments often require time-consuming and labor-intensive in their search for mechanisms of disease. Computational methods are regarded as an effective way to predict unknown lncRNA-miRNA interactions (LMIs). However, most of them complete their tasks by mainly focusing on a single lncRNA-miRNA network without considering the complex mechanism between biomolecular in life activities, which are believed to be useful for improving the accuracy of LMI prediction. To address this, a heterogeneous information network (HIN) learning model with neighborhood-level structural representation, called HINLMI, to precisely identify LMIs. In particular, HINLMI first constructs a HIN by integrating nine interactions of five biomolecules. After that, different representation learning strategies are applied to learn the biological and network representations of lncRNAs and miRNAs in the HIN from different perspectives. Finally, HINLMI incorporates the XGBoost classifier to predict unknown LMIs using final embeddings of lncRNAs and miRNAs. Experimental results show that HINLMI yields a best performance on the real dataset when compared with state-of-the-art computational models. Moreover, several analysis experiments indicate that the simultaneous consideration of biological knowledge and network topology of lncRNAs and miRNAs allows HINLMI to accurately predict LMIs from a more comprehensive perspective. The promising performance of HINLMI also reveals that the utilization of rich heterogeneous information can provide an alternative insight for HINLMI to identify novel interactions between lncRNAs and miRNAs.
引用
收藏
页码:2924 / 2933
页数:10
相关论文
共 41 条
[1]   WSNMF: Weighted Symmetric Nonnegative Matrix Factorization for attributed graph clustering [J].
Berahmand, Kamal ;
Mohammadi, Mehrnoush ;
Sheikhpour, Razieh ;
Li, Yuefeng ;
Xu, Yue .
NEUROCOMPUTING, 2024, 566
[2]   LncRNADisease: a database for long-non-coding RNA-associated diseases [J].
Chen, Geng ;
Wang, Ziyun ;
Wang, Dongqing ;
Qiu, Chengxiang ;
Liu, Mingxi ;
Chen, Xing ;
Zhang, Qipeng ;
Yan, Guiying ;
Cui, Qinghua .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D983-D986
[3]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[4]   Ensemble of decision tree reveals potential miRNA-disease associations [J].
Chen, Xing ;
Zhu, Chi-Chi ;
Yin, Jun .
PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (07)
[5]   Novel human lncRNA-disease association inference based on lncRNA expression profiles [J].
Chen, Xing ;
Yan, Gui-Ying .
BIOINFORMATICS, 2013, 29 (20) :2617-2624
[6]  
Chong V, 2009, Colorectal cancer: incidence and trend in Brunei Darussalam
[7]   miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions [J].
Chou, Chih-Hung ;
Shrestha, Sirjana ;
Yang, Chi-Dung ;
Chang, Nai-Wen ;
Lin, Yu-Ling ;
Liao, Kuang-Wen ;
Huang, Wei-Chi ;
Sun, Ting-Hsuan ;
Tu, Siang-Jyun ;
Lee, Wei-Hsiang ;
Chiew, Men-Yee ;
Tai, Chun-San ;
Wei, Ting-Yen ;
Tsai, Tzi-Ren ;
Huang, Hsin-Tzu ;
Wang, Chung-Yu ;
Wu, Hsin-Yi ;
Ho, Shu-Yi ;
Chen, Pin-Rong ;
Chuang, Cheng-Hsun ;
Hsieh, Pei-Jung ;
Wu, Yi-Shin ;
Chen, Wen-Liang ;
Li, Meng-Ju ;
Wu, Yu-Chun ;
Huang, Xin-Yi ;
Ng, Fung Ling ;
Buddhakosai, Waradee ;
Huang, Pei-Chun ;
Lan, Kuan-Chun ;
Huang, Chia-Yen ;
Weng, Shun-Long ;
Cheng, Yeong-Nan ;
Liang, Chao ;
Hsu, Wen-Lian ;
Huang, Hsien-Da .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D296-D302
[8]   The Comparative Toxicogenomics Database: update 2017 [J].
Davis, Allan Peter ;
Grondin, Cynthia J. ;
Johnson, Robin J. ;
Sciaky, Daniela ;
King, Benjamin L. ;
McMorran, Roy ;
Wiegers, Jolene ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D972-D978
[9]   NONCODEV5: a comprehensive annotation database for long non-coding RNAs [J].
Fang, ShuangSang ;
Zhang, LiLi ;
Guo, JinCheng ;
Niu, YiWei ;
Wu, Yang ;
Li, Hui ;
Zhao, Lian He ;
Li, Xi Yuan ;
Teng, Xue Yi ;
Sun, XianHui ;
Sun, Liang ;
Zhang, Michael Q. ;
Chen, RunSheng ;
Zhao, Yi .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D308-D314
[10]   Health-aware food recommendation system with dual attention in heterogeneous graphs [J].
Forouzandeh, Saman ;
Rostami, Mehrdad ;
Berahmand, Kamal ;
Sheikhpour, Razieh .
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169