Semantic Meta-Path Enhanced Global and Local Topology Learning for lncRNA-Disease Association Prediction

被引:8
作者
Xuan, Ping [1 ,2 ]
Zhao, Yue [1 ]
Cui, Hui [3 ]
Zhan, Linyun [4 ]
Jin, Qiangguo [5 ]
Zhang, Tiangang [6 ]
Nakaguchi, Toshiya [7 ]
机构
[1] Heilongjiang Univ, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Shantou Univ, Sch Engn, Dept Comp Sci, Shantou 515063, Peoples R China
[3] La Trobe Univ, Dept Comp Sci & Informat Technol, Melbourne, Vic 3083, Australia
[4] Heilongjiang Univ, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[5] Northwestern Polytech Univ, Sch Software, Xian 710060, Shaanxi, Peoples R China
[6] Heilongjiang Univ, Sch Math Sci, Harbin 150080, Peoples R China
[7] Chiba Univ, Ctr Frontier Med Engn, Chiba 2638522, Japan
基金
中国博士后科学基金;
关键词
Diseases; Topology; Semantics; Predictive models; Decoding; Convolution; Bioinformatics; LncRNA-disease association prediction; LncRNA-disease-miRNA heterogeneous graph; graph convolutional autoencoder with attention; meta-path based semantic learning; LONG NONCODING RNAS; FUNCTIONAL SIMILARITY; IDENTIFICATION; GENOME;
D O I
10.1109/TCBB.2022.3209571
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Since abnormal expression of long non-coding RNAs (lncRNAs) is associated with various human diseases, identifying disease-related lncRNAs helps reveal the pathogenesis of diseases. Existing methods for lncRNA-disease association prediction mainly focus on multi-sourced data related to lncRNAs and diseases. The rich semantic information of meta-paths, composed of multiple kinds of connections between lncRNA and disease nodes, is neglected. We propose a new prediction method, MGLDA, to encode and integrate the semantics of multiple meta-paths, the global topology of heterogeneous graph, and pairwise attributes of lncRNA and disease nodes. First, a tri-layer heterogeneous graph is constructed to associate multi-sourced data across the lncRNA, disease, and miRNA nodes. Afterwards, we establish multiple meta-paths connecting the lncRNA and disease nodes to derive and denote various semantics. Each meta-path contains its specific semantics formulated by an embedding strategy, and each embedding covers local topology formed by the diverse semantic connections among the lncRNA, disease, and miRNA nodes. We construct multiple graph convolutional autoencoders (GCA) with topology-level attention to learn global and multiple local topologies from the tri-layer graph and each meta-path, respectively. The topology-level attention mechanism can learn the importance of various global and local topologies for adaptive pairwise topology fusion. Finally, a convolutional autoencoder learns the attribute representations of lncRNA-disease pairs, which integrates the learnt detailed and representative pairwise features. Experimental results show that MGLDA outperforms other state-of-the-art prediction methods in comparison and retrieves more real lncRNA-disease associations in the top-ranked candidates. The ablation study also demonstrates the important contributions of the local and global topology learning, and pairwise attribute learning. Case studies on three diseases further demonstrate MGLDA's ability to identify potential disease-related lncRNAs.
引用
收藏
页码:1480 / 1491
页数:12
相关论文
共 50 条
[1]  
[Anonymous], 2015, P INT C LEARNING REP, DOI [DOI 10.48550/ARXIV.1412.6980, 10.48550/arXiv.1412.6980]
[2]   LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases [J].
Bao, Zhenyu ;
Yang, Zhen ;
Huang, Zhou ;
Zhou, Yiran ;
Cui, Qinghua ;
Dong, Dong .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D1034-D1037
[3]   Long Noncoding RNAs: Cellular Address Codes in Development and Disease [J].
Batista, Pedro J. ;
Chang, Howard Y. .
CELL, 2013, 152 (06) :1298-1307
[4]   A Multi-Label Classification Framework to Predict Disease Associations of Long Non-coding RNAs (lncRNAs) [J].
Biswas, Ashis Kumer ;
Zhang, Baoju ;
Wu, Xiaoyong ;
Gao, Jean X. .
PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2015, 322 :821-830
[5]   Long non-coding RNAs and complex diseases: from experimental results to computational models [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xu ;
You, Zhu-Hong .
BRIEFINGS IN BIOINFORMATICS, 2017, 18 (04) :558-576
[6]   IRWRLDA: improved random walk with restart for lncRNA-disease association prediction [J].
Chen, Xing ;
You, Zhu-Hong ;
Yan, Gui-Ying ;
Gong, Dun-Wei .
ONCOTARGET, 2016, 7 (36) :57919-57931
[7]   KATZLDA: KATZ measure for the lncRNA-disease association prediction [J].
Chen, Xing .
SCIENTIFIC REPORTS, 2015, 5
[8]   Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Luo, Cai ;
Ji, Wen ;
Zhang, Yongdong ;
Dai, Qionghai .
SCIENTIFIC REPORTS, 2015, 5
[9]   Novel human lncRNA-disease association inference based on lncRNA expression profiles [J].
Chen, Xing ;
Yan, Gui-Ying .
BIOINFORMATICS, 2013, 29 (20) :2617-2624
[10]   Genome-wide analysis of long noncoding RNA stability [J].
Clark, Michael B. ;
Johnston, Rebecca L. ;
Inostroza-Ponta, Mario ;
Fox, Archa H. ;
Fortini, Ellen ;
Moscato, Pablo ;
Dinger, Marcel E. ;
Mattick, John S. .
GENOME RESEARCH, 2012, 22 (05) :885-898