Prioritizing CircRNA-Disease Associations With Convolutional Neural Network Based on Multiple Similarity Feature Fusion

被引:28
作者
Fan, Chunyan [1 ]
Lei, Xiujuan [1 ]
Pan, Yi [2 ]
机构
[1] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China
[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
基金
中国国家自然科学基金;
关键词
circRNA-disease associations; circRNA-miRNA interaction; similarity kernel fusion; feature matrix; convolutional neural network; CIRCULAR RNAS; SEMANTIC SIMILARITY; LANDSCAPE; DATABASE; PREDICTION; LNCRNA;
D O I
10.3389/fgene.2020.540751
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Accumulating evidence shows that circular RNAs (circRNAs) have significant roles in human health and in the occurrence and development of diseases. Biological researchers have identified disease-related circRNAs that could be considered as potential biomarkers for clinical diagnosis, prognosis, and treatment. However, identification of circRNA-disease associations using traditional biological experiments is still expensive and time-consuming. In this study, we propose a novel method named MSFCNN for the task of circRNA-disease association prediction, involving two-layer convolutional neural networks on a feature matrix that fuses multiple similarity kernels and interaction features among circRNAs, miRNAs, and diseases. First, four circRNA similarity kernels and seven disease similarity kernels are constructed based on the biological or topological properties of circRNAs and diseases. Subsequently, the similarity kernel fusion method is used to integrate the similarity kernels into one circRNA similarity kernel and one disease similarity kernel, respectively. Then, a feature matrix for each circRNA-disease pair is constructed by integrating the fused circRNA similarity kernel and fused disease similarity kernel with interactions and features among circRNAs, miRNAs, and diseases. The features of circRNA-miRNA and disease-miRNA interactions are selected using principal component analysis. Finally, taking the constructed feature matrix as an input, we used two-layer convolutional neural networks to predict circRNA-disease association labels and mine potential novel associations. Five-fold cross validation shows that our proposed model outperforms conventional machine learning methods, including support vector machine, random forest, and multilayer perception approaches. Furthermore, case studies of predicted circRNAs for specific diseases and the top predicted circRNA-disease associations are analyzed. The results show that the MSFCNN model could be an effective tool for mining potential circRNA-disease associations.
引用
收藏
页数:13
相关论文
共 62 条
[41]  
Salakhutdinov R. R., 2012, arXiv:1207.0580.
[42]   A ceRNA Hypothesis: The Rosetta Stone of a Hidden RNA Language? [J].
Salmena, Leonardo ;
Poliseno, Laura ;
Tay, Yvonne ;
Kats, Lev ;
Pandolfi, Pier Paolo .
CELL, 2011, 146 (03) :353-358
[43]   Human Disease Ontology 2018 update: classification, content and workflow expansion [J].
Schriml, Lynn M. ;
Mitraka, Elvira ;
Munro, James ;
Tauber, Becky ;
Schor, Mike ;
Nickle, Lance ;
Felix, Victor ;
Jeng, Linda ;
Bearer, Cynthia ;
Lichenstein, Richard ;
Bisordi, Katharine ;
Campion, Nicole ;
Hyman, Brooke ;
Kurland, David ;
Oates, Connor Patrick ;
Kibbey, Siobhan ;
Sreekumar, Poorna ;
Le, Chris ;
Giglio, Michelle ;
Greene, Carol .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D955-D962
[44]   CircCode: A Powerful Tool for Identifying circRNA Coding Ability [J].
Sun, Peisen ;
Li, Guanglin .
FRONTIERS IN GENETICS, 2019, 10
[45]   Genome-wide identification and functional analysis of circRNAs in Zea mays [J].
Tang, Baihua ;
Hao, Zhiqiang ;
Zhu, Yanfeng ;
Zhang, Hua ;
Li, Guanglin .
PLOS ONE, 2018, 13 (12)
[46]   Gaussian interaction profile kernels for predicting drug-target interaction [J].
van Laarhoven, Twan ;
Nabuurs, Sander B. ;
Marchiori, Elena .
BIOINFORMATICS, 2011, 27 (21) :3036-3043
[47]   The Landscape of Circular RNA in Cancer [J].
Vo, Josh N. ;
Cieslik, Marcin ;
Zhang, Yajia ;
Shukla, Sudhanshu ;
Xiao, Lanbo ;
Zhang, Yuping ;
Wu, Yi-Mi ;
Dhanasekaran, Saravana M. ;
Engelke, Carl G. ;
Cao, Xuhong ;
Robinson, Dan R. ;
Nesvizhskii, Alexey I. ;
Chinnaiyan, Arul M. .
CELL, 2019, 176 (04) :869-+
[48]   A new method to measure the semantic similarity of GO terms [J].
Wang, James Z. ;
Du, Zhidian ;
Payattakool, Rapeeporn ;
Yu, Philip S. ;
Chen, Chin-Fu .
BIOINFORMATICS, 2007, 23 (10) :1274-1281
[49]   Deep learning of the back-splicing code for circular RNA formation [J].
Wang, Jun ;
Wang, Liangjiang .
BIOINFORMATICS, 2019, 35 (24) :5235-5242
[50]   Predicting circRNA-Disease Associations Based on circRNA Expression Similarity and Functional Similarity [J].
Wang, Yongtian ;
Nie, Chenxi ;
Zang, Tianyi ;
Wang, Yadong .
FRONTIERS IN GENETICS, 2019, 10