Improving circRNA-disease association prediction by sequence and ontology representations with convolutional and recurrent neural networks

被引:34
作者
Lu, Chengqian [1 ,2 ]
Zeng, Min [1 ,2 ]
Wu, Fang-Xiang [3 ]
Li, Min [1 ,2 ]
Wang, Jianxin [1 ,2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Cent South Univ, Hunan Prov Key Lab Bioinformat, Changsha 410083, Peoples R China
[3] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
基金
中国国家自然科学基金;
关键词
CIRCULAR RNAS;
D O I
10.1093/bioinformatics/btaa1077
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Emerging studies indicate that circular RNAs (circRNAs) are widely involved in the progression of human diseases. Due to its special structure which is stable, circRNAs are promising diagnostic and prognostic biomarkers for diseases. However, the experimental verification of circRNA-disease associations is expensive and limited to small-scale. Effective computational methods for predicting potential circRNA-disease associations are regarded as a matter of urgency. Although several models have been proposed, over-reliance on known associations and the absence of characteristics of biological functions make precise predictions are still challenging. Results: In this study, we propose a method for predicting CircRNA-disease associations based on sequence and ontology representations, named CDASOR, with convolutional and recurrent neural networks. For sequences of circRNAs, we encode them with continuous k-mers, get low-dimensional vectors of k-mers, extract their local feature vectors with 1D CNN and learn their long-term dependencies with bi-directional long short-term memory. For diseases, we serialize disease ontology into sentences containing the hierarchy of ontology, obtain low-dimensional vectors for disease ontology terms and get terms' dependencies. Furthermore, we get association patterns of circRNAs and diseases from known circRNA-disease associations with neural networks. After the above steps, we get circRNAs' and diseases' high-level representations, which are informative to improve the prediction. The experimental results show that CDASOR provides an accurate prediction. Importing the characteristics of biological functions, CDASOR achieves impressive predictions in the de novo test. In addition, 6 of the top-10 predicted results are verified by the published literature in the case studies. Availability and implementation: The code and data of CDASOR are freely available at https://github.com/ BioinformaticsCSU/CDASOR. Contact: jxwang@mail.csu.edu.cn
引用
收藏
页码:5656 / 5664
页数:9
相关论文
共 46 条
[1]   LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases [J].
Bao, Zhenyu ;
Yang, Zhen ;
Huang, Zhou ;
Zhou, Yiran ;
Cui, Qinghua ;
Dong, Dong .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D1034-D1037
[2]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[3]   The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier [J].
Cao, Zhen ;
Pan, Xiaoyong ;
Yang, Yang ;
Huang, Yan ;
Shen, Hong-Bin .
BIOINFORMATICS, 2018, 34 (13) :2185-2194
[4]  
De S., 2018, Convergence guarantees for rmsprop and adam in non-convex optimization and an empirical comparison to nesterov acceleration, VV3
[5]   FUS affects circular RNA expression in murine embryonic stem cell-derived motor neurons [J].
Errichelli, Lorenzo ;
Modigliani, Stefano Dini ;
Laneve, Pietro ;
Colantoni, Alessio ;
Legnini, Ivano ;
Capauto, Davide ;
Rosa, Alessandro ;
De Santis, Riccardo ;
Scarfo, Rebecca ;
Peruzzi, Giovanna ;
Lu, Lei ;
Caffarelli, Elisa ;
Shneider, Neil A. ;
Morlando, Mariangela ;
Bozzoni, Irene .
NATURE COMMUNICATIONS, 2017, 8
[6]   Prediction of CircRNA-Disease Associations Using KATZ Model Based on Heterogeneous Networks [J].
Fan, Chunyan ;
Lei, Xiujuan ;
Wu, Fang-Xiang .
INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES, 2018, 14 (14) :1950-1959
[7]   CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases [J].
Fan, Chunyan ;
Lei, Xiujuan ;
Fang, Zengqiang ;
Jiang, Qinghua ;
Wu, Fang-Xiang .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
[8]   circBase: a database for circular RNAs [J].
Glazar, Petar ;
Papavasileiou, Panagiotis ;
Rajewsky, Nikolaus .
RNA, 2014, 20 (11) :1666-1670
[9]  
Hamosh A, 2005, NUCLEIC ACIDS RES, V33, pD514
[10]   Natural RNA circles function as efficient microRNA sponges [J].
Hansen, Thomas B. ;
Jensen, Trine I. ;
Clausen, Bettina H. ;
Bramsen, Jesper B. ;
Finsen, Bente ;
Damgaard, Christian K. ;
Kjems, Jorgen .
NATURE, 2013, 495 (7441) :384-388