Inferring Potential CircRNA-Disease Associations via Deep Autoencoder-Based Classification

Cited by: 43
Authors
Deepthi, K. [1 ,2 ]
Jereesh, A. S. [1 ]
Affiliations
[1] Cochin Univ Sci & Technol, Bioinformat Lab, Dept Comp Sci, Kochi 682022, Kerala, India
[2] Vadakara CAPE, Coll Engn, Dept Comp Sci, Kozhikkode 673104, Kerala, India
Keywords
CIRCULAR RNAS; ROC CURVE; PREDICTION; ONTOLOGY
DOI
10.1007/s40291-020-00499-y
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Aim: Circular RNAs (circRNAs) are endogenous non-coding RNA molecules with a stable circular conformation. Growing evidence from recent experiments reveals that dysregulation and abnormal expression of circRNAs are correlated with complex diseases. Identifying the causal circRNAs behind diseases is therefore invaluable for explaining disease pathogenesis. Because biological experiments are difficult, slow, and prohibitively expensive, computational approaches are necessary for identifying the relationships between circRNAs and diseases.
Methods: We propose an ensemble method called AE-RF, based on a deep autoencoder and a random forest classifier, to predict potential circRNA-disease associations. The method first integrates circRNA and disease similarities to construct features. The integrated features are passed to the deep autoencoder to extract hidden biological patterns. The random forest classifier is then trained on the extracted deep features for association prediction.
Results and discussion: AE-RF achieved AUC scores of 0.9486 and 0.9522 in fivefold and tenfold cross-validation experiments, respectively. We conducted case studies on the top-ranked predictions and on three common human cancers, and compared the method with state-of-the-art classifiers and related methods. The experimental results and case studies demonstrate the predictive power of the model, which outperforms previous methods with a high degree of robustness. Training the classifier on the features extracted by the autoencoder enhanced the model's predictive performance. The top predicted circRNAs are promising candidates for further biological tests.
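The feature-construction and AUC-evaluation steps named in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: features for a circRNA-disease pair are built by concatenating the corresponding rows of two similarity matrices, and AUC is computed with the rank-based (Mann-Whitney) formulation. The function names `pair_features` and `auc` and the toy matrices are hypothetical, and the deep autoencoder and random forest stages are omitted entirely.

```python
from typing import List, Sequence


def pair_features(circ_sim: Sequence[Sequence[float]],
                  dis_sim: Sequence[Sequence[float]],
                  i: int, j: int) -> List[float]:
    """Build one feature vector for the pair (circRNA i, disease j).

    Assumption: integration is plain concatenation of the circRNA's
    similarity row with the disease's similarity row.
    """
    return list(circ_sim[i]) + list(dis_sim[j])


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based AUC: the fraction of (positive, negative) pairs in which
    the positive sample gets the higher score; ties count as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy (made-up) similarity matrices: 3 circRNAs and 2 diseases.
circ_sim = [[1.0, 0.2, 0.1],
            [0.2, 1.0, 0.5],
            [0.1, 0.5, 1.0]]
dis_sim = [[1.0, 0.3],
           [0.3, 1.0]]

features = pair_features(circ_sim, dis_sim, 0, 1)

# Toy labels (1 = known association) and hypothetical classifier scores.
toy_auc = auc([1, 0, 1, 0], [0.9, 0.1, 0.4, 0.6])
```

In a fuller pipeline, vectors such as `features` would be compressed by an autoencoder before a classifier scores them, and `auc` would be averaged over the held-out folds of a fivefold or tenfold split.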
Pages: 87-97
Page count: 11