SGANRDA: semi-supervised generative adversarial networks for predicting circRNA-disease associations

被引:38
作者
Wang, Lei [1 ]
Yan, Xin [2 ]
You, Zhu-Hong [1 ]
Zhou, Xi [1 ]
Li, Hao-Yuan [2 ]
Huang, Yu-An [3 ]
机构
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
基金
中国国家自然科学基金; 中国科学院西部之光基金;
关键词
circRNA; circRNA-disease association; deep learning; semi-supervised learning; generative adversarial networks; extreme learning machine; CIRCULAR RNA; SIMILARITY; DATABASE; ABUNDANT;
D O I
10.1093/bib/bbab028
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Emerging research shows that circular RNA (circRNA) plays a crucial role in the diagnosis, occurrence and prognosis of complex human diseases. Compared with traditional biological experiments, the computational method of fusing multi-source biological data to identify the association between circRNA and disease can effectively reduce cost and save time. Considering the limitations of existing computational models, we propose a semi-supervised generative adversarial network (GAN) model SGANRDA for predicting circRNA-disease association. This model first fused the natural language features of the circRNA sequence and the features of disease semantics, circRNA and disease Gaussian interaction profile kernel, and then used all circRNA-disease pairs to pre-train the GAN network, and fine-tune the network parameters through labeled samples. Finally, the extreme learning machine classifier is employed to obtain the prediction result. Compared with the previous supervision model, SGANRDA innovatively introduced circRNA sequences and utilized all the information of circRNA-disease pairs during the pre-training process. This step can increase the information content of the feature to some extent and reduce the impact of too few known associations on the model performance. SGANRDA obtained AUC scores of 0.9411 and 0.9223 in leave-one-out cross-validation and 5-fold cross-validation, respectively. Prediction results on the benchmark dataset show that SGANRDA outperforms other existing models. In addition, 25 of the top 30 circRNA-disease pairs with the highest scores of SGANRDA in case studies were verified by recent literature. These experimental results demonstrate that SGANRDA is a useful model to predict the circRNA-disease association and can provide reliable candidates for biological experiments.
引用
收藏
页数:11
相关论文
共 46 条
[1]   WNT signalling pathways as therapeutic targets in cancer [J].
Anastas, Jamie N. ;
Moon, Randall T. .
NATURE REVIEWS CANCER, 2013, 13 (01) :11-26
[2]   SOME YEAST MITOCHONDRIAL RNAS ARE CIRCULAR [J].
ARNBERG, AC ;
VANOMMEN, GJB ;
GRIVELL, LA ;
VANBRUGGEN, EFJ ;
BORST, P .
CELL, 1980, 19 (02) :313-319
[3]   SPLICING WITH INVERTED ORDER OF EXONS OCCURS PROXIMAL TO LARGE INTRONS [J].
COCQUERELLE, C ;
DAUBERSIES, P ;
MAJERUS, MA ;
KERCKAERT, JP ;
BAILLEUL, B .
EMBO JOURNAL, 1992, 11 (03) :1095-1098
[4]   MISSPLICING YIELDS CIRCULAR RNA MOLECULES [J].
COCQUERELLE, C ;
MASCREZ, B ;
HETUIN, D ;
BAILLEUL, B .
FASEB JOURNAL, 1993, 7 (01) :155-160
[5]   The RNA Binding Protein Quaking Regulates Formation of circRNAs [J].
Conn, Simon J. ;
Pillman, Katherine A. ;
Toubia, John ;
Conn, Vanessa M. ;
Salmanidis, Marika ;
Phillips, Caroline A. ;
Roslan, Suraya ;
Schreiber, Andreas W. ;
Gregory, Philip A. ;
Goodall, Gregory J. .
CELL, 2015, 160 (06) :1125-1134
[6]   Prediction of CircRNA-Disease Associations Using KATZ Model Based on Heterogeneous Networks [J].
Fan, Chunyan ;
Lei, Xiujuan ;
Wu, Fang-Xiang .
INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES, 2018, 14 (14) :1950-1959
[7]   CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases [J].
Fan, Chunyan ;
Lei, Xiujuan ;
Fang, Zengqiang ;
Jiang, Qinghua ;
Wu, Fang-Xiang .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
[8]  
Ghosal Suman, 2013, Frontiers in Genetics, V4, P283, DOI 10.3389/fgene.2013.00283
[9]   circBase: a database for circular RNAs [J].
Glazar, Petar ;
Papavasileiou, Panagiotis ;
Rajewsky, Nikolaus .
RNA, 2014, 20 (11) :1666-1670
[10]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672