Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks

被引:16
作者
Shen, Zhen [1 ]
Shao, Yan Ling [1 ]
Liu, Wei [1 ]
Zhang, Qinhu [2 ,3 ,4 ]
Yuan, Lin [5 ]
机构
[1] Nanyang Inst Technol, Sch Comp & Software, Changjiang Rd 80, Nanyang 473004, Henan, Peoples R China
[2] Tongji Univ, Translat Med Ctr Stem Cell Therapy, Siping Rd 1239, Shanghai 200092, Peoples R China
[3] Tongji Univ, Inst Regenerat Med, Shanghai East Hosp, Bioinformat Dept,Sch Life Sci & Technol, Siping Rd 1239, Shanghai 200092, Peoples R China
[4] Tongji Univ, Inst Machine Learning & Syst Biol, Sch Elect & Informat Engn, Caoan Rd 4800, Shanghai 201804, Peoples R China
[5] Qilu Univ Technol, Shandong Acad Sci, Sch Comp Sci & Technol, Daxue Rd 3501, Jinan 250353, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
CircRNA; Back-splicing sites prediction; Deep learning; Convolutional neural networks; Batch normalization; CIRCULAR RNAS; BIOGENESIS; MECHANISMS; EXPRESSION;
D O I
10.1186/s12864-022-08820-1
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Circular RNAs (CircRNAs) play critical roles in gene expression regulation and disease development. Understanding the regulation mechanism of CircRNAs formation can help reveal the role of CircRNAs in various biological processes mentioned above. Back-splicing is important for CircRNAs formation. Back-splicing sites prediction helps uncover the mysteries of CircRNAs formation. Several methods were proposed for back-splicing sites prediction or circRNA-realted prediction tasks. Model performance was constrained by poor feature learning and using ability. Results: In this study, CircCNN was proposed to predict pre-mRNA back-splicing sites. Convolution neural network and batch normalization are the main parts of CircCNN. Experimental results on three datasets show that CircCNN outperforms other baseline models. Moreover, PPM (Position Probability Matrix) features extract by CircCNN were converted as motifs. Further analysis reveals that some of motifs found by CircCNN match known motifs involved in gene expression regulation, the distribution of motif and special short sequence is important for pre-mRNA back-splicing. Conclusions: In general, the findings in this study provide a new direction for exploring CircRNA-related gene expression regulatory mechanism and identifying potential targets for complex malignant diseases. The datasets and source code of this study are freely available at: https://github.com/szhh521/CircCNN.
引用
收藏
页数:12
相关论文
共 47 条
[1]   circRNA Biogenesis Competes with Pre-mRNA Splicing [J].
Ashwal-Fluss, Reut ;
Meyer, Markus ;
Pamudurti, Nagarjuna Reddy ;
Ivanov, Andranik ;
Bartok, Osnat ;
Hanan, Mor ;
Evantal, Naveh ;
Memczak, Sebastian ;
Rajewsky, Nikolaus ;
Kadener, Sebastian .
MOLECULAR CELL, 2014, 56 (01) :55-66
[2]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[3]   Modulation of PKM alternative splicing by PTBP1 promotes gemcitabine resistance in pancreatic cancer cells [J].
Calabretta, S. ;
Bielli, P. ;
Passacantilli, I. ;
Pilozzi, E. ;
Fendrich, V. ;
Capurso, G. ;
Delle Fave, G. ;
Sette, C. .
ONCOGENE, 2016, 35 (16) :2031-2039
[4]   circDeep: deep learning approach for circular RNA classification from other long non-coding RNA [J].
Chaabane, Mohamed ;
Williams, Robert M. ;
Stephens, Austin T. ;
Park, Juw Won .
BIOINFORMATICS, 2020, 36 (01) :73-80
[5]   The expanding regulatory mechanisms and cellular functions of circular RNAs [J].
Chen, Ling-Ling .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2020, 21 (08) :475-490
[6]   The biogenesis and emerging roles of circular RNAs [J].
Chen, Ling-Ling .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2016, 17 (04) :205-211
[7]   circRNADb: A comprehensive database for human circular RNAs with protein-coding annotations [J].
Chen, Xiaoping ;
Han, Ping ;
Zhou, Tao ;
Guo, Xuejiang ;
Song, Xiaofeng ;
Li, Yan .
SCIENTIFIC REPORTS, 2016, 6
[8]   Regulation of cyclooxygenase-2 expression by the translational silencer TIA-1 [J].
Dixon, DA ;
Balch, GC ;
Kedersha, N ;
Anderson, P ;
Zimmerman, GA ;
Beauchamp, RD ;
Prescott, SM .
JOURNAL OF EXPERIMENTAL MEDICINE, 2003, 198 (03) :475-481
[9]   The Circular RNA circSKA3 Binds Integrin β1 to Induce Invadopodium Formation Enhancing Breast Cancer Invasion [J].
Du, William W. ;
Yang, Weining ;
Li, Xiangmin ;
Fang, Ling ;
Wu, Nan ;
Li, Feiya ;
Chen, Yu ;
He, Qihan ;
Liu, Elizabeth ;
Yang, Zhenguo ;
Awan, Faryal Mehwish ;
Liu, Mingyao ;
Yang, Burton B. .
MOLECULAR THERAPY, 2020, 28 (05) :1287-1298
[10]   An Evolutionarily Conserved uORF Regulates PGC1α and Oxidative Metabolism in Mice, Flies, and Bluefin Tuna [J].
Dumesic, Phillip A. ;
Egan, Daniel F. ;
Gut, Philipp ;
Tran, Mei T. ;
Parisi, Alice ;
Chatterjee, Nirmalya ;
Jedrychowski, Mark ;
Paschini, Margherita ;
Kazak, Lawrence ;
Wilensky, Sarah E. ;
Dou, Florence ;
Bogoslavski, Dina ;
Cartier, Jeffrey A. ;
Perrimon, Norbert ;
Kajimura, Shingo ;
Parikh, Samir M. ;
Spiegelman, Bruce M. .
CELL METABOLISM, 2019, 30 (01) :190-+