High-Accuracy ncRNA Function Prediction via Deep Learning Using Global and Local Sequence Information

被引:1
作者
Orro, Alessandro [1 ]
Trombetti, Gabriele. A. A. [1 ]
机构
[1] Natl Res Council ITB CNR, Inst Biomed Technol, I-20054 Segrate, Italy
关键词
artificial intelligence; bioinformatics; genomics; ncRNA; function prediction; machine learning; NONCODING RNAS; DATABASE;
D O I
10.3390/biomedicines11061631
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The prediction of the biological function of non-coding ribonucleic acid (ncRNA) is an important step towards understanding the regulatory mechanisms underlying many diseases. Since non-coding RNAs are present in great abundance in human cells and are functionally diverse, developing functional prediction tools is necessary. With recent advances in non-coding RNA biology and the availability of complete genome sequences for a large number of species, we now have a window of opportunity for studying non-coding RNA biology. However, the computational methods used to predict the non-coding RNA functions are mostly either scarcely accurate, when based on sequence information alone, or prohibitively expensive in terms of computational burden when a secondary structure prediction is needed. We propose a novel computational method to predict the biological function of non-coding RNA genes that is based on a collection of deep network architectures utilizing solely ncRNA sequence information and which does not rely on or require expensive secondary ncRNA structure information. The approach presented in this work exhibits comparable or superior accuracy to methods that employ both sequence and structural features, at a much lower computational cost.
引用
收藏
页数:13
相关论文
共 33 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   lncRNAdb: a reference database for long noncoding RNAs [J].
Amaral, Paulo P. ;
Clark, Michael B. ;
Gascoigne, Dennis K. ;
Dinger, Marcel E. ;
Mattick, John S. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D146-D151
[3]   Group invariant Peano curves [J].
Cannon, James W. ;
Thurston, William P. .
GEOMETRY & TOPOLOGY, 2007, 11 :1315-1355
[4]   ncRDeep: Non-coding RNA classification with convolutional neural network [J].
Chantsalnyam, Tuvshinbayar ;
Lim, Dae Yeong ;
Tayara, Hilal ;
Chong, Kil To .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 88
[5]   Identification and classification of ncRNA molecules using graph properties [J].
Childs, Liam ;
Nikoloski, Zoran ;
May, Patrick ;
Walther, Dirk .
NUCLEIC ACIDS RESEARCH, 2009, 37 (09)
[6]   A review of some techniques for inclusion of domain-knowledge into deep neural networks [J].
Dash, Tirtharaj ;
Chitlangia, Sharad ;
Ahuja, Aditya ;
Srinivasan, Ashwin .
SCIENTIFIC REPORTS, 2022, 12 (01)
[7]   Landscape of transcription in human cells [J].
Djebali, Sarah ;
Davis, Carrie A. ;
Merkel, Angelika ;
Dobin, Alex ;
Lassmann, Timo ;
Mortazavi, Ali ;
Tanzer, Andrea ;
Lagarde, Julien ;
Lin, Wei ;
Schlesinger, Felix ;
Xue, Chenghai ;
Marinov, Georgi K. ;
Khatun, Jainab ;
Williams, Brian A. ;
Zaleski, Chris ;
Rozowsky, Joel ;
Roeder, Maik ;
Kokocinski, Felix ;
Abdelhamid, Rehab F. ;
Alioto, Tyler ;
Antoshechkin, Igor ;
Baer, Michael T. ;
Bar, Nadav S. ;
Batut, Philippe ;
Bell, Kimberly ;
Bell, Ian ;
Chakrabortty, Sudipto ;
Chen, Xian ;
Chrast, Jacqueline ;
Curado, Joao ;
Derrien, Thomas ;
Drenkow, Jorg ;
Dumais, Erica ;
Dumais, Jacqueline ;
Duttagupta, Radha ;
Falconnet, Emilie ;
Fastuca, Meagan ;
Fejes-Toth, Kata ;
Ferreira, Pedro ;
Foissac, Sylvain ;
Fullwood, Melissa J. ;
Gao, Hui ;
Gonzalez, David ;
Gordon, Assaf ;
Gunawardena, Harsha ;
Howald, Cedric ;
Jha, Sonali ;
Johnson, Rory ;
Kapranov, Philipp ;
King, Brandon .
NATURE, 2012, 489 (7414) :101-108
[8]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[9]   Non-coding RNAs in human disease [J].
Esteller, Manel .
NATURE REVIEWS GENETICS, 2011, 12 (12) :861-874
[10]   nRC: non-coding RNA Classifier based on structural features [J].
Fiannaca, Antonino ;
La Rosa, Massimo ;
La Paglia, Laura ;
Rizzo, Riccardo ;
Urso, Alfonso .
BIODATA MINING, 2017, 10