MFPINC: prediction of plant ncRNAs based on multi-source feature fusion

被引:1
作者
Nie, Zhenjun [1 ]
Gao, Mengqing [1 ]
Jin, Xiu [1 ,2 ]
Rao, Yuan [1 ,2 ]
Zhang, Xiaodan [1 ,2 ]
机构
[1] Anhui Agr Univ, Sch Informat & Artificial Intelligence, Hefei 230036, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Agr Sensors, Hefei 230036, Peoples R China
关键词
Plants; ncRNA prediction; Fusion of deep feature and sequence feature; FEATURE-SELECTION; NONCODING RNAS; PROTEIN; MECHANISMS;
D O I
10.1186/s12864-024-10439-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Non-coding RNAs (ncRNAs) are recognized as pivotal players in the regulation of essential physiological processes such as nutrient homeostasis, development, and stress responses in plants. Common methods for predicting ncRNAs are susceptible to significant effects of experimental conditions and computational methods, resulting in the need for significant investment of time and resources. Therefore, we constructed an ncRNA predictor(MFPINC), to predict potential ncRNA in plants which is based on the PINC tool proposed by our previous studies. Specifically, sequence features were carefully refined using variance thresholding and F-test methods, while deep features were extracted and feature fusion were performed by applying the GRU model. The comprehensive evaluation of multiple standard datasets shows that MFPINC not only achieves more comprehensive and accurate identification of gene sequences, but also significantly improves the expressive and generalization performance of the model, and MFPINC significantly outperforms the existing competing methods in ncRNA identification. In addition, it is worth mentioning that our tool can also be found on Github (https://github.com/Zhenj-Nie/MFPINC) the data and source code can also be downloaded for free.
引用
收藏
页数:23
相关论文
共 53 条
[1]   Annotating non-coding regions of the genome [J].
Alexander, Roger P. ;
Fang, Gang ;
Rozowsky, Joel ;
Snyder, Michael ;
Gerstein, Mark B. .
NATURE REVIEWS GENETICS, 2010, 11 (08) :559-571
[2]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[3]   NON-CODING RNAs IN DEVELOPMENT AND DISEASE: BACKGROUND, MECHANISMS, AND THERAPEUTIC APPROACHES [J].
Beermann, Julia ;
Piccoli, Maria-Teresa ;
Viereck, Janika ;
Thum, Thomas .
PHYSIOLOGICAL REVIEWS, 2016, 96 (04) :1297-1325
[4]   LNCcation: lncRNA localization and function [J].
Bridges, Mary Catherine ;
Daulagala, Amanda C. ;
Kourtidis, Antonis .
JOURNAL OF CELL BIOLOGY, 2021, 220 (02)
[5]   The UCSC Genome Browser database: 2018 update [J].
Casper, Jonathan ;
Zweig, Ann S. ;
Villarreal, Chris ;
Tyner, Cath ;
Speir, Matthew L. ;
Rosenbloom, Kate R. ;
Raney, Brian J. ;
Lee, Christopher M. ;
Lee, Brian T. ;
Karolchik, Donna ;
Hinrichs, Angie S. ;
Haeussler, Maximilian ;
Guruvadoo, Luvina ;
Gonzalez, Jairo Navarro ;
Gibson, David ;
Fiddes, Ian T. ;
Eisenhart, Christopher ;
Diekhans, Mark ;
Clawson, Hiram ;
Barber, Galt P. ;
Armstrong, Joel ;
Haussler, David ;
Kuhn, Robert M. ;
Kent, W. James .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D762-D769
[6]   Computational model for ncRNA research [J].
Chen, Xing ;
Huang, Li .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
[7]   Novel human lncRNA-disease association inference based on lncRNA expression profiles [J].
Chen, Xing ;
Yan, Gui-Ying .
BIOINFORMATICS, 2013, 29 (20) :2617-2624
[8]   Deep learning models for disease-associated circRNA prediction: a review [J].
Chen, Yaojia ;
Wang, Jiacheng ;
Wang, Chuyu ;
Liu, Mingxin ;
Zou, Quan .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
[9]   DincRNA: a comprehensive web-based bioinformatics toolkit for exploring disease associations and ncRNA function [J].
Cheng, Liang ;
Hu, Yang ;
Sun, Jie ;
Zhou, Meng ;
Jiang, Qinghua .
BIOINFORMATICS, 2018, 34 (11) :1953-1956
[10]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493