Dragon PolyA Spotter: predictor of poly(A) motifs within human genomic DNA sequences

被引:36
作者
Kalkatawi, Manal [1 ]
Rangkuti, Farania [1 ]
Schramm, Michael [1 ]
Jankovic, Boris R. [1 ]
Kamau, Allan [1 ]
Chowdhary, Rajesh [2 ]
Archer, John A. C. [1 ]
Bajic, Vladimir B. [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Computat Biosci Res Ctr, Thuwal 239556900, Saudi Arabia
[2] Marshfield Clin Fdn Med Res & Educ, Biomed Informat Res Ctr, MCRF, Marshfield, WI 54449 USA
关键词
POLYADENYLATION SIGNALS; SITES; MODEL;
D O I
10.1093/bioinformatics/btr602
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc. kaust. edu. sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants.
引用
收藏
页码:127 / 129
页数:3
相关论文
共 17 条
[1]  
Ahmed Firoz, 2009, In Silico Biology, V9, P135, DOI 10.3233/ISB-2009-0395
[2]   POLYAR, a new computer program for prediction of poly(A) sites in human sequences [J].
Akhtar, Malik Nadeem ;
Bukhari, Syed Abbas ;
Fazal, Zeeshan ;
Qamar, Raheel ;
Shahmuradov, Ilham A. .
BMC GENOMICS, 2010, 11
[3]   Patterns of variant polyadenylation signal usage in human genes [J].
Beaudoing, E ;
Freier, S ;
Wyatt, JR ;
Claverie, JM ;
Gautheret, D .
GENOME RESEARCH, 2000, 10 (07) :1001-1010
[4]   POLY(A), POLY(A) BINDING-PROTEIN AND THE REGULATION OF MESSENGER-RNA STABILITY [J].
BERNSTEIN, P ;
ROSS, J .
TRENDS IN BIOCHEMICAL SCIENCES, 1989, 14 (09) :373-377
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Prediction of mRNA polyadenylation sites by support vector machine [J].
Cheng, Yiming ;
Miura, Robert M. ;
Tian, Bin .
BIOINFORMATICS, 2006, 22 (19) :2320-2325
[7]   DiProDB: a database for dinucleotide properties [J].
Friedel, Maik ;
Nikolajewa, Swetlana ;
Suehnel, Juergen ;
Wilhelm, Thomas .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D37-D40
[8]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[9]   Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation [J].
Hu, J ;
Lutz, CS ;
Wilusz, J ;
Tian, B .
RNA, 2005, 11 (10) :1485-1493
[10]   A classification-based prediction model of messenger RNA polyadenylation sites [J].
Ji, Guoli ;
Wu, Xiaohui ;
Shen, Yingjia ;
Huang, Jiangyin ;
Li, Qingshun Quinn .
JOURNAL OF THEORETICAL BIOLOGY, 2010, 265 (03) :287-296