iTIS-PseTNC: A sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition

被引:251
作者
Chen, Wei [1 ,2 ]
Feng, Peng-Mian [3 ]
Deng, En-Ze [4 ]
Lin, Hao [2 ,4 ]
Chou, Kuo-Chen [1 ,2 ,5 ]
机构
[1] Hebei United Univ, Dept Phys, Ctr Genom & Computat Biol, Sch Sci, Tangshan 063000, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Hebei United Univ, Sch Publ Hlth, Tangshan 063000, Peoples R China
[4] Univ Elect Sci & Technol China, Ctr Bioinformat, Key Lab Neuroinformat, Minist Educ,Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[5] King Abdulaziz Univ, Ctr Excellence Genom Med Res, Jeddah 21589, Saudi Arabia
关键词
Translation initiation site; Pseudo trinucleotide composition; Physicochemical properties; Support vector machine; Web server; iTIS-PseTNC; AMINO-ACID-COMPOSITION; MEMBRANE-PROTEIN TYPES; SUBCELLULAR LOCATION PREDICTION; SUPPORT VECTOR MACHINES; PHYSICOCHEMICAL PROPERTIES; STRUCTURAL CLASS; SIGNAL PEPTIDES; CLEAVAGE SITES; PSEAAC; ATTRIBUTES;
D O I
10.1016/j.ab.2014.06.022
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Translation is a key process for gene expression. Timely identification of the translation initiation site (TIS) is very important for conducting in-depth genome analysis. With the avalanche of genome sequences generated in the postgenomic age, it is highly desirable to develop automated methods for rapidly and effectively identifying TIS. Although some computational methods were proposed in this regard, none of them considered the global or long-range sequence-order effects of DNA, and hence their prediction quality was limited. To count this kind of effects, a new predictor, called "iTIS-PseTNC," was developed by incorporating the physicochemical properties into the pseudo trinucleotide composition, quite similar to the PseAAC (pseudo amino acid composition) approach widely used in computational proteomics. It was observed by the rigorous cross-validation test on the benchmark dataset that the overall success rate achieved by the new predictor in identifying TIS locations was over 97%. As a web server, iTIS-PseTNC is freely accessible at http://lin.uestc.edu.cn/server/iTIS-PseTNC. To maximize the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web server to obtain the desired results without the need to go through detailed mathematical equations, which are presented in this paper just for the integrity of the new prection method. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:76 / 83
页数:8
相关论文
共 72 条
[1]   Support vector machines for predicting membrane protein types by using functional domain composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
BIOPHYSICAL JOURNAL, 2003, 84 (05) :3257-3263
[2]   propy: a tool to generate various modes of Chou's PseAAC [J].
Cao, Dong-Sheng ;
Xu, Qing-Song ;
Liang, Yi-Zeng .
BIOINFORMATICS, 2013, 29 (07) :960-962
[3]   iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
BIOMED RESEARCH INTERNATIONAL, 2014, 2014
[4]   iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e68
[5]   Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition [J].
Chen, Yen-Kuang ;
Li, Kuo-Bin .
JOURNAL OF THEORETICAL BIOLOGY, 2013, 318 :1-12
[6]  
Chou K., 2010, Nat. Sci, V2, P1090, DOI DOI 10.4236/NS.2010.210136
[7]  
Chou K.C., 2009, Nat Sci, V1, P63
[8]   Prediction of G-protein-coupled receptor classes [J].
Chou, KC .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (04) :1413-1418
[9]   A CORRELATION-COEFFICIENT METHOD TO PREDICTING PROTEIN-STRUCTURAL CLASSES FROM AMINO-ACID COMPOSITIONS [J].
CHOU, KC ;
ZHANG, CT .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1992, 207 (02) :429-433
[10]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19