Predicting sub-cellular localization of tRNA synthetases from their primary structures

被引:5
作者
Panwar, Bharat [1 ]
Raghava, G. P. S. [1 ]
机构
[1] Inst Microbial Technol CSIR, Bioinformat Ctr, Sect 39A, Chandigarh, India
关键词
Mitochondrial tRNA synthetase; Support vector machine; Prediction; MARSpred; SUPPORT VECTOR MACHINE; AMINO-ACID; PROTEIN; MITOCHONDRIAL; CLASSIFICATION; GENE; IMPORT; TOOL; SVM;
D O I
10.1007/s00726-011-0872-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Since endo-symbiotic events occur, all genes of mitochondrial aminoacyl tRNA synthetase (AARS) were lost or transferred from ancestral mitochondrial genome into the nucleus. The canonical pattern is that both cytosolic and mitochondrial AARSs coexist in the nuclear genome. In the present scenario all mitochondrial AARSs are nucleus-encoded, synthesized on cytosolic ribosomes and post-translationally imported from the cytosol into the mitochondria in eukaryotic cell. The site-based discrimination between similar types of enzymes is very challenging because they have almost same physico-chemical properties. It is very important to predict the sub-cellular location of AARSs, to understand the mitochondrial protein synthesis. We have analyzed and optimized the distinguishable patterns between cytosolic and mitochondrial AARSs. Firstly, support vector machines (SVM)-based modules have been developed using amino acid and dipeptide compositions and achieved Mathews correlation coefficient (MCC) of 0.82 and 0.73, respectively. Secondly, we have developed SVM modules using position-specific scoring matrix and achieved the maximum MCC of 0.78. Thirdly, we developed SVM modules using N-terminal, intermediate residues, C-terminal and split amino acid composition (SAAC) and achieved MCC of 0.82, 0.70, 0.39 and 0.86, respectively. Finally, a SVM module was developed using selected attributes of split amino acid composition (SA-SAAC) approach and achieved MCC of 0.92 with an accuracy of 96.00%. All modules were trained and tested on a non-redundant data set and evaluated using fivefold cross-validation technique. On the independent data sets, SA-SAAC based prediction model achieved MCC of 0.95 with an accuracy of 97.77%. The web-server 'MARSpred' based on above study is available at http://www.imtech.res.in/raghava/marspred/.
引用
收藏
页码:1703 / 1713
页数:11
相关论文
共 35 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The role of aminoacyl-tRNA synthetases in genetic diseases [J].
Antonellis, Anthony ;
Green, Eric D. .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2008, 9 :87-107
[3]   Mitochondrial protein-import machinery: correlating structure with function [J].
Baker, Michael J. ;
Frazier, Ann E. ;
Gulbis, Jacqueline M. ;
Ryan, Michael T. .
TRENDS IN CELL BIOLOGY, 2007, 17 (09) :456-464
[4]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[5]   SPECIFICITY IN PROTEIN SYNTHESIS [J].
BERG, P .
ANNUAL REVIEW OF BIOCHEMISTRY, 1961, 30 :293-&
[6]   GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors [J].
Bhasin, M ;
Raghava, GPS .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W143-W147
[7]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[8]   Origin and evolution of the mitochondrial aminoacyl-tRNA synthetases [J].
Brindefalk, Bjorn ;
Viklund, Johan ;
Larsson, Daniel ;
Thollesson, Mikael ;
Andersson, Siv G. E. .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (03) :743-756
[9]   Recent progress in protein subcellular location prediction [J].
Chou, Kuo-Chen ;
Shen, Hong-Bin .
ANALYTICAL BIOCHEMISTRY, 2007, 370 (01) :1-16
[10]   Computational method to predict mitochondrially imported proteins and their targeting sequences [J].
Claros, MG ;
Vincens, P .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1996, 241 (03) :779-786