TooT-T: discrimination of transport proteins from non-transport proteins

被引:7
作者
Alballa, Munira [1 ]
Butler, Gregory [1 ,2 ]
机构
[1] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
[2] Concordia Univ, Ctr Struct & Funct Genom, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Transporter prediction; Ensemble learning; Amino acid composition; PREDICTION;
D O I
10.1186/s12859-019-3311-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Membrane transport proteins (transporters) play an essential role in every living cell by transporting hydrophilic molecules across the hydrophobic membranes. While the sequences of many membrane proteins are known, their structure and function is still not well characterized and understood, owing to the immense effort needed to characterize them. Therefore, there is a need for advanced computational techniques takes sequence information alone to distinguish membrane transporter proteins; this can then be used to direct new experiments and give a hint about the function of a protein. Results This work proposes an ensemble classifier TooT-T that is trained to optimally combine the predictions from homology annotation transfer and machine-learning methods to determine the final prediction. Experimental results obtained by cross-validation and independent testing show that combining the two approaches is more beneficial than employing only one. Conclusion The proposed model outperforms all of the state-of-the-art methods that rely on the protein sequence alone, with respect to accuracy and MCC. TooT-T achieved an overall accuracy of 90.07% and 92.22% and an MCC 0.80 and 0.82 with the training and independent datasets, respectively.
引用
收藏
页数:10
相关论文
共 24 条
  • [1] Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P457
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] [Anonymous], 1994, NIPS
  • [4] [Anonymous], TECHNICAL REPORT
  • [5] [Anonymous], 2011, [No title captured]
  • [6] Aplop F., 2017, ARPN J ENG APPL SCI, V12, P317
  • [7] Transferring functional annotations of membrane transporters on the basis of sequence similarity and sequence motifs
    Barghash, Ahmad
    Helms, Volkhard
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [8] Bekkar M., 2013, J Inf Eng Appl, V3, P27
  • [9] Prediction of protein cellular attributes using pseudo-amino acid composition
    Chou, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03): : 246 - 255
  • [10] PREDICTION OF PROTEIN ANTIGENIC DETERMINANTS FROM AMINO-ACID-SEQUENCES
    HOPP, TP
    WOODS, KR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1981, 78 (06): : 3824 - 3828