A statistical physics perspective on alignment-independent protein sequence comparison

被引:13
作者
Chattopadhyay, Amit K. [1 ]
Nasiev, Diar [1 ]
Flower, Darren R. [2 ]
机构
[1] Aston Univ, Nonlinear & Complex Res Grp, Sch Engn & Appl Sci, Birmingham B4 7ET, W Midlands, England
[2] Aston Univ, Sch Life & Hlth Sci, Birmingham B4 7ET, W Midlands, England
关键词
ACID SUBSTITUTION MATRICES; PREDICTION; PERSISTENCE; DESCRIPTORS; PERFORMANCE; DATABASE; TIME; SETS;
D O I
10.1093/bioinformatics/btv167
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Within bioinformatics, the textual alignment of amino acid sequences has long dominated the determination of similarity between proteins, with all that implies for shared structure, function and evolutionary descent. Despite the relative success of modern-day sequence alignment algorithms, so-called alignment-free approaches offer a complementary means of determining and expressing similarity, with potential benefits in certain key applications, such as regression analysis of protein structure-function studies, where alignment-base similarity has performed poorly. Results: Here, we offer a fresh, statistical physics-based perspective focusing on the question of alignment-free comparison, in the process adapting results from 'first passage probability distribution' to summarize statistics of ensemble averaged amino acid propensity values. In this article, we introduce and elaborate this approach.
引用
收藏
页码:2469 / 2474
页数:6
相关论文
共 42 条
[1]   AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE [J].
ALTSCHUL, SF .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) :555-565
[3]   Persistence and first-passage properties in nonequilibrium systems [J].
Bray, Alan J. ;
Majumdar, Satya N. ;
Schehr, Gregory .
ADVANCES IN PHYSICS, 2013, 62 (03) :225-361
[4]   Contact time periods in immunological synapse [J].
Bush, Daniel R. ;
Chattopadhyay, Amit K. .
PHYSICAL REVIEW E, 2014, 90 (04)
[5]   Close contact fluctuations: The seeding of signalling domains in the immunological synapse [J].
Chattopadhyay, Amit K. ;
Burroughs, Nigel J. .
EPL, 2007, 77 (04)
[6]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[7]   Proteomic applications of automated GPCR classification [J].
Davies, Matthew N. ;
Gloriam, David E. ;
Secker, Andrew ;
Freitas, Alexa A. ;
Mendao, Miguel ;
Timmis, Jon ;
Flower, Darren R. .
PROTEOMICS, 2007, 7 (16) :2800-2814
[8]  
Dayhoff M., 1978, Atlas of protein sequence and structure, V5, P345
[9]   Statistical comparison of established T-cell epitope predictors against a large database of human and murine antigens [J].
Deavin, AJ ;
Auton, TR ;
Greaney, PJ .
MOLECULAR IMMUNOLOGY, 1996, 33 (02) :145-155
[10]   EXACT FIRST-PASSAGE EXPONENTS OF 1D DOMAIN GROWTH - RELATION TO A REACTION-DIFFUSION MODEL [J].
DERRIDA, B ;
HAKIM, V ;
PASQUIER, V .
PHYSICAL REVIEW LETTERS, 1995, 75 (04) :751-754