Detecting coevolving positions in a molecule: why and how to account for phylogeny

被引:34
作者
Dutheil, Julien Y. [1 ]
机构
[1] Univ Montpellier 2, Inst Sci Evolut Montpellier ISEM, CNRS, Unite Mixte Rech UMII,UMR 5554, F-34095 Montpellier 05, France
关键词
coevolution; structure prediction; phylogeny; mutual information; ROC curves; AMINO-ACID SITES; MUTUAL INFORMATION; MAXIMUM-LIKELIHOOD; RESIDUES; RNA; COEVOLUTION; CONSTRAINTS; IDENTIFICATION; SUBSTITUTIONS; PREDICTION;
D O I
10.1093/bib/bbr048
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Positions in a molecule that share a common constraint do not evolve independently, and therefore leave a signature in the patterns of homologous sequences. Exhibiting such positions with a coevolution pattern from a sequence alignment has great potential for predicting functional and structural properties of molecules through comparative analysis. This task is complicated by the existence of additional correlation sources, leading to false predictions. The nature of the data is a major source of noise correlation: sequences are taken from individuals with different degrees of relatedness, and who therefore are intrinsically correlated. This has led to several method developments in different fields that are potentially confusing for non-expert users interested in these methodologies. It also explains why coevolution detection methods are largely unemployed despite the importance of the biological questions they address. In this article, I focus on the role of shared ancestry for understanding molecular coevolution patterns. I review and classify existing coevolution detection methods according to their ability to handle shared ancestry. Using a ribosomal RNA benchmark data set, for which detailed knowledge of the structure and coevolution patterns is available, I demonstrate and explain why taking the underlying evolutionary history of sequences into account is the only way to extract the full coevolution signal in the data. I also evaluate, using rigorous statistical procedures, the best approaches to do so, and discuss several important biological aspects to consider when performing coevolution analyses.
引用
收藏
页码:228 / 243
页数:16
相关论文
共 40 条
[1]   COORDINATED AMINO-ACID CHANGES IN HOMOLOGOUS PROTEIN FAMILIES [J].
ALTSCHUH, D ;
VERNET, T ;
BERTI, P ;
MORAS, D ;
NAGAI, K .
PROTEIN ENGINEERING, 1988, 2 (03) :193-199
[2]   CORRELATION OF COORDINATED AMINO-ACID SUBSTITUTIONS WITH FUNCTION IN VIRUSES RELATED TO TOBACCO MOSAIC-VIRUS [J].
ALTSCHUH, D ;
LESK, AM ;
BLOOMER, AC ;
KLUG, A .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) :693-707
[3]   Correlations among amino acid sites in bHLH protein domains: An information theoretic analysis [J].
Atchley, WR ;
Wollenberg, KR ;
Fitch, WM ;
Terhalle, W ;
Dress, AW .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) :164-178
[4]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[5]   The Comparative RNA Web (CRW) Site:: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs -: art. no. 2 [J].
Cannone, JJ ;
Subramanian, S ;
Schnare, MN ;
Collett, JR ;
D'Souza, LM ;
Du, YS ;
Feng, B ;
Lin, N ;
Madabusi, LV ;
Müller, KM ;
Pande, N ;
Shang, ZD ;
Yu, N ;
Gutell, RR .
BMC BIOINFORMATICS, 2002, 3 (1)
[6]   Detecting coevolution without phylogenetic trees? Tree-ignorant metrics of coevolution perform as well as tree-aware metrics [J].
Caporaso, J. Gregory ;
Smit, Sandra ;
Easton, Brett C. ;
Hunter, Lawrence ;
Huttley, Gavin A. ;
Knight, Rob .
BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
[7]   Detecting coevolving amino acid sites using Bayesian mutational mapping [J].
Dimmic, MW ;
Hubisz, MJ ;
Bustamante, CD ;
Nielsen, R .
BIOINFORMATICS, 2005, 21 :I126-I135
[8]   Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction [J].
Dunn, S. D. ;
Wahl, L. M. ;
Gloor, G. B. .
BIOINFORMATICS, 2008, 24 (03) :333-340
[9]   A model-based approach for detecting coevolving positions in a molecule [J].
Dutheil, J ;
Pupko, T ;
Jean-Marie, A ;
Galtier, N .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (09) :1919-1928
[10]   Detecting groups of coevolving positions in a molecule: a clustering approach [J].
Dutheil, Julien ;
Galtier, Nicolas .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (1)