Various approaches have explored the covariation of residues in multiple-sequence alignments of homologous proteins to extract functional and structural information. Among those are principal component analysis (PCA), which identifies the most correlated groups of residues, and direct coupling analysis (DCA), a global inference method based on the maximum entropy principle, which aims at predicting residue-residue contacts. In this paper, inspired by the statistical physics of disordered systems, we introduce the Hopfield-Potts model to naturally interpolate between these two approaches. The Hopfield-Potts model allows us to identify relevant 'patterns' of residues from the knowledge of the eigenmodes and eigenvalues of the residue-residue correlation matrix. We show how the computation of such statistical patterns makes it possible to accurately predict residue-residue contacts with a much smaller number of parameters than DCA. This dimensional reduction allows us to avoid overfitting and to extract contact information from multiple-sequence alignments of reduced size. In addition, we show that low-eigenvalue correlation modes, discarded by PCA, are important to recover structural information: the corresponding patterns are highly localized, that is, they are concentrated in few sites, which we find to be in close contact in the three-dimensional protein fold.
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Ashkenazy, Haim
Erez, Elana
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Erez, Elana
Martz, Eric
论文数: 0引用数: 0
h-index: 0
机构:
Univ Massachusetts, Dept Microbiol, Amherst, MA 01003 USATel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Martz, Eric
Pupko, Tal
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Pupko, Tal
Ben-Tal, Nir
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
机构:
Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Balakrishnan, Sivaraman
Kamisetty, Hetunandan
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Kamisetty, Hetunandan
Carbonell, Jaime G.
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Lane Ctr Computat Biol, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Carbonell, Jaime G.
Lee, Su-In
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
Univ Washington, Dept Genome Sci, Seattle, WA 98195 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Lee, Su-In
Langmead, Christopher James
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Lane Ctr Computat Biol, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
机构:
Rutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USARutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Berman, Helen M.
Kleywegt, Gerard J.
论文数: 0引用数: 0
h-index: 0
机构:
European Bioinformat Inst, Prot Data Bank Europe, Cambridge CB10 1SD, EnglandRutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Kleywegt, Gerard J.
Nakamura, Haruki
论文数: 0引用数: 0
h-index: 0
机构:
Osaka Univ, Inst Prot Res, Prot Data Bank Japan, Suita, Osaka 5650871, JapanRutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Nakamura, Haruki
Markley, John L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, BioMagResBank, Madison, WI 53706 USARutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Ashkenazy, Haim
Erez, Elana
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Erez, Elana
Martz, Eric
论文数: 0引用数: 0
h-index: 0
机构:
Univ Massachusetts, Dept Microbiol, Amherst, MA 01003 USATel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Martz, Eric
Pupko, Tal
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
Pupko, Tal
Ben-Tal, Nir
论文数: 0引用数: 0
h-index: 0
机构:
Tel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, IsraelTel Aviv Univ, George S Wise Fac Life Sci, Dept Biochem & Mol Biol, IL-69978 Tel Aviv, Israel
机构:
Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Balakrishnan, Sivaraman
Kamisetty, Hetunandan
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Kamisetty, Hetunandan
Carbonell, Jaime G.
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Lane Ctr Computat Biol, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Carbonell, Jaime G.
Lee, Su-In
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
Univ Washington, Dept Genome Sci, Seattle, WA 98195 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Lee, Su-In
Langmead, Christopher James
论文数: 0引用数: 0
h-index: 0
机构:
Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
Carnegie Mellon Univ, Lane Ctr Computat Biol, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
机构:
Rutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USARutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Berman, Helen M.
Kleywegt, Gerard J.
论文数: 0引用数: 0
h-index: 0
机构:
European Bioinformat Inst, Prot Data Bank Europe, Cambridge CB10 1SD, EnglandRutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Kleywegt, Gerard J.
Nakamura, Haruki
论文数: 0引用数: 0
h-index: 0
机构:
Osaka Univ, Inst Prot Res, Prot Data Bank Japan, Suita, Osaka 5650871, JapanRutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA
Nakamura, Haruki
Markley, John L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, BioMagResBank, Madison, WI 53706 USARutgers State Univ, Dept Chem & Chem Biol, RCSB PDB, Piscataway, NJ 08854 USA