Structure-guided approach for detecting large domain inserts in protein sequences as illustrated using the haloacid dehalogenase superfamily

被引:3
|
作者
Pandya, Chetanya [1 ]
Dunaway-Mariano, Debra [2 ]
Xia, Yu [3 ]
Allen, Karen N. [1 ,4 ]
机构
[1] Boston Univ, Bioinformat Grad Program, Boston, MA 02215 USA
[2] Univ New Mexico, Dept Chem & Chem Biol, Albuquerque, NM 87131 USA
[3] McGill Univ, Fac Engn, Dept Bioengn, Montreal, PQ H3A 0C3, Canada
[4] Boston Univ, Dept Chem, Boston, MA 02215 USA
基金
美国国家科学基金会;
关键词
sequence analysis; domain boundary prediction; structure function relationship; protein evolution; BIOCHEMICAL FUNCTION; CRYSTAL-STRUCTURES; HAD SUPERFAMILY; PHOSPHATASE; DIVERGENCE; EVOLUTION; GENERATION; PREDICTION; DIVERSITY; PACKAGE;
D O I
10.1002/prot.24543
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In multi-domain proteins, the domains typically run end-to-end, that is, one domain follows the C-terminus of another domain. However, approximately 10% of multi-domain proteins are formed by insertion of one domain sequence into that of another domain. Detecting such insertions within protein sequences is a fundamental challenge in structural biology. The haloacid dehalogenase superfamily (HADSF) serve as a challenging model system wherein a variable cap domain (similar to 5-200 residues in length) accessorizes the ubiquitous Rossmann-fold core domain, with variations in insertion site and topology corresponding to different classes of cap types. Herein, we describe a comprehensive computational strategy, CapPredictor, for determining large, variable domain insertions in protein sequences. Using a novel sequence-alignment algorithm in conjunction with a structure-guided sequence profile from 154 core-domain-only structures, more than 40,000 HADSF member sequences were assigned cap types. The resulting data set afforded insight into HADSF evolution. Notably, a similar distribution of cap-type classes across different phyla was observed, indicating that all cap types existed in the last universal common ancestor. In addition, comparative analyses of the predicted cap-type and functional assignments showed that different cap types carry out similar chemistries. Thus, while cap domains play a role in substrate recognition and chemical reactivity, cap-type does not strictly define functional class. Through this example, we have shown that CapPredictor is an effective new tool for the study of form and function in protein families where domain insertion occurs.
引用
收藏
页码:1896 / 1906
页数:11
相关论文
共 1 条
  • [1] Structure-Guided Approach to Identify Potential Inhibitors of Large Envelope Protein to Prevent Hepatitis B Virus Infection
    Mehmankhah, Mahboubeh
    Bhat, Ruchika
    Anvar, Mohammad Sabery
    Ali, Shahnawaz
    Alam, Aftab
    Farooqui, Anam
    Amir, Fatima
    Anwer, Ayesha
    Khan, Saniya
    Azmi, Iqbal
    Ali, Rafat
    Ishrat, Romana
    Hassan, Imtaiyaz
    Minuchehr, Zarrin
    Kazim, Syed Naqui
    OXIDATIVE MEDICINE AND CELLULAR LONGEVITY, 2019, 2019