Classification of likely functional class for ligand binding sites identified from fragment screening

被引:4
作者
Utges, Javier S. [1 ]
Macgowan, Stuart A. [1 ]
Ives, Callum M. [1 ,2 ,3 ]
Barton, Geoffrey J. [1 ]
机构
[1] Univ Dundee, Sch Life Sci, Div Computat Biol, Dundee, Scotland
[2] Maynooth Univ, Dept Chem, Maynooth, Ireland
[3] Maynooth Univ, Hamilton Inst, Maynooth, Ireland
基金
英国惠康基金; 英国生物技术与生命科学研究理事会;
关键词
PROTEIN SECONDARY STRUCTURE; SEQUENCE ALIGNMENT; CRYSTAL-STRUCTURE; ALAS2; GENE; HELICASE; LEAD; SIMULATIONS; INFERENCE; ANEMIA; SYSTEM;
D O I
10.1038/s42003-024-05970-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Fragment screening is used to identify binding sites and leads in drug discovery, but it is often unclear which binding sites are functionally important. Here, data from 37 experiments, and 1309 protein structures binding to 1601 ligands were analysed. A method to group ligands by binding sites is introduced and sites clustered according to profiles of relative solvent accessibility. This identified 293 unique ligand binding sites, grouped into four clusters (C1-4). C1 includes larger, buried, conserved, and population missense-depleted sites, enriched in known functional sites. C4 comprises smaller, accessible, divergent, missense-enriched sites, depleted in functional sites. A site in C1 is 28 times more likely to be functional than one in C4. Seventeen sites, which to the best of our knowledge are novel, in 13 proteins are identified as likely to be functionally important with examples from human tenascin and 5-aminolevulinate synthase highlighted. A multi-layer perceptron, and K-nearest neighbours model are presented to predict cluster labels for ligand binding sites with an accuracy of 96% and 100%, respectively, so allowing functional classification of sites for proteins not in this set. Our findings will be of interest to those studying protein-ligand interactions and developing new drugs or function modulators. The authors classify 293 ligand binding sites from 37 proteins into four groups by their solvent accessibility profile. The groups differ in evolutionary conservation and functional enrichment, and can be predicted by a machine learning model.
引用
收藏
页数:13
相关论文
共 102 条
[1]   MECHANISM AND STEREOCHEMISTRY OF ENZYMIC REACTIONS INVOLVED IN PORPHYRIN BIOSYNTHESIS [J].
AKHTAR, M ;
ABBOUD, MM ;
BARNARD, G ;
JORDAN, P ;
ZAMAN, Z .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 1976, 273 (924) :117-136
[2]   Molecular Simulations with Solvent Competition Quantify Water Displaceability and Provide Accurate Interaction Maps of Protein Binding Sites [J].
Alvarez-Garcia, Daniel ;
Barril, Xavier .
JOURNAL OF MEDICINAL CHEMISTRY, 2014, 57 (20) :8530-8539
[3]   Human aminolevulinate synthase structure reveals a eukaryotic-specific autoinhibitory loop regulating substrate binding and product release [J].
Bailey, Henry J. ;
Bezerra, Gustavo A. ;
Marcero, Jason R. ;
Padhi, Siladitya ;
Foster, William R. ;
Rembeza, Elzbieta ;
Roy, Arijit ;
Bishop, David F. ;
Desnick, Robert J. ;
Bulusu, Gopalakrishnan ;
Dailey, Harry A., Jr. ;
Yue, Wyatt W. .
NATURE COMMUNICATIONS, 2020, 11 (01)
[4]  
Barton G.J., 1993, OC - A cluster analysis program
[5]   UniProt: a worldwide hub of protein knowledge [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Alpi, Emanuele ;
Bely, Benoit ;
Bingley, Mark ;
Britto, Ramona ;
Bursteinas, Borisas ;
Busiello, Gianluca ;
Bye-A-Jee, Hema ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Daniel ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Ignatchenko, Alexandr ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Nightingale, Andrew ;
Onwubiko, Joseph ;
Palka, Barbara ;
Pichler, Klemens ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Renaux, Alexandre ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Vasudev, Preethi ;
Volynkin, Vladimir ;
Wardell, Tony .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D506-D515
[6]   PROTAC targeted protein degraders: the past is prologue [J].
Bekes, Miklos ;
Langley, David R. ;
Crews, Craig M. .
NATURE REVIEWS DRUG DISCOVERY, 2022, 21 (03) :181-200
[7]   Tenascin-C in fibrosis in multiple organs: Translational implications [J].
Bhattacharyya, Swati ;
Midwood, Kim S. ;
Varga, John .
SEMINARS IN CELL & DEVELOPMENTAL BIOLOGY, 2022, 128 :130-136
[8]  
Boutet E, 2016, METHODS MOL BIOL, V1374, P23, DOI 10.1007/978-1-4939-3167-5_2
[10]   Protein Data Bank: the single global archive for 3D macromolecular structure data [J].
Burley, Stephen K. ;
Berman, Helen M. ;
Bhikadiya, Charmi ;
Bi, Chunxiao ;
Chen, Li ;
Di Costanzo, Luigi ;
Christie, Cole ;
Duarte, Jose M. ;
Dutta, Shuchismita ;
Feng, Zukang ;
Ghosh, Sutapa ;
Goodsell, David S. ;
Green, Rachel Kramer ;
Guranovic, Vladimir ;
Guzenko, Dmytro ;
Hudson, Brian P. ;
Liang, Yuhe ;
Lowe, Robert ;
Peisach, Ezra ;
Periskova, Irina ;
Randle, Chris ;
Rose, Alexander ;
Sekharan, Monica ;
Shao, Chenghua ;
Tao, Yi-Ping ;
Valasatava, Yana ;
Voigt, Maria ;
Westbrook, John ;
Young, Jasmine ;
Zardecki, Christine ;
Zhuravleva, Marina ;
Kurisu, Genji ;
Nakamura, Haruki ;
Kengaku, Yumiko ;
Cho, Hasumi ;
Sato, Junko ;
Kim, Ju Yaen ;
Ikegawa, Yasuyo ;
Nakagawa, Atsushi ;
Yamashita, Reiko ;
Kudou, Takahiro ;
Bekker, Gert-Jan ;
Suzuki, Hirofumi ;
Iwata, Takeshi ;
Yokochi, Masashi ;
Kobayashi, Naohiro ;
Fujiwara, Toshimichi ;
Velankar, Sameer ;
Kleywegt, Gerard J. ;
Anyango, Stephen .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D520-D528