A deep learning model for predicting optimal distance range in crosslinking mass spectrometry data

被引:1
作者
Cohen, Shon [1 ]
Schneidman-Duhovny, Dina [1 ]
机构
[1] Hebrew Univ Jerusalem, Rachel & Selim Benin Sch Comp Sci & Engn, Jerusalem, Israel
基金
美国国家卫生研究院;
关键词
crosslinking mass spectrometry; protein structure; protein-protein interactions; deep learning; PROTEIN-STRUCTURE; INTEGRATIVE STRUCTURE; PROTEOMICS; ARCHITECTURE; RESTRAINTS; COMPLEXES; MECHANISM; SOFTWARE; KINASE;
D O I
10.1002/pmic.202200341
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Macromolecular assemblies play an important role in all cellular processes. While there has recently been significant progress in protein structure prediction based on deep learning, large protein complexes cannot be predicted with these approaches. The integrative structure modeling approach characterizes multi-subunit complexes by computational integration of data from fast and accessible experimental techniques. Crosslinking mass spectrometry is one such technique that provides spatial information about the proximity of crosslinked residues. One of the challenges in interpreting crosslinking datasets is designing a scoring function that, given a structure, can quantify how well it fits the data. Most approaches set an upper bound on the distance between C alpha atoms of crosslinked residues and calculate a fraction of satisfied crosslinks. However, the distance spanned by the crosslinker greatly depends on the neighborhood of the crosslinked residues. Here, we design a deep learning model for predicting the optimal distance range for a crosslinked residue pair based on the structures of their neighborhoods. We find that our model can predict the distance range with the area under the receiver-operator curve of 0.86 and 0.7 for intra- and inter-protein crosslinks, respectively. Our deep scoring function can be used in a range of structure modeling applications.
引用
收藏
页数:11
相关论文
共 59 条
  • [1] Accurate prediction of protein structures and interactions using a three-track neural network
    Baek, Minkyung
    DiMaio, Frank
    Anishchenko, Ivan
    Dauparas, Justas
    Ovchinnikov, Sergey
    Lee, Gyu Rie
    Wang, Jue
    Cong, Qian
    Kinch, Lisa N.
    Schaeffer, R. Dustin
    Millan, Claudia
    Park, Hahnbeom
    Adams, Carson
    Glassman, Caleb R.
    DeGiovanni, Andy
    Pereira, Jose H.
    Rodrigues, Andria V.
    van Dijk, Alberdina A.
    Ebrecht, Ana C.
    Opperman, Diederik J.
    Sagmeister, Theo
    Buhlheller, Christoph
    Pavkov-Keller, Tea
    Rathinaswamy, Manoj K.
    Dalwadi, Udit
    Yip, Calvin K.
    Burke, John E.
    Garcia, K. Christopher
    Grishin, Nick V.
    Adams, Paul D.
    Read, Randy J.
    Baker, David
    [J]. SCIENCE, 2021, 373 (6557) : 871 - +
  • [2] Serum Albumin Domain Structures in Human Blood Serum by Mass Spectrometry and Computational Biology
    Belsom, Adam
    Schneider, Michael
    Fischer, Lutz
    Brock, Oliver
    Rappsilber, Juri
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2016, 15 (03) : 1105 - 1116
  • [3] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [4] Brody S., 2021, ARXIV
  • [5] Improved prediction of protein-protein interactions using AlphaFold2
    Bryant, P.
    Pozzati, G.
    Elofsson, A.
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [6] The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
    Bullock, Joshua Matthew Allen
    Schwab, Jannik
    Thalassinos, Konstantinos
    Topf, Maya
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2016, 15 (07) : 2491 - 2500
  • [7] Systems structural biology measurements by in vivo cross-linking with mass spectrometry
    Chavez, Juan D.
    Mohr, Jared P.
    Mathay, Martin
    Zhong, Xuefei
    Keller, Andrew
    Bruce, James E.
    [J]. NATURE PROTOCOLS, 2019, 14 (08) : 2318 - 2343
  • [8] VALIDATION OF THE GENERAL-PURPOSE TRIPOS 5.2 FORCE-FIELD
    CLARK, M
    CRAMER, RD
    VANOPDENBOSCH, N
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 1989, 10 (08) : 982 - 1012
  • [9] Increasing the Depth of Mass-Spectrometry-Based Structural Analysis of Protein Complexes through the Use of Multiple Cross-Linkers
    Ding, Yue-He
    Fan, Sheng-Bo
    Li, Shuang
    Feng, Bo-Ya
    Gao, Ning
    Ye, Keqiong
    He, Si-Min
    Dong, Meng-Qiu
    [J]. ANALYTICAL CHEMISTRY, 2016, 88 (08) : 4461 - 4469
  • [10] Evans R., 2022, BIORXIV, DOI [10.2021/10.04.463034, DOI 10.2021/10.04.463034]