Classifying metal-binding sites with neural networks

被引:2
|
作者
Oostrom, Marjolein [1 ]
Akers, Sarah [1 ]
Garrett, Noah [2 ]
Hanson, Emma [2 ]
Shaw, Wendy [2 ,3 ]
Laureanti, Joseph A. [2 ,3 ]
机构
[1] Pacific Northwest Natl Lab, Natl Secur Directorate, Richland, WA USA
[2] Pacific Northwest Natl Lab, Phys & Computat Sci Directorate, Richland, WA USA
[3] Pacific Northwest Natl Lab, Phys & Computat Sci, Richland, WA 99352 USA
关键词
amino acids; convolutional neural network; image classification; iron-sulfur; metal-binding sites; metalloenzyme; Rieske; CATALYSIS; PROTEINS; ENZYMES;
D O I
10.1002/pro.4591
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To advance our ability to predict impacts of the protein scaffold on catalysis, robust classification schemes to define features of proteins that will influence reactivity are needed. One of these features is a protein's metal-binding ability, as metals are critical to catalytic conversion by metalloenzymes. As a step toward realizing this goal, we used convolutional neural networks (CNNs) to enable the classification of a metal cofactor binding pocket within a protein scaffold. CNNs enable images to be classified based on multiple levels of detail in the image, from edges and corners to entire objects, and can provide rapid classification. First, six CNN models were fine-tuned to classify the 20 standard amino acids to choose a performant model for amino acid classification. This model was then trained in two parallel efforts: to classify a 2D image of the environment within a given radius of the central metal binding site, either an Fe ion or a [2Fe-2S] cofactor, with the metal visible (effort 1) or the metal hidden (effort 2). We further used two sub-classifications of the [2Fe-2S] cofactor: (1) a standard [2Fe-2S] cofactor and (2) a Rieske [2Fe-2S] cofactor. The accuracy for the model correctly identifying all three defined features was >95%, despite our perception of the increased challenge of the metalloenzyme identification. This demonstrates that machine learning methodology to classify and distinguish similar metal-binding sites, even in the absence of a visible cofactor, is indeed possible and offers an additional tool for metal-binding site identification in proteins.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] METAL-BINDING SITES IN PROTEINS
    TAINER, JA
    ROBERTS, VA
    GETZOFF, ED
    CURRENT OPINION IN BIOTECHNOLOGY, 1991, 2 (04) : 582 - 591
  • [2] Alternative metal-binding sites in rubrerythrin
    Sieker, LC
    Holmes, M
    Le Trong, I
    Turley, S
    Santarsiero, BD
    Liu, MY
    LeGall, J
    Stenkamp, RE
    NATURE STRUCTURAL BIOLOGY, 1999, 6 (04): : 308 - 309
  • [3] Alternative metal-binding sites in rubrerythrin
    L.C. Sieker
    M. Holmes
    I. Le Trong
    S. Turley
    B.D. Santarsiero
    M.-Y. Liu
    J. LeGall
    R.E. Stenkamp
    Nature Structural Biology, 1999, 6 (4) : 308 - 309
  • [4] THE DESIGN OF METAL-BINDING SITES IN PROTEINS
    REGAN, L
    ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 1993, 22 : 257 - 281
  • [5] Engineering metal-binding sites in proteins
    Lu, Y
    Valentine, JS
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (04) : 495 - 500
  • [6] The metal-binding sites of glycose phosphates
    Gilg, Kathrin
    Mayer, Tobias
    Ghaschghaie, Natascha
    Kluefers, Peter
    DALTON TRANSACTIONS, 2009, (38) : 7934 - 7945
  • [7] FULVIC-ACIDS - STRUCTURE AND METAL-BINDING .2. PREDOMINANT METAL-BINDING SITES
    MURRAY, K
    LINDER, PW
    JOURNAL OF SOIL SCIENCE, 1984, 35 (02): : 217 - 222
  • [8] STRUCTURE AND FUNCTION OF METAL-BINDING SITES IN PROTEINS
    BRANDEN, CI
    HOPPE-SEYLERS ZEITSCHRIFT FUR PHYSIOLOGISCHE CHEMIE, 1979, 360 (09): : 1131 - 1131
  • [9] CHEMISTRY OF METAL-BINDING SITES IN MUTANT THIOREDOXINS
    CARADONNA, JP
    HELLINGA, HW
    RICHARDS, FM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1992, 203 : 717 - INOR
  • [10] THE PREDICTION AND CHARACTERIZATION OF METAL-BINDING SITES IN PROTEINS
    GREGORY, DS
    MARTIN, ACR
    CHEETHAM, JC
    REES, AR
    PROTEIN ENGINEERING, 1993, 6 (01): : 29 - 35