Decoy Selection in Protein Structure Determination via Symmetric Non-negative Matrix Factorization

被引:5
|
作者
Kabir, Kazi Lutful [1 ]
Chennupati, Gopinath [2 ]
Vangara, Raviteja [3 ]
Djidjev, Hristo [2 ]
Alexandrov, Boian S. [4 ]
Shehu, Amarda [1 ]
机构
[1] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[2] Los Alamos Natl Lab, Informat Sci CCS Grp 3, Los Alamos, NM USA
[3] Los Alamos Natl Lab, Fluid Dynam & Solid Mech T3, Los Alamos, NM USA
[4] Los Alamos Natl Lab, Phys & Chem Mat T1, Los Alamos, NM USA
基金
美国国家科学基金会;
关键词
decoy selection; eigen-gap heuristic; graph clustering; protein structure determination; symmetric NMF;
D O I
10.1109/BIBM49941.2020.9313299
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The so-called dark proteome, referring to regions of the protein universe that remain inaccessible by either wet-or dry-laboratory methods, continues to spur computational research in protein structure determination. An outstanding challenge relates to the ability to discriminate relevant tertiary structure(s) among many structures, also referred to as decoys, that are computed for a protein of interest. The problem is known as decoy selection. While prime for investigation as an inference problem, the decoy datasets generated in silico are sparse and highly imbalanced towards the negative class (irrelevant structures). These characteristics continue to challenge both supervised and unsupervised learning approaches to this problem. In this paper, we propose a novel decoy selection method based on symmetric non-negative matrix factorization in a graph clustering setting. The method is evaluated on two datasets, a benchmark dataset of ensembles of decoys for a varied list of protein molecules, and a dataset of decoy ensembles for targets drawn from the recent CASP competitions. The evaluation demonstrates that the proposed method outperforms several state-of-the-art decoy selection methods. This performance, as well as the method's computational expediency, suggest that the proposed method advances the state of the art in decoy selection and, in particular, our the ability to tackle inherent challenges related to imbalanced datasets.
引用
收藏
页码:23 / 28
页数:6
相关论文
共 50 条
  • [1] Improved Protein Decoy Selection via Non-Negative Matrix Factorization
    Akhter, Nasrin
    Kabir, Kazi Lutful
    Chennupati, Gopinath
    Vangara, Raviteja
    Alexandrov, Boian
    Djidjev, Hristo N.
    Shehu, Amarda
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1670 - 1682
  • [2] Determination of the Number of Clusters by Symmetric Non-Negative Matrix Factorization
    Vangara, Raviteja
    Rasmussen, Kim O.
    Chennupati, Gopinath
    Alexandrov, Boian S.
    BIG DATA III: LEARNING, ANALYTICS, AND APPLICATIONS, 2021, 11730
  • [3] Rank selection for non-negative matrix factorization
    Cai, Yun
    Gu, Hong
    Kenney, Toby
    STATISTICS IN MEDICINE, 2023, 42 (30) : 5676 - 5693
  • [4] Non-negative Matrix Factorization with Symmetric Manifold Regularization
    Yang, Shangming
    Liu, Yongguo
    Li, Qiaoqin
    Yang, Wen
    Zhang, Yi
    Wen, Chuanbiao
    NEURAL PROCESSING LETTERS, 2020, 51 (01) : 723 - 748
  • [5] Semisupervised Adaptive Symmetric Non-Negative Matrix Factorization
    Jia, Yuheng
    Liu, Hui
    Hou, Junhui
    Kwong, Sam
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (05) : 2550 - 2562
  • [6] Non-negative Matrix Factorization with Symmetric Manifold Regularization
    Shangming Yang
    Yongguo Liu
    Qiaoqin Li
    Wen Yang
    Yi Zhang
    Chuanbiao Wen
    Neural Processing Letters, 2020, 51 : 723 - 748
  • [7] On Rank Selection in Non-Negative Matrix Factorization Using Concordance
    Fogel, Paul
    Geissler, Christophe
    Morizet, Nicolas
    Luta, George
    MATHEMATICS, 2023, 11 (22)
  • [8] Non-Negative Matrix Factorization for Selection of Near-Native Protein Tertiary Structures
    Akhter, Nasrin
    Vangara, Raviteja
    Chennupati, Gopinath
    Alexandrov, Boian S.
    Djidjev, Hristo
    Shehu, Amarda
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 70 - 73
  • [9] Non-Negative Matrix Factorization Revisited: Uniqueness and Algorithm for Symmetric Decomposition
    Huang, Kejun
    Sidiropoulos, Nicholas D.
    Swami, Ananthram
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (01) : 211 - 224
  • [10] Overlapping Community Detection via Self-constrained Symmetric Non-negative Matrix Factorization
    Liu, Yu
    Wu, Bin
    Zhang, Yunlei
    Wang, Bai
    2016 INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC), 2016, : 42 - 47