Decoy Selection in Protein Structure Determination via Symmetric Non-negative Matrix Factorization

被引:5
|
作者
Kabir, Kazi Lutful [1 ]
Chennupati, Gopinath [2 ]
Vangara, Raviteja [3 ]
Djidjev, Hristo [2 ]
Alexandrov, Boian S. [4 ]
Shehu, Amarda [1 ]
机构
[1] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[2] Los Alamos Natl Lab, Informat Sci CCS Grp 3, Los Alamos, NM USA
[3] Los Alamos Natl Lab, Fluid Dynam & Solid Mech T3, Los Alamos, NM USA
[4] Los Alamos Natl Lab, Phys & Chem Mat T1, Los Alamos, NM USA
基金
美国国家科学基金会;
关键词
decoy selection; eigen-gap heuristic; graph clustering; protein structure determination; symmetric NMF;
D O I
10.1109/BIBM49941.2020.9313299
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The so-called dark proteome, referring to regions of the protein universe that remain inaccessible by either wet-or dry-laboratory methods, continues to spur computational research in protein structure determination. An outstanding challenge relates to the ability to discriminate relevant tertiary structure(s) among many structures, also referred to as decoys, that are computed for a protein of interest. The problem is known as decoy selection. While prime for investigation as an inference problem, the decoy datasets generated in silico are sparse and highly imbalanced towards the negative class (irrelevant structures). These characteristics continue to challenge both supervised and unsupervised learning approaches to this problem. In this paper, we propose a novel decoy selection method based on symmetric non-negative matrix factorization in a graph clustering setting. The method is evaluated on two datasets, a benchmark dataset of ensembles of decoys for a varied list of protein molecules, and a dataset of decoy ensembles for targets drawn from the recent CASP competitions. The evaluation demonstrates that the proposed method outperforms several state-of-the-art decoy selection methods. This performance, as well as the method's computational expediency, suggest that the proposed method advances the state of the art in decoy selection and, in particular, our the ability to tackle inherent challenges related to imbalanced datasets.
引用
收藏
页码:23 / 28
页数:6
相关论文
共 50 条
  • [21] Algorithms for non-negative matrix factorization
    Lee, DD
    Seung, HS
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 556 - 562
  • [22] Non-negative Matrix Factorization for EEG
    Jahan, Ibrahim Salem
    Snasel, Vaclav
    2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 183 - 187
  • [23] Non-Negative Matrix Factorization with Constraints
    Liu, Haifeng
    Wu, Zhaohui
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 506 - 511
  • [24] Dropout non-negative matrix factorization
    He, Zhicheng
    Liu, Jie
    Liu, Caihua
    Wang, Yuan
    Yin, Airu
    Huang, Yalou
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 781 - 806
  • [25] Non-negative matrix factorization with α-divergence
    Cichocki, Andrzej
    Lee, Hyekyoung
    Kim, Yong-Deok
    Choi, Seungjin
    PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1433 - 1440
  • [26] Uniqueness of non-negative matrix factorization
    Laurberg, Hans
    2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 44 - 48
  • [27] Stretched non-negative matrix factorization
    Gu, Ran
    Rakita, Yevgeny
    Lan, Ling
    Thatcher, Zach
    Kamm, Gabrielle E.
    O'Nolan, Daniel
    Mcbride, Brennan
    Wustrow, Allison
    Neilson, James R.
    Chapman, Karena W.
    Du, Qiang
    Billinge, Simon J. L.
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [28] Non-negative Matrix Factorization on Manifold
    Cai, Deng
    He, Xiaofei
    Wu, Xiaoyun
    Han, Jiawei
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 63 - +
  • [29] Non-negative Matrix Factorization on GPU
    Platos, Jan
    Gajdos, Petr
    Kroemer, Pavel
    Snasel, Vaclav
    NETWORKED DIGITAL TECHNOLOGIES, PT 1, 2010, 87 : 21 - 30
  • [30] Bayesian Non-negative Matrix Factorization
    Schmidt, Mikkel N.
    Winther, Ole
    Hansen, Lars Kai
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 540 - +