Decoy Selection in Protein Structure Determination via Symmetric Non-negative Matrix Factorization

被引:5
|
作者
Kabir, Kazi Lutful [1 ]
Chennupati, Gopinath [2 ]
Vangara, Raviteja [3 ]
Djidjev, Hristo [2 ]
Alexandrov, Boian S. [4 ]
Shehu, Amarda [1 ]
机构
[1] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[2] Los Alamos Natl Lab, Informat Sci CCS Grp 3, Los Alamos, NM USA
[3] Los Alamos Natl Lab, Fluid Dynam & Solid Mech T3, Los Alamos, NM USA
[4] Los Alamos Natl Lab, Phys & Chem Mat T1, Los Alamos, NM USA
基金
美国国家科学基金会;
关键词
decoy selection; eigen-gap heuristic; graph clustering; protein structure determination; symmetric NMF;
D O I
10.1109/BIBM49941.2020.9313299
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The so-called dark proteome, referring to regions of the protein universe that remain inaccessible by either wet-or dry-laboratory methods, continues to spur computational research in protein structure determination. An outstanding challenge relates to the ability to discriminate relevant tertiary structure(s) among many structures, also referred to as decoys, that are computed for a protein of interest. The problem is known as decoy selection. While prime for investigation as an inference problem, the decoy datasets generated in silico are sparse and highly imbalanced towards the negative class (irrelevant structures). These characteristics continue to challenge both supervised and unsupervised learning approaches to this problem. In this paper, we propose a novel decoy selection method based on symmetric non-negative matrix factorization in a graph clustering setting. The method is evaluated on two datasets, a benchmark dataset of ensembles of decoys for a varied list of protein molecules, and a dataset of decoy ensembles for targets drawn from the recent CASP competitions. The evaluation demonstrates that the proposed method outperforms several state-of-the-art decoy selection methods. This performance, as well as the method's computational expediency, suggest that the proposed method advances the state of the art in decoy selection and, in particular, our the ability to tackle inherent challenges related to imbalanced datasets.
引用
收藏
页码:23 / 28
页数:6
相关论文
共 50 条
  • [31] On affine non-negative matrix factorization
    Laurberg, Hans
    Hansen, Lars Kai
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 653 - +
  • [32] Multi-label feature selection via similarity constraints with non-negative matrix factorization
    He, Zhuoxin
    Lin, Yaojin
    Lin, Zilong
    Wang, Chenxi
    KNOWLEDGE-BASED SYSTEMS, 2024, 297
  • [33] Non-Negative Matrix Factorization and Term Structure of Interest Rates
    Takada, Hellinton H.
    Stern, Julio M.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING (MAXENT 2014), 2015, 1641 : 369 - 377
  • [34] Structure preserving non-negative matrix factorization for dimensionality reduction
    Li, Zechao
    Liu, Jing
    Lu, Hanqing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (09) : 1175 - 1189
  • [35] Automatic Model Order Selection for Convolutive Non-Negative Matrix Factorization
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    Jia, Chong
    Zou, Xia
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (10) : 1867 - 1870
  • [36] A Hybrid Algorithm for Non-negative Matrix Factorization Based on Symmetric Information Divergence
    Devarajan, Karthik
    Ebrahimi, Nader
    Soofi, Ehsan
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1658 - 1664
  • [37] Symmetric Non-negative Matrix Factorization for Analyzing the Scientific Production on Day Surgery
    Grassia, Maria Gabriella
    Marino, Marina
    Mazza, Rocco
    Stavolo, Agostino
    Massa, Salvatore
    NEW FRONTIERS IN TEXTUAL DATA ANALYSIS, JADT 2022, 2024, : 65 - 76
  • [38] A Two-Phase Algorithm for Robust Symmetric Non-Negative Matrix Factorization
    Li, Bingjie
    Shi, Xi
    Zhang, Zhenyue
    SYMMETRY-BASEL, 2021, 13 (09):
  • [39] Structure Selection for Convolutive Non-negative Matrix Factorization Using Normalized Maximum Likelihood Coding
    Suzuki, Atsushi
    Miyaguchi, Kohei
    Yamanishi, Kenji
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1221 - 1226
  • [40] Constrained Symmetric Non-Negative Matrix Factorization with Deep Autoencoders for Community Detection
    Zhang, Wei
    Yu, Shanshan
    Wang, Ling
    Guo, Wei
    Leung, Man-Fai
    MATHEMATICS, 2024, 12 (10)