Conotoxin Prediction: New Features to Increase Prediction Accuracy

被引:3
|
作者
Monroe, Lyman K. [1 ]
Truong, Duc P. [2 ]
Miner, Jacob C. [1 ]
Adikari, Samantha H. [1 ]
Sasiene, Zachary J. [1 ]
Fenimore, Paul W. [2 ]
Alexandrov, Boian [2 ]
Williams, Robert F. [1 ]
Nguyen, Hau B. [1 ]
机构
[1] Los Alamos Natl Lab, Biosci Div, MS M888, Los Alamos, NM 87545 USA
[2] Los Alamos Natl Lab, Theoret Div, MS M888, Los Alamos, NM 87545 USA
关键词
conotoxins; machine learning; collisional cross section; post-translational modifications; prediction; ion mobility-mass spectrometry; PROTEIN SECONDARY STRUCTURE; CHANNELS; PDB2PQR; TOXINS; SMOTE;
D O I
10.3390/toxins15110641
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Conotoxins are toxic, disulfide-bond-rich peptides from cone snail venom that target a wide range of receptors and ion channels with multiple pathophysiological effects. Conotoxins have extraordinary potential for medical therapeutics that include cancer, microbial infections, epilepsy, autoimmune diseases, neurological conditions, and cardiovascular disorders. Despite the potential for these compounds in novel therapeutic treatment development, the process of identifying and characterizing the toxicities of conotoxins is difficult, costly, and time-consuming. This challenge requires a series of diverse, complex, and labor-intensive biological, toxicological, and analytical techniques for effective characterization. While recent attempts, using machine learning based solely on primary amino acid sequences to predict biological toxins (e.g., conotoxins and animal venoms), have improved toxin identification, these methods are limited due to peptide conformational flexibility and the high frequency of cysteines present in toxin sequences. This results in an enumerable set of disulfide-bridged foldamers with different conformations of the same primary amino acid sequence that affect function and toxicity levels. Consequently, a given peptide may be toxic when its cysteine residues form a particular disulfide-bond pattern, while alternative bonding patterns (isoforms) or its reduced form (free cysteines with no disulfide bridges) may have little or no toxicological effects. Similarly, the same disulfide-bond pattern may be possible for other peptide sequences and result in different conformations that all exhibit varying toxicities to the same receptor or to different receptors. We present here new features, when combined with primary sequence features to train machine learning algorithms to predict conotoxins, that significantly increase prediction accuracy.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A structural characterization of shortcut features for prediction
    David Bellamy
    Miguel A. Hernán
    Andrew Beam
    European Journal of Epidemiology, 2022, 37 : 563 - 568
  • [22] Features selection and prediction for IoT attacks
    Su, Jingyi
    He, Shan
    Wu, Yan
    HIGH-CONFIDENCE COMPUTING, 2022, 2 (02):
  • [23] A structural characterization of shortcut features for prediction
    Bellamy, David
    Hernan, Miguel A.
    Beam, Andrew
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2022, 37 (06) : 563 - 568
  • [24] Linguistic features for review helpfulness prediction
    Krishnamoorthy, Srikumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (07) : 3751 - 3759
  • [25] Prediction Analysis of Novel Random Forest Algorithm and K Nearest Neighbor Algorithm in Heart Disease Prediction with an Improved Accuracy Rate
    Poojitha, T.
    Mahaveerakannan, R.
    CARDIOMETRY, 2022, (25): : 1554 - 1561
  • [26] Optimization of blood glucose prediction with LSTM-XGBoost fusion and integration of statistical features for enhanced accuracy
    Mazgouti, Loubna
    Laamiri, Nacira
    Ben Ali, Jaouher
    El Idrissi, Najiba E. L. Amrani
    Di Costanzo, Veronique
    Naeck, Roomila
    Ginoux, Jean-Mark
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [27] Reducing Features to Improve Bug Prediction
    Shivaji, Shivkumar
    Whitehead, E. James, Jr.
    Akella, Ram
    Kim, Sunghun
    2009 IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, PROCEEDINGS, 2009, : 600 - 604
  • [28] Prediction of paroxysmal atrial fibrillation using new heart rate variability features
    Parsi, Ashkan
    Glavin, Martin
    Jones, Edward
    Byrne, Dallan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 133
  • [29] Fold Prediction Problem: The Application of New Physical and Physicochemical-Based Features
    Dehzangi, Abdollah
    Phon-Amnuaisuk, Somnuk
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (02) : 174 - 185
  • [30] Prediction of serial perpetrator residence: Part II-Evaluation of prediction model accuracy
    Spaulding, Jamie S.
    Morris, Keith B.
    JOURNAL OF INVESTIGATIVE PSYCHOLOGY AND OFFENDER PROFILING, 2023, 20 (01) : 97 - 118