ASRpro: A machine-learning computational model for identifying proteins associated with multiple abiotic stress in plants

被引:6
|
作者
Meher, Prabina Kumar [1 ]
Sahu, Tanmaya Kumar [2 ]
Gupta, Ajit [1 ]
Kumar, Anuj [3 ,4 ]
Rustgi, Sachin [5 ]
机构
[1] ICAR Indian Agr Stat Res Inst, New Delhi, India
[2] ICAR Natl Bur Plant Genet Resources, New Delhi, India
[3] Dalhousie Univ, Dept Microbiol & Immunol, Halifax, NS, Canada
[4] Shantou Univ Med Coll, Lab Immun, Shantou, Peoples R China
[5] Clemson Univ, Pee Dee Res & Educ Ctr, Dept Plant & Environm Sci, Florence, SC 29506 USA
来源
PLANT GENOME | 2024年 / 17卷 / 01期
关键词
COVARIANCE TRANSFORMATION; GENES; PREDICTION; CLASSIFICATION; RESPONSES; ARABIDOPSIS; SELECTION; SITES;
D O I
10.1002/tpg2.20259
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
One of the thrust areas of research in plant breeding is to develop crop cultivars with enhanced tolerance to abiotic stresses. Thus, identifying abiotic stress-responsive genes (SRGs) and proteins is important for plant breeding research. However, identifying such genes via established genetic approaches is laborious and resource intensive. Although transcriptome profiling has remained a reliable method of SRG identification, it is species specific. Additionally, identifying multistress responsive genes using gene expression studies is cumbersome. Thus, endorsing the need to develop a computational method for identifying the genes associated with different abiotic stresses. In this work, we aimed to develop a computational model for identifying genes responsive to six abiotic stresses: cold, drought, heat, light, oxidative, and salt. The predictions were performed using support vector machine (SVM), random forest, adaptive boosting (ADB), and extreme gradient boosting (XGB), where the autocross covariance (ACC) and K-mer compositional features were used as input. With ACC, K-mer, and ACC + K-mer compositional features, the overall accuracy of similar to 60-77, similar to 75-86, and similar to 61-78% were respectively obtained using the SVM algorithm with fivefold cross-validation. The SVM also achieved higher accuracy than the other three algorithms. The proposed model was also assessed with an independent dataset and obtained an accuracy consistent with cross-validation. The proposed model is the first of its kind and is expected to serve the requirement of experimental biologists; however, the prediction accuracy was modest. Given its importance for the research community, the online prediction application, ASRpro, is made freely available (https://iasri-sg.icar.gov.in/asrpro/) for predicting abiotic SRGs and proteins.
引用
收藏
页数:13
相关论文
共 15 条
  • [1] ASPTF: A computational tool to predict abiotic stress-responsive transcription factors in plants by employing machine learning algorithms
    Pradhan, Upendra Kumar
    Mahapatra, Anuradha
    Naha, Sanchita
    Gupta, Ajit
    Parsad, Rajender
    Gahlaut, Vijay
    Rath, Surya Narayan
    Meher, Prabina Kumar
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2024, 1868 (06):
  • [2] ASmiR: a machine learning framework for prediction of abiotic stress-specific miRNAs in plants
    Pradhan, Upendra Kumar
    Meher, Prabina Kumar
    Naha, Sanchita
    Rao, Atmakuri Ramakrishna
    Kumar, Upendra
    Pal, Soumen
    Gupta, Ajit
    FUNCTIONAL & INTEGRATIVE GENOMICS, 2023, 23 (02)
  • [3] Comparative analysis of machine learning based classification for abiotic stress proteins
    Ahmed, Bulbul
    Rai, Anil
    Iquebal, Mir Asif
    Jaiswal, Sarika
    INDIAN JOURNAL OF AGRICULTURAL SCIENCES, 2021, 91 (06): : 861 - 866
  • [4] Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia
    Cheng, Yujing
    Yang, Xin
    Wang, Ying
    Li, Qi
    Chen, Wanlu
    Dai, Run
    Zhang, Chan
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [5] Machine Learning Model for Identifying Antioxidant Proteins Using Features Calculated from Primary Sequences
    Ho Thanh Lam, Luu
    Le, Ngoc Hoang
    Van Tuan, Le
    Tran Ban, Ho
    Nguyen Khanh Hung, Truong
    Nguyen, Ngan Thi Kim
    Huu Dang, Luong
    Le, Nguyen Quoc Khanh
    BIOLOGY-BASEL, 2020, 9 (10): : 1 - 13
  • [6] Early detection of abiotic stress in plants through SNARE proteins using hybrid feature fusion model
    Bhargavi, T.
    Sumathi, D.
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [7] Machine-learning based prediction model for acute kidney injury induced by multiple wasp stings
    Wu, Wen
    Zhang, Yupei
    Zhang, Yilan
    Qu, Xingguang
    Zhang, Zhaohui
    Zhang, Rong
    TOXICON, 2024, 250
  • [8] ApicoAP: The First Computational Model for Identifying Apicoplast-Targeted Proteins in Multiple Species of Apicomplexa
    Cilingir, Gokcen
    Broschat, Shira L.
    Lau, Audrey O. T.
    PLOS ONE, 2012, 7 (05):
  • [9] Development of multiple machine-learning computational techniques for optimization of heterogenous catalytic biodiesel production from waste vegetable oil
    Abdelbasset, Walid Kamal
    Elkholi, Safaa M.
    Opulencia, Maria Jade Catalan
    Diana, Tazeddinova
    Su, Chia-Hung
    Alashwal, May
    Zwawi, Mohammed
    Algarni, Mohammed
    Abdelrahman, Anas
    Hoang Chinh Nguyen
    ARABIAN JOURNAL OF CHEMISTRY, 2022, 15 (06)
  • [10] Transcriptome signature for multiple biotic and abiotic stress in barley ( Hordeum vulgare L.) identifies using machine learning approach
    Panahi, Bahman
    CURRENT PLANT BIOLOGY, 2024, 40