MSLP: mRNA subcellular localization predictor based on machine learning techniques

Cited by: 9
Authors
Musleh, Saleh [1]
Islam, Mohammad Tariqul [2]
Qureshi, Rizwan [1]
Alajez, Nihad [3, 4]
Alam, Tanvir [1]
Affiliations
[1] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
[2] Southern Connecticut State Univ, Comp Sci Dept, New Haven, CT USA
[3] Hamad Bin Khalifa Univ, Qatar Biomed Res Inst QBRI, Translat Canc & Immun Ctr TC, Doha, Qatar
[4] Hamad Bin Khalifa Univ, Coll Hlth & Life Sci, Doha, Qatar
Keywords
RNA; mRNA; Machine learning; Sequence analysis; Localization prediction; Subcellular localization; NERVOUS-SYSTEM; RNALOCATE; SEQUENCES; RESOURCE
DOI
10.1186/s12859-023-05232-0
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Subcellular localization of messenger RNAs (mRNAs) plays a pivotal role in the regulation of gene expression, in cell migration, and in cellular adaptation. Experimental techniques for pinpointing the subcellular localization of mRNAs are laborious, time-consuming, and expensive. Therefore, in silico approaches for this purpose are attracting considerable attention in the RNA community. Methods: In this article, we propose MSLP, a machine learning-based method to predict the subcellular localization of mRNAs. We propose a novel combination of four types of features, representing k-mer composition, pseudo k-tuple nucleotide composition (PseKNC), physicochemical properties of nucleotides, and a 3D representation of sequences based on the Z-curve transformation, as input to machine learning algorithms. Results: Using the combination of the above-mentioned features, ensemble-based models achieved state-of-the-art results in mRNA subcellular localization prediction tasks on multiple benchmark datasets. We evaluated the performance of our method on ten subcellular locations: cytoplasm, nucleus, endoplasmic reticulum (ER), extracellular region (ExR), mitochondria, cytosol, pseudopodium, posterior, exosome, and ribosome. An ablation study showed k-mer and PseKNC features to be more dominant than the other features for predicting cytoplasm, nucleus, and ER localizations. On the other hand, physicochemical properties and Z-curve-based features contributed the most to ExR and mitochondria detection. SHAP-based analysis revealed the relative importance of the features, providing better insight into the proposed approach. Availability: We have implemented a Docker container and an API for end users to run their sequences on our model. The datasets, the API code, and the Docker container are shared with the community on GitHub at: https://github.com/smusleh/MSLP.
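As a rough illustration of two of the feature types named in the abstract, the following minimal sketch computes normalized k-mer frequencies and the cumulative Z-curve coordinates of an RNA sequence. It is not taken from the MSLP repository: the function names (`normalize`, `kmer_frequencies`, `z_curve`) are hypothetical, the Z-curve uses the standard textbook definition, and the actual MSLP feature extraction (including PseKNC and the physicochemical properties, which are omitted here) may differ.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"  # RNA alphabet; DNA input is mapped T -> U below


def normalize(seq):
    """Upper-case the sequence and convert DNA 'T' to RNA 'U' (illustrative helper)."""
    return seq.upper().replace("T", "U")


def kmer_frequencies(seq, k=2):
    """Return frequencies of all 4**k k-mers, in lexicographic order over ACGU.

    Overlapping windows are counted; windows containing symbols outside
    ACGU (e.g. 'N') are ignored in both the counts and the total.
    """
    seq = normalize(seq)
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in kmers)
    return [counts[m] / total if total else 0.0 for m in kmers]


def z_curve(seq):
    """Return the cumulative Z-curve as a list of (x, y, z) points.

    Standard definition, using running base counts up to each position:
      x = (A + G) - (C + U)   purine vs. pyrimidine
      y = (A + C) - (G + U)   amino vs. keto
      z = (A + U) - (G + C)   weak vs. strong hydrogen bonding
    Symbols outside ACGU leave the counts unchanged.
    """
    seq = normalize(seq)
    n = {base: 0 for base in ALPHABET}
    points = []
    for base in seq:
        if base in n:
            n[base] += 1
        x = (n["A"] + n["G"]) - (n["C"] + n["U"])
        y = (n["A"] + n["C"]) - (n["G"] + n["U"])
        z = (n["A"] + n["U"]) - (n["G"] + n["C"])
        points.append((x, y, z))
    return points
```

In a pipeline of the kind the abstract describes, vectors like these would presumably be concatenated with the other feature types before being passed to a classifier; how MSLP summarizes the Z-curve into fixed-length features is not stated in this record.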
Pages: 23