An intelligent computational model for prediction of promoters and their strength via natural language processing

被引:16
作者
Tahir, Muhammad [1 ,2 ]
Hayat, Maqsood [1 ]
Gul, Sarah [4 ]
Chong, Kil To [2 ,3 ]
机构
[1] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, KP, Pakistan
[2] Chonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[3] Chonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
[4] Int Islamic Univ, Dept Biol Sci, FBAS, Islamabad, Pakistan
基金
新加坡国家研究基金会;
关键词
Promoters; Convolution neural network (CNN); Natural language processing; DNA; word2vec; SEQUENCE-BASED PREDICTOR; RECOMBINATION SPOTS; ENSEMBLE CLASSIFIER; PROTEIN TYPES; IDENTIFICATION; SITES; FEATURES; SPACE; DISCRIMINATION; TRINUCLEOTIDE;
D O I
10.1016/j.chemolab.2020.104034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In DNA, a promoter is an essential part of genes that controls the transcription of specific genes in a particular tissue or cells. The combination of RNA polymerase and a number of various proteins named "sigma-factors" can define the transcription start site (TSS) by inducing RNA holoenzyme. Further, Promoter is categorized into strong and weak promoters on the basis of promoter strength. Owing to exponential increase of RNA/DNA and protein samples in the post-genomic era, developing a simple and efficient sequential-based intelligent computational model for the discrimination of promoters is a challenging job. An intelligent computational model namely: 2L-iPSW(word2vec) was introduced for discrimination of promoters and their strength, in this regard. Machine learning and Deep learning algorithms in conjunction with natural language processing method i.e., "word2vec" are used. The proposed computational model 2L-iPSW(word2vec) achieved 91.42% of accuracy for 1st layer contains promoters and non-promoters which is 8.29% higher than the existing model, whereas 82.42% of accuracy for 2nd layer identifies strong promoter and weak promoter which is 11.22% advanced than the present model. Proposed 2L-iPSW(word2vec) model obtained efficient success rates than the present models in terms of all assessment metrics. It is thus greatly observed that the 2L-iPSW(word2vec) model will lead a useful tool for academic research on promoter identification.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Natural language processing in law: Prediction of outcomes in the higher courts of Turkey
    Mumcuoglu, Emre
    Ozturk, Ceyhun E.
    Ozaktas, Haldun M.
    Koc, Aykut
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (05)
  • [32] Hybrid Natural Language Processing Model for Sentiment Analysis during Natural Crisis
    Horvat, Marko
    Gledec, Gordan
    Leontic, Fran
    ELECTRONICS, 2024, 13 (10)
  • [33] ELE-Intelligent Tutor: A computational parser for the processing of grammatical errors in Spanish as a Foreign Language
    Ferreira, Anita
    Kotz, Gabriela
    REVISTA SIGNOS, 2010, 43 (73): : 211 - 236
  • [34] NLPEI: A Novel Self-Interacting Protein Prediction Model Based on Natural Language Processing and Evolutionary Information
    Jia, Li-Na
    Yan, Xin
    You, Zhu-Hong
    Zhou, Xi
    Li, Li-Ping
    Wang, Lei
    Song, Ke-Jian
    EVOLUTIONARY BIOINFORMATICS, 2020, 16
  • [35] Intelligent compilation of patent summaries using machine learning and natural language processing techniques
    Trappey, Amy J. C.
    Trappey, Charles V.
    Wu, Jheng-Long
    Wang, Jack W. C.
    ADVANCED ENGINEERING INFORMATICS, 2020, 43
  • [36] Intelligent Scoring of English Composition by Machine Learning from the Perspective of Natural Language Processing
    Zhang, Dan
    Yuan, Xiaorong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [37] The Validation and Efficiency Testing of an Intelligent Tutoring System that Uses Natural Language Processing Technologies
    Dobre, Iuliana
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON VIRTUAL LEARNING, 2014, : 337 - 342
  • [38] Understanding customer satisfaction via deep learning and natural language processing
    Aldunate, Angeles
    Maldonado, Sebastian
    Vairetti, Carla
    Armelini, Guillermo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 209
  • [39] Extracting phenotypic information from the literature via natural language processing
    Chen, LF
    Friedman, C
    MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 758 - 762
  • [40] Natural Language Processing Model for Managing Maintenance Requests in Buildings
    Bouabdallaoui, Yassine
    Lafhaj, Zoubeir
    Yim, Pascal
    Ducoulombier, Laure
    Bennadji, Belkacem
    BUILDINGS, 2020, 10 (09)