An intelligent computational model for prediction of promoters and their strength via natural language processing

被引:16
作者
Tahir, Muhammad [1 ,2 ]
Hayat, Maqsood [1 ]
Gul, Sarah [4 ]
Chong, Kil To [2 ,3 ]
机构
[1] Abdul Wali Khan Univ, Dept Comp Sci, Mardan 23200, KP, Pakistan
[2] Chonbuk Natl Univ, Dept Elect & Informat Engn, Jeonju 54896, South Korea
[3] Chonbuk Natl Univ, Adv Elect & Informat Res Ctr, Jeonju 54896, South Korea
[4] Int Islamic Univ, Dept Biol Sci, FBAS, Islamabad, Pakistan
基金
新加坡国家研究基金会;
关键词
Promoters; Convolution neural network (CNN); Natural language processing; DNA; word2vec; SEQUENCE-BASED PREDICTOR; RECOMBINATION SPOTS; ENSEMBLE CLASSIFIER; PROTEIN TYPES; IDENTIFICATION; SITES; FEATURES; SPACE; DISCRIMINATION; TRINUCLEOTIDE;
D O I
10.1016/j.chemolab.2020.104034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In DNA, a promoter is an essential part of genes that controls the transcription of specific genes in a particular tissue or cells. The combination of RNA polymerase and a number of various proteins named "sigma-factors" can define the transcription start site (TSS) by inducing RNA holoenzyme. Further, Promoter is categorized into strong and weak promoters on the basis of promoter strength. Owing to exponential increase of RNA/DNA and protein samples in the post-genomic era, developing a simple and efficient sequential-based intelligent computational model for the discrimination of promoters is a challenging job. An intelligent computational model namely: 2L-iPSW(word2vec) was introduced for discrimination of promoters and their strength, in this regard. Machine learning and Deep learning algorithms in conjunction with natural language processing method i.e., "word2vec" are used. The proposed computational model 2L-iPSW(word2vec) achieved 91.42% of accuracy for 1st layer contains promoters and non-promoters which is 8.29% higher than the existing model, whereas 82.42% of accuracy for 2nd layer identifies strong promoter and weak promoter which is 11.22% advanced than the present model. Proposed 2L-iPSW(word2vec) model obtained efficient success rates than the present models in terms of all assessment metrics. It is thus greatly observed that the 2L-iPSW(word2vec) model will lead a useful tool for academic research on promoter identification.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Hidden Markov model and its application in natural language processing
    Gao, Xuexia
    Zhu, Nan
    Information Technology Journal, 2013, 12 (17) : 4256 - 4261
  • [42] Natural language processing analysis method of neural network model
    Zhuang, Wei
    2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 47 - 51
  • [43] Natural Language Processing approach to NLP Meta model automation
    Amirhosseini, Mohammad Hossein
    Kazemian, Hassan B.
    Ouazzane, Karim
    Chandler, Chris
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 186 - 193
  • [44] A computational linguistic approach to natural language processing with applications to garden path sentences analysis
    Du Jia-li
    Yu Ping-fang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (09) : 61 - 75
  • [45] Natural Language Processing for Depression Prediction on Sina Weibo: Method Study and Analysis
    Zhang, Zhenwen
    Zhu, Jianghong
    Guo, Zhihua
    Zhang, Yu
    Li, Zepeng
    Hu, Bin
    JMIR MENTAL HEALTH, 2024, 11
  • [46] Manufacturing process encoding through natural language processing for prediction of material properties
    Costa, Ana P. O.
    Seabra, Mariana R. R.
    de Sa, Jose M. A. Cesar
    Santos, Abel D.
    COMPUTATIONAL MATERIALS SCIENCE, 2024, 237
  • [47] A Natural Language Processing Model for the Development of an Italian-Language Chatbot for Public Administration
    Piizzi, Antonio
    Vavallo, Donatello
    Lazzo, Gaetano
    Dimola, Saverio
    Zazzera, Elvira
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (09) : 54 - 58
  • [48] Classification and Prediction of Breast Cancer Data derived Using Natural Language Processing
    Rani, Johanna Johnsi G.
    Gladis, Dennis
    Mammen, Joy
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 250 - 255
  • [49] A Multi-stage Approach to Facilitate Interaction with Intelligent Environments via Natural Language
    Stefanidi, Zinovia
    Leonidis, Asterios
    Antona, Margherita
    HCI INTERNATIONAL 2019 - LATE BREAKING POSTERS, HCII 2019, 2019, 1088 : 67 - 77
  • [50] Natural Language Processing Enabled Cognitive Disease Prediction Model for Varied Medical Records Implemented over ML Techniques
    Kamra, Vikas
    Kumar, Praveen
    Mohammadian, Masoud
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 494 - 498