Identification and classification of promoters using the attention mechanism based on long short-term memory

被引:10
|
作者
Li, Qingwen [1 ,2 ]
Zhang, Lichao [4 ]
Xu, Lei [5 ]
Zou, Quan [1 ]
Wu, Jin [6 ]
Li, Qingyuan [3 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[2] Chinese Acad Sci, Inst Biophys, State Key Lab Brain & Cognit Sci, Beijing 100101, Peoples R China
[3] Wuhan Acad Agr Sci, Forestry & Fruit Tree Res Inst, Wuhan 430075, Peoples R China
[4] Shenzhen Inst Informat Technol, Sch Intelligent Mfg & Equipment, Shenzhen 518172, Peoples R China
[5] Shenzhen Polytech, Sch Elect & Commun Engn, Shenzhen 518055, Peoples R China
[6] Shenzhen Polytech, Sch Management, Shenzhen 518055, Peoples R China
关键词
promoter; bioinformatics; natural language processing; attention mechanism; SEQUENCE-BASED PREDICTOR; RECOGNITION; SITES;
D O I
10.1007/s11704-021-0548-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A promoter is a short region of DNA that can bind RNA polymerase and initiate gene transcription. It is usually located directly upstream of the transcription initiation site. DNA promoters have been proven to be the main cause of many human diseases, especially diabetes, cancer or Huntington's disease. Therefore, the classification of promoters has become an interesting problem and has attracted the attention of many researchers in the field of bioinformatics. Various studies have been conducted in order to solve this problem, but their performance still needs further improvement. In this research, we segmented the DNA sequence in a k-mers manner, then trained the word vector model, inputted it into long short-term memory(LSTM) and used the attention mechanism to predict. Our method can achieve 93.45% and 90.59% cross-validation accuracy in the two layers, respectively. Our results are better than others based on the same data set, and provided some ideas for accurately predicting promoters. In addition, this research suggested that natural language processing can play a significant role in biological sequence prediction.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Identification and classification of promoters using the attention mechanism based on long short-term memory
    Qingwen LI
    Lichao ZHANG
    Lei XU
    Quan ZOU
    Jin WU
    Qingyuan LI
    Frontiers of Computer Science, 2022, 16 (04) : 105 - 111
  • [2] Identification and classification of promoters using the attention mechanism based on long short-term memory
    Qingwen Li
    Lichao Zhang
    Lei Xu
    Quan Zou
    Jin Wu
    Qingyuan Li
    Frontiers of Computer Science, 2022, 16
  • [3] Sentiment classification using attention mechanism and bidirectional long short-term memory network
    Wu, Peng
    Li, Xiaotong
    Ling, Chen
    Ding, Shengchun
    Shen, Si
    APPLIED SOFT COMPUTING, 2021, 112
  • [4] EEG-Based Emotion Classification Using Long Short-Term Memory Network with Attention Mechanism
    Kim, Youmin
    Choi, Ahyoung
    SENSORS, 2020, 20 (23) : 1 - 22
  • [5] Classification of tomato leaf disease using Transductive Long Short-Term Memory with an attention mechanism
    Chelladurai, Aarthi
    Manoj Kumar, D. P.
    Askar, S. S.
    Abouhawwash, Mohamed
    FRONTIERS IN PLANT SCIENCE, 2025, 15
  • [6] DGA Domain Name Classification Method Based on Long Short-Term Memory with Attention Mechanism
    Qiao, Yanchen
    Zhang, Bin
    Zhang, Weizhe
    Sangaiah, Arun Kumar
    Wu, Hualong
    APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [7] Research on Attention Classification Based on Long Short-term Memory Network
    Wang Pai
    Wu Fan
    Wang Mei
    Qin Xue-Bin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1148 - 1151
  • [8] A forecast model of short-term wind speed based on the attention mechanism and long short-term memory
    Xing, Wang
    Qi-liang, Wu
    Gui-rong, Tan
    Dai-li, Qian
    Ke, Zhou
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) : 45603 - 45623
  • [9] A forecast model of short-term wind speed based on the attention mechanism and long short-term memory
    Wang Xing
    Wu Qi-liang
    Tan Gui-rong
    Qian Dai-li
    Zhou Ke
    Multimedia Tools and Applications, 2024, 83 : 45603 - 45623
  • [10] MALICIOUS LOGIN DETECTION USING LONG SHORT-TERM MEMORY WITH AN ATTENTION MECHANISM
    Wu, Yanna
    Liu, Fucheng
    Wen, Yu
    ADVANCES IN DIGITAL FORENSICS XVII, 2021, 612 : 157 - 173