Motif-Aware PRALINE: Improving the alignment of motif regions

被引:7
|
作者
Dijkstra, Maurits [1 ]
Bawono, Punto [1 ]
Abeln, Sanne [1 ]
Feenstra, K. Anton [1 ]
Fokkink, Wan [1 ]
Heringa, Jaap [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
关键词
SEQUENCE ALIGNMENT; PARACOCCUS-DENITRIFICANS; MULTIPLE; DATABASE; REDUCTASE;
D O I
10.1371/journal.pcbi.1006547
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein or DNA motifs are sequence regions which possess biological importance. These regions are often highly conserved among homologous sequences. The generation of multiple sequence alignments (MSAs) with a correct alignment of the conserved sequence motifs is still difficult to achieve, due to the fact that the contribution of these typically short fragments is overshadowed by the rest of the sequence. Here we extended the PRALINE multiple sequence alignment program with a novel motif-aware MSA algorithm in order to address this shortcoming. This method can incorporate explicit information about the presence of externally provided sequence motifs, which is then used in the dynamic programming step by boosting the amino acid substitution matrix towards the motif. The strength of the boost is controlled by a parameter, a. Using a benchmark set of alignments we confirm that a good compromise can be found that improves the matching of motif regions while not significantly reducing the overall alignment quality. By estimating a on an unrelated set of reference alignments we find there is indeed a strong conservation signal for motifs. A number of typical but difficult MSA use cases are explored to exemplify the problems in correctly aligning functional sequence motifs and how the motif-aware alignment method can be employed to alleviate these problems.
引用
收藏
页数:19
相关论文
共 50 条
  • [11] EdMot: An Edge Enhancement Approach for Motif-aware Community Detection
    Li, Pei-Zhen
    Huang, Ling
    Wang, Chang-Dong
    Lai, Jian-Huang
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 479 - 487
  • [12] ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
    Lelieveld, Stefan H.
    Schutte, Judith
    Dijkstra, Maurits J. J.
    Bawono, Punto
    Kinston, Sarah J.
    Gottgens, Berthold
    Heringa, Jaap
    Bonzanni, Nicola
    NUCLEIC ACIDS RESEARCH, 2016, 44 (08) : e72
  • [13] Graph embedding based on motif-aware feature propagation for community detection
    Wu, Xunlian
    Zhang, Han
    Quan, Yining
    Miao, Qiguang
    Sun, Peng Gang
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 630
  • [14] Motif-Aware Riemannian Graph Neural Network with Generative-Contrastive Learning
    Sun, Li
    Huang, Zhenhao
    Wang, Zixi
    Wang, Feiyang
    Peng, Hao
    Yu, Philip
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9044 - 9052
  • [15] Motif-Aware miRNA-Disease Association Prediction via Hierarchical Attention Network
    Zhao, Bo-Wei
    He, Yi-Zhou
    Su, Xiao-Rui
    Yang, Yue
    Li, Guo-Dong
    Huang, Yu-An
    Hu, Peng-Wei
    You, Zhu-Hong
    Hu, Lun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (07) : 4281 - 4294
  • [16] Neural Temporal Walks: Motif-Aware Representation Learning on Continuous-Time Dynamic Graphs
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [17] Multi-purpose RNA language modelling with motif-aware pretraining and type-guided fine-tuning
    Wang, Ning
    Bian, Jiang
    Li, Yuchen
    Li, Xuhong
    Mumtaz, Shahid
    Kong, Linghe
    Xiong, Haoyi
    NATURE MACHINE INTELLIGENCE, 2024, 6 (05) : 548 - 557
  • [18] Observation of η-Al41Sm5 reveals motif-aware structural evolution in Al-Sm alloys
    Ye, Z.
    Meng, F.
    Zhang, F.
    Sun, Y.
    Yang, L.
    Zhou, S. H.
    Napolitano, R. E.
    Mendelev, M. I.
    Ott, R. T.
    Kramer, M. J.
    Wang, C. Z.
    Ho, K. M.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [19] Observation of η-Al41Sm5 reveals motif-aware structural evolution in Al-Sm alloys
    Z. Ye
    F. Meng
    F. Zhang
    Y. Sun
    L. Yang
    S. H. Zhou
    R. E. Napolitano
    M. I. Mendelev
    R. T. Ott
    M. J. Kramer
    C. Z. Wang
    K. M. Ho
    Scientific Reports, 9
  • [20] An Exploration Into Improving DNA Motif Inference by Looking for Highly Conserved Core Regions
    Thompson, Jeffrey A.
    Congdon, Clare Bates
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2013, : 60 - 67