Detection of long non-coding RNA homology, a comparative study on alignment and alignment-free metrics

Cited by: 30
Authors
Noviello, Teresa M. R. [1 ,2 ]
Di Liddo, Antonella [3 ]
Ventola, Giovanna M. [4 ]
Spagnuolo, Antonietta [5 ]
D'Aniello, Salvatore [5 ]
Ceccarelli, Michele [1 ,2 ]
Cerulo, Luigi [1 ,2 ]
Affiliations
[1] Univ Sannio, Dept Sci & Technol, Via Port Arsa 11, I-82100 Benevento, Italy
[2] Inst Genet Res Gaetano Salvatore, BioGeM, I-83031 Ariano Irpino, AV, Italy
[3] Goethe Univ, Buchmann Inst Mol Life Sci, Max von Laue Str 13, D-60438 Frankfurt, Germany
[4] Genomix4Life Srl, Via Salvador Allende, I-84081 Baronissi, SA, Italy
[5] Stn Zool A Dohrn, Dept Biol & Evolut Marine Organisms, I-80121 Naples, Italy
Source
BMC BIOINFORMATICS | 2018 / Vol. 19
Keywords
Long ncRNA; Homology; String similarity; DATABASE; EVOLUTION; GENOME; VERTEBRATE; LNCIPEDIA; SEQUENCES; DISEASE; UPDATE; MOUSE;
DOI
10.1186/s12859-018-2441-6
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Long non-coding RNAs (lncRNAs) represent a novel class of non-coding RNAs having a crucial role in many biological processes. The identification of long non-coding homologs among different species is essential to investigate such roles in model organisms, as homologous genes tend to retain similar molecular and biological functions. Alignment-based metrics are able to effectively capture the conservation of transcribed coding sequences and hence the homology of protein coding genes. However, unlike protein coding genes, the poor sequence conservation of long non-coding genes makes the identification of their homologs a challenging task.
Results: In this study we compare alignment-based and alignment-free string similarity metrics and look at promoter regions as a possible source of conserved information. We show that promoter regions encode relevant information for the conservation of long non-coding genes across species and that such information is better captured by alignment-free metrics. We perform a genome-wide test of this hypothesis in human, mouse, and zebrafish.
Conclusions: The obtained results persuaded us to postulate the new hypothesis that, unlike protein coding genes, long non-coding genes tend to preserve their regulatory machinery rather than their transcribed sequence. All datasets, scripts, and the prediction tools adopted in this study are available at https://github.com/bioinformatics-sannio/lncrna-homologs.
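To make the distinction between the two families of metrics concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a k-mer cosine similarity as one possible alignment-free metric and a plain Needleman-Wunsch global alignment score as one possible alignment-based metric. The function names, the scoring parameters, and the toy sequences are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq: str, k: int = 3) -> Counter:
    """Count overlapping k-mers in a sequence (alignment-free representation)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_cosine(a: str, b: str, k: int = 3) -> float:
    """Cosine similarity between the k-mer count vectors of two sequences."""
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    # Only k-mers present in both sequences contribute to the dot product.
    dot = sum(ca[w] * cb[w] for w in ca.keys() & cb.keys())
    norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def global_alignment_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -1) -> int:
    """Needleman-Wunsch global alignment score with a linear gap penalty."""
    a, b = a.upper(), b.upper()
    # prev holds the previous row of the dynamic-programming table.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    # Hypothetical toy sequences: y is x with its two segments swapped.
    x = "ACGTACGTTTGACCA"
    y = "TTGACCAACGTACGT"
    print("k-mer cosine similarity:", kmer_cosine(x, y))
    print("global alignment score: ", global_alignment_score(x, y))
```

The k-mer metric ignores where each word occurs, so rearranged segments still contribute to the similarity, whereas the alignment score depends on the order of the residues. Real analyses would use dedicated tools and tuned parameters rather than this sketch.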
Pages: 12