A Review of Recent Alignment-free Clustering Algorithms in Expressed Sequence Tag

被引:0
|
作者
Ng, Keng-Hoong [1 ]
Phon-Amnuaisuk, Somnuk [1 ]
Ho, Chin-Kuan [1 ]
机构
[1] Multimedia Univ, Fac Informat Technol, Cyberjaya 63100, Selangor, Malaysia
关键词
DISTANCE; EST; DISSIMILARITY;
D O I
10.1109/SoCPaR.2009.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Expressed sequence tags (ESTs) are short single pass sequence reads derived from cDNA libraries, they have been used for gene discovery, detection of splice variants, expression of genes and also transciptome analysis. Clustering of ESTs is a vital step before they can be processed further. Currently there are many EST clustering algorithms available. Basically they can be generalized into two broad approaches, i.e. alignment-based and alignment-free. The former approach is reliable but inefficient in terms of running time, while the latter approach is gaining popularity and currently under rapid development due to its faster speed and acceptable result. In this paper, we propose a taxonomy for sequence comparison algorithms and another taxonomy for EST clustering algorithms. In addition, we also highlight the peculiarities of recently introduced alignment-free EST clustering algorithms by focusing on their features, distance measures, advantages and disadvantages.
引用
收藏
页码:25 / 30
页数:6
相关论文
共 50 条
  • [1] Alignment-free sequence comparison - a review
    Vinga, S
    Almeida, J
    BIOINFORMATICS, 2003, 19 (04) : 513 - 523
  • [2] Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis
    Bonham-Carter, Oliver
    Steele, Joe
    Bastola, Dhundy
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (06) : 890 - 905
  • [3] Alignment-free method for DNA sequence clustering using Fuzzy integral similarity
    Saw, Ajay Kumar
    Raj, Garima
    Das, Manashi
    Talukdar, Narayan Chandra
    Tripathy, Binod Chandra
    Nandi, Soumyadeep
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [4] Multiple alignment-free sequence comparison
    Ren, Jie
    Song, Kai
    Sun, Fengzhu
    Deng, Minghua
    Reinert, Gesine
    BIOINFORMATICS, 2013, 29 (21) : 2690 - 2698
  • [5] Alignment-free method for DNA sequence clustering using Fuzzy integral similarity
    Ajay Kumar Saw
    Garima Raj
    Manashi Das
    Narayan Chandra Talukdar
    Binod Chandra Tripathy
    Soumyadeep Nandi
    Scientific Reports, 9
  • [6] An effective extension of the applicability of alignment-free biological sequence comparison algorithms with Hadoop
    Cattaneo, Giuseppe
    Petrillo, Umberto Ferraro
    Giancarlo, Raffaele
    Roscigno, Gianluca
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (04): : 1467 - 1483
  • [7] An effective extension of the applicability of alignment-free biological sequence comparison algorithms with Hadoop
    Giuseppe Cattaneo
    Umberto Ferraro Petrillo
    Raffaele Giancarlo
    Gianluca Roscigno
    The Journal of Supercomputing, 2017, 73 : 1467 - 1483
  • [8] Simplification of protein sequence and alignment-free sequence analysis
    Li Jing
    Li Feng-Bo
    Wang Wei
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2006, 33 (12) : 1215 - 1222
  • [9] A comprehensive approach to clustering of expressed human gene sequence: The sequence tag alignment and consensus knowledge base
    Miller, RT
    Christoffels, AG
    Gopalakrishnan, C
    Burke, J
    Ptitsyn, AA
    Broveak, TR
    Hide, WA
    GENOME RESEARCH, 1999, 9 (11) : 1143 - 1155
  • [10] Alignment-free viral sequence classification at scale
    Daniel J. van Zyl
    Marcel Dunaiski
    Houriiyah Tegally
    Cheryl Baxter
    Tulio de Oliveira
    Joicymara S. Xavier
    BMC Genomics, 26 (1)