Fast Computation and Applications of Genome Mappability

Cited by: 329
Authors
Derrien, Thomas [1 ]
Estelle, Jordi [2 ]
Marco Sola, Santiago [2 ]
Knowles, David G. [3 ]
Raineri, Emanuele [2 ]
Guigo, Roderic [3 ]
Ribeca, Paolo [2 ]
Affiliations
[1] Univ Rennes 1, Inst Genet & Dev IGDR, Rennes, France
[2] CNAG, Barcelona, Spain
[3] Univ Pompeu Fabra, CRG, Barcelona, Spain
Source
PLOS ONE | 2012 / Volume 7 / Issue 01
Keywords
RNA-SEQ; SEGMENTAL DUPLICATIONS; EVOLUTION; ELEMENTS; STRATEGY;
DOI
10.1371/journal.pone.0030377
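Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code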
07 ; 0710 ; 09 ;
Abstract
We present a fast mapping-based algorithm to compute the mappability of each region of a reference genome up to a specified number of mismatches. Knowing the mappability of a genome is crucial for the interpretation of massively parallel sequencing experiments. We investigate the properties of the mappability of eukaryotic DNA/RNA both as a whole and at the level of the gene family, providing for various organisms tracks which allow the mappability information to be visually explored. In addition, we show that mappability varies greatly between species and gene classes. Finally, we suggest several practical applications where mappability can be used to refine the analysis of high-throughput sequencing data (SNP calling, gene expression quantification and paired-end experiments). This work highlights mappability as an important concept which deserves to be taken into full account, in particular when massively parallel sequencing technologies are employed. The GEM mappability program belongs to the GEM (GEnome Multitool) suite of programs, which can be freely downloaded for any use from its website (http://gemlibrary.sourceforge.net).
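To make the notion of per-position mappability concrete, the following is a minimal sketch and not the GEM algorithm itself. It assumes a common working definition in which the mappability of a position is the inverse of the number of times the k-mer starting there occurs in the sequence. It also assumes exact matches only (zero mismatches) and the forward strand only; the function name `kmer_mappability` and the toy sequence are hypothetical.

```python
from collections import Counter


def kmer_mappability(genome: str, k: int) -> list[float]:
    """Return 1 / (occurrence count) of the k-mer starting at each position.

    Assumptions (not taken from the abstract): exact matching only,
    forward strand only, and a naive count over all k-mers in memory.
    """
    if k <= 0 or k > len(genome):
        return []
    # Enumerate every k-mer in left-to-right order.
    kmers = [genome[i:i + k] for i in range(len(genome) - k + 1)]
    # Count how often each distinct k-mer occurs in the sequence.
    counts = Counter(kmers)
    # A unique k-mer scores 1.0; a k-mer seen n times scores 1/n.
    return [1.0 / counts[kmer] for kmer in kmers]


# Toy example: "ACGT" occurs twice, every other 4-mer occurs once.
scores = kmer_mappability("ACGTACGTTT", 4)
```

This naive approach stores every k-mer and would not scale to a full eukaryotic genome, nor does it handle mismatches, which is the case the abstract says the mapping-based algorithm is designed for.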
Pages: 16