De novo assembly of nucleotide sequences in a compressed feature space

被引:0
作者
Tapinos, Avraam [1 ]
Robertson, David L. [1 ,2 ]
机构
[1] Univ Manchester, Sch Biol Sci, Evolut & Genom Sci, Manchester, Lancs, England
[2] Univ Glasgow, MRC, Ctr Virus Res, Garscube Campus, Glasgow, Lanark, Scotland
来源
2017 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB) | 2017年
基金
欧盟地平线“2020”; 英国生物技术与生命科学研究理事会;
关键词
High-throughput sequencing; de novo assembly; data transformation; data compression; DFT; DWT; compressive genomics; READ ALIGNMENT; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequencing technologies allow for an in-depth analysis of biological species but the size of the generated datasets introduce a number of analytical challenges. Recently, we demonstrated the application of numerical sequence representations and data transformations for the alignment of short reads to a reference genome. Here, we expand out approach for de novo assembly of short reads. Our results demonstrate that highly compressed data can encapsulate the signal sufficiently to accurately assemble reads to big contigs or complete genomes.
引用
收藏
页码:234 / 240
页数:7
相关论文
共 42 条
[1]  
[Anonymous], ARTNODES
[2]  
[Anonymous], 2003, TIME SERIES FEATURE
[3]  
[Anonymous], 2006, CAMBRIDGE SERIES STA
[4]  
[Anonymous], 1993, PROC 4 INT C FODO
[5]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[6]  
Beckmann N., 1990, R TREE EFFICIENT ROB, V19
[7]  
Bozkaya T., C P, V26, P357
[8]  
Braun S., 2008, Discover Signal Processing
[9]  
Briggs WL, 1995, The DFT: An Owner's Manual for the Discrete Fourier Transform
[10]  
Bundy A., 1984, CATALOGUE ARTICIAL I, P13