Investigating the Performance of Oxford Nanopore Long-Read Sequencing with Respect to Illumina Microarrays and Short-Read Sequencing

被引:0
作者
Santos, Renato [1 ]
Lee, Hyunah [2 ]
Williams, Alexander [2 ]
Baffour-Kyei, Anastasia [2 ]
Lee, Sang-Hyuck [2 ]
Troakes, Claire [3 ]
Al-Chalabi, Ammar [3 ]
Breen, Gerome [2 ]
Iacoangeli, Alfredo [1 ,3 ,4 ,5 ]
机构
[1] Kings Coll London, Dept Biostat & Hlth Informat, Inst Psychiat Psychol & Neurosci, 16 De Crespigny Pk, London SE5 8AB, England
[2] Kings Coll London, Social Genet & Dev Psychiat Ctr, Inst Psychiat Psychol & Neurosci, 16 De Crespigny Pk, London SE5 8AB, England
[3] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Basic & Clin Neurosci, 5 Cutcombe Rd, London SE5 9RX, England
[4] Ground RR Block QE Med Ctr Ralph & Patricia Sarich, Perron Inst Neurol & Translat Sci, 8 Verdun St, Nedlands, WA 6009, Australia
[5] South London & Maudsley NHS Fdn Trust, NIHR Maudsley Biomed Res Ctr BRC, 16 De Crespigny Pk, London SE5 8AF, England
基金
英国经济与社会研究理事会; 英国医学研究理事会; 欧盟地平线“2020”; 欧洲研究理事会; 英国工程与自然科学研究理事会;
关键词
Oxford Nanopore Technologies; long-read sequencing; short-read sequencing; variant calling; benchmark; genomic variants; low-complexity regions; multiplexing; experimental variables; MEDICAL GENETICS; AMERICAN-COLLEGE; ASSOCIATION; GUIDELINES; STANDARDS; GENOMICS;
D O I
10.3390/ijms26104492
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Oxford Nanopore Technologies (ONT) long-read sequencing (LRS) has emerged as a promising genomic analysis tool, yet comprehensive benchmarks with established platforms across diverse datasets remain limited. This study aimed to benchmark LRS performance against Illumina short-read sequencing (SRS) and microarrays for variant detection across different genomic contexts and to evaluate the impact of experimental factors. We sequenced 14 human genomes using the three platforms and evaluated single nucleotide variants (SNVs), insertions/deletions (indels), and structural variants (SVs) detection, stratifying by high-complexity, low-complexity, and dark genome regions while assessing effects of multiplexing, depth, and read length. LRS SNV accuracy was slightly lower than that of SRS in high-complexity regions (F-measure: 0.954 vs. 0.967) but showed comparable sensitivity in low-complexity regions. LRS showed robust performance for small (1-5 bp) indels in high-complexity regions (F-measure: 0.869), but SRS agreement decreased significantly in low-complexity regions and for larger indel sizes. Within dark regions, LRS identified more indels than SRS, but showed lower base-level accuracy. LRS identified 2.86 times more SVs than SRS, excelling at detecting large variants (>6 kb), with SV detection improving with sequencing depth. Sequencing depth strongly influenced variant calling performance, whereas multiplexing effects were minimal. Our findings provide valuable insights for optimising LRS applications in genomic research and diagnostics.
引用
收藏
页数:31
相关论文
共 71 条
[31]   Comparative evaluation of SNVs, indels, and structural variations detected with short- and long-read sequencing data [J].
Kosugi, Shunichi ;
Terao, Chikashi .
HUMAN GENOME VARIATION, 2024, 11 (01)
[32]   Best practices for benchmarking germline small-variant calls in human genomes [J].
Krusche, Peter ;
Trigg, Len ;
Boutros, Paul C. ;
Mason, Christopher E. ;
De La Vega, Francisco M. ;
Moore, Benjamin L. ;
Gonzalez-Porta, Mar ;
Eberle, Michael A. ;
Tezak, Zivana ;
Lababidi, Samir ;
Truty, Rebecca ;
Asimenos, George ;
Funke, Birgit ;
Fleharty, Mark ;
Chapman, Brad A. ;
Salit, Marc ;
Zook, Justin M. .
NATURE BIOTECHNOLOGY, 2019, 37 (05) :555-+
[33]  
Labs E.M., Human Variation Workflow
[34]  
Labs E.M., Basecalling Workflow
[35]   DNAscan: personal computer compatible NGS analysis, annotation and visualisation [J].
Lacoangeli, A. ;
Al Khleifat, A. ;
Sproviero, W. ;
Shatunov, A. ;
Jones, A. R. ;
Morgan, S. L. ;
Pittman, A. ;
Dobson, R. J. ;
Newhouse, S. J. ;
Al-Chalabi, A. .
BMC BIOINFORMATICS, 2019, 20 (1)
[36]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[37]  
Laver T, 2015, Biomol Detect Quantif, V3, P1
[38]   Minimap2: pairwise alignment for nucleotide sequences [J].
Li, Heng .
BIOINFORMATICS, 2018, 34 (18) :3094-3100
[39]   Repetitive DNA sequence detection and its role in the human genome [J].
Liao, Xingyu ;
Zhu, Wufei ;
Zhou, Juexiao ;
Li, Haoyang ;
Xu, Xiaopeng ;
Zhang, Bin ;
Gao, Xin .
COMMUNICATIONS BIOLOGY, 2023, 6 (01)
[40]   Long-read human genome sequencing and its applications [J].
Logsdon, Glennis A. ;
Vollger, Mitchell R. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2020, 21 (10) :597-614