SVsearcher: A more accurate structural variation detection method in long read data

被引:3
|
作者
Zheng, Yan [1 ]
Shang, Xuequn [1 ]
Sung, Wing-Kin [2 ,3 ,4 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, West Youyi Rd 127, Xian 710072, Peoples R China
[2] Chinese Univ Hong Kong, Dept Chem Pathol, Hong Kong, Peoples R China
[3] Hong Kong Genome Inst, Shatin, Hong Kong Sci Pk, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Li Ka Shing Inst Hlth Sci, Lab Computat Genom, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Long-read sequencing data; Structural variations; SV detection; PAIRED-END; VARIANTS; IMPACT; INDELS; CANCER;
D O I
10.1016/j.compbiomed.2023.106843
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Structural variations (SVs) represent genomic rearrangements (such as deletions, insertions, and inversions) whose sizes are larger than 50bp. They play important roles in genetic diseases and evolution mechanism. Due to the advance of long-read sequencing (i.e. PacBio long-read sequencing and Oxford Nanopore (ONT) long-read sequencing), we can call SVs accurately. However, for ONT long reads, we observe that existing long read SV callers miss a lot of true SVs and call a lot of false SVs in repetitive regions and in regions with multi-allelic SVs. Those errors are caused by messy alignments of ONT reads due to their high error rate. Hence, we propose a novel method, SVsearcher, to solve these issues. We run SVsearcher and other callers in three real datasets and find that SVsearcher improves the F1 score by approximately 10% for high coverage (50x) datasets and more than 25% for low coverage (10x) datasets. More importantly, SVsearcher can identify 81.7%-91.8% multi-allelic SVs while existing methods only identify 13.2% (Sniffles)-54.0% (nanoSV) of them. SVsearcher is available at https://github.com/kensung-lab/SVsearcher.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [21] An accurate and powerful method for copy number variation detection
    Xiao, Feifei
    Luo, Xizhi
    Hao, Ning
    Niu, Yue S.
    Xiao, Xiangjun
    Cai, Guoshuai
    Amos, Christopher, I
    Zhang, Heping
    BIOINFORMATICS, 2019, 35 (17) : 2891 - 2898
  • [22] Use of a triple detection method for more accurate polymer analysis
    Ross, S
    AMERICAN LABORATORY, 1999, 31 (15) : 30 - 30
  • [23] Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data
    Yichen Henry Liu
    Can Luo
    Staunton G. Golding
    Jacob B. Ioffe
    Xin Maizie Zhou
    Nature Communications, 15
  • [24] Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data
    Liu, Yichen Henry
    Luo, Can
    Golding, Staunton G.
    Ioffe, Jacob B.
    Zhou, Xin Maizie
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [25] Accurate detection of HBV splice variant DNA by using long-read sequencing
    Hulspas, Sanne
    Knetsch, Cornelis
    Weber, Michiel
    Hout, Anne
    van Doorn, Leen-Jan
    JOURNAL OF HEPATOLOGY, 2022, 77 : S750 - S750
  • [26] Kled: an ultra-fast and sensitive structural variant detection tool for long-read sequencing data
    Zhang, Zhendong
    Jiang, Tao
    Li, Gaoyang
    Cao, Shuqi
    Liu, Yadong
    Liu, Bo
    Wang, Yadong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [27] npInv: accurate detection and genotyping of inversions using long read sub-alignment
    Haojing Shao
    Devika Ganesamoorthy
    Tania Duarte
    Minh Duc Cao
    Clive J. Hoggart
    Lachlan J. M. Coin
    BMC Bioinformatics, 19
  • [28] Research on the Detection Method of Structural Variation based on Next-Generation Sequencing Data
    Yang, Hai
    2019 2ND INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC AND ENGINEERING TECHNOLOGY (MEET 2019), 2019, : 160 - 164
  • [29] nplnv: accurate detection and genotyping of inversions using long read sub-alignment
    Shao, Haojing
    Ganesamoorthy, Devika
    Duarte, Tania
    Minh Duc Cao
    Hoggart, Clive J.
    Coin, Lachlan J. M.
    BMC BIOINFORMATICS, 2018, 19
  • [30] Seeksv: an accurate tool for somatic structural variation and virus integration detection
    Liang, Ying
    Qiu, Kunlong
    Liao, Bo
    Zhu, Wen
    Huang, Xuanlin
    Li, Lin
    Chen, Xiangtao
    Li, Keqin
    BIOINFORMATICS, 2017, 33 (02) : 184 - 191