SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data

被引:7
|
作者
Zheng, Yan [1 ]
Shang, Xuequn [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, West Youyi Rd 127, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Long-read sequencing data; Structural variations; SV caller; Deep learning; PAIRED-END; IMPACT; VARIANTS; INDELS; CANCER;
D O I
10.1186/s12859-023-05324-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Structural variations (SVs) refer to variations in an organism's chromosome structure that exceed a length of 50 base pairs. They play a significant role in genetic diseases and evolutionary mechanisms. While long-read sequencing technology has led to the development of numerous SV caller methods, their performance results have been suboptimal. Researchers have observed that current SV callers often miss true SVs and generate many false SVs, especially in repetitive regions and areas with multi-allelic SVs. These errors are due to the messy alignments of long-read data, which are affected by their high error rate. Therefore, there is a need for a more accurate SV caller method. Result: We propose a new method-SVcnn, a more accurate deep learning-based method for detecting SVs by using long-read sequencing data. We run SVcnn and other SV callers in three real datasets and find that SVcnn improves the F1-score by 2-8% compared with the second-best method when the read depth is greater than 5x. More importantly, SVcnn has better performance for detecting multi-allelic SVs. Conclusions: SVcnn is an accurate deep learning-based method to detect SVs. The program is available at https://github.com/nwpuzhengyan/SVcnn.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Deep learning-based recovery method for missing structural temperature data using LSTM network
    Liu, Hao
    Ding, You-Liang
    Zhao, Han-Wei
    Wang, Man-Ya
    Geng, Fang-Fang
    STRUCTURAL MONITORING AND MAINTENANCE, AN INTERNATIONAL JOURNAL, 2020, 7 (02): : 109 - 124
  • [22] Computer vision and deep learning-based data anomaly detection method for structural health monitoring
    Bao, Yuequan
    Tang, Zhiyi
    Li, Hui
    Zhang, Yufeng
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2019, 18 (02): : 401 - 421
  • [23] On detection of somatic structural variation in highly repetitive regions using long-read sequencing data
    Shiraishi, Yuichi
    CANCER SCIENCE, 2024, 115 : 31 - 31
  • [24] Deep Learning-Based Method for Detecting Traffic Flow Parameters Under Snowfall
    Jian, Cheng
    Xie, Tiancheng
    Hu, Xiaojian
    Lu, Jian
    JOURNAL OF IMAGING, 2024, 10 (12)
  • [25] SpliceSCANNER: An Accurate and Interpretable Deep Learning-Based Method for Splice Site Prediction
    Wang, Rongxing
    Xu, Junwei
    Huang, Xiaodi
    Qi, Wangjing
    Zhang, Yanju
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 447 - 459
  • [26] A deep learning-based method for detecting and identifying surface defects in polyimide foam
    Song, Xianhui
    Hu, Guangzhong
    Lu, Jing
    Tuo, Xianguo
    Li, Yuedong
    IET IMAGE PROCESSING, 2025, 19 (01)
  • [27] A cloud-based pipeline for analysis of FHIR and long-read data
    Dunn, Tim
    Cosgun, Erdal
    NEURO-ONCOLOGY ADVANCES, 2023, 5 (01)
  • [28] Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data
    Yichen Henry Liu
    Can Luo
    Staunton G. Golding
    Jacob B. Ioffe
    Xin Maizie Zhou
    Nature Communications, 15
  • [29] Tradeoffs in alignment and assembly-based methods for structural variant detection with long-read sequencing data
    Liu, Yichen Henry
    Luo, Can
    Golding, Staunton G.
    Ioffe, Jacob B.
    Zhou, Xin Maizie
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [30] Deep reinforcement learning-based sampling method for structural reliability assessment
    Xiang, Zhengliang
    Bao, Yuequan
    Tang, Zhiyi
    Li, Hui
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 199