Adaptive Savitzky-Golay Filters for Analysis of Copy Number Variation Peaks from Whole-Exome Sequencing Data

被引:6
作者
Ochieng, Peter Juma [1 ]
Maroti, Zoltan [2 ,3 ]
Dombi, Jozsef [1 ]
Kresz, Miklos [4 ,5 ,6 ]
Bekesi, Jozsef [1 ]
Kalmar, Tibor [2 ,3 ]
机构
[1] Univ Szeged, Inst Informat, 2 Arpad Ter, H-6720 Szeged, Hungary
[2] Univ Szeged, Albert Szent Gyorgy Hlth Ctr, Dept Pediat, H-6725 Szeged, Hungary
[3] Univ Szeged, Pediat Hlth Ctr, H-6725 Szeged, Hungary
[4] InnoRenew CoE, Livade 6, Izola 6310, Slovenia
[5] Univ Primorska, Andrej Marusic Inst, Muzejski Trg 2, Koper 6000, Slovenia
[6] Univ Szeged, Dept Appl Informat, Boldogasszony Sgt 6, H-6725 Szeged, Hungary
关键词
copy number variation; read depth; adaptive Savitzky-Golay;
D O I
10.3390/info14020128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Copy number variation (CNV) is a form of structural variation in the human genome that provides medical insight into complex human diseases; while whole-genome sequencing is becoming more affordable, whole-exome sequencing (WES) remains an important tool in clinical diagnostics. Because of its discontinuous nature and unique characteristics of sparse target-enrichment-based WES data, the analysis and detection of CNV peaks remain difficult tasks. The Savitzky-Golay (SG) smoothing is well known as a fast and efficient smoothing method. However, no study has documented the use of this technique for CNV peak detection. It is well known that the effectiveness of the classical SG filter depends on the proper selection of the window length and polynomial degree, which should correspond with the scale of the peak because, in the case of peaks with a high rate of change, the effectiveness of the filter could be restricted. Based on the Savitzky-Golay algorithm, this paper introduces a novel adaptive method to smooth irregular peak distributions. The proposed method ensures high-precision noise reduction by dynamically modifying the results of the prior smoothing to automatically adjust parameters. Our method offers an additional feature extraction technique based on density and Euclidean distance. In comparison to classical Savitzky-Golay filtering and other peer filtering methods, the performance evaluation demonstrates that adaptive Savitzky-Golay filtering performs better. According to experimental results, our method effectively detects CNV peaks across all genomic segments for both short and long tags, with minimal peak height fidelity values (i.e., low estimation bias). As a result, we clearly demonstrate how well the adaptive Savitzky-Golay filtering method works and how its use in the detection of CNV peaks can complement the existing techniques used in CNV peak analysis.
引用
收藏
页数:21
相关论文
共 45 条
  • [41] Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data
    Johannes Smolander
    Sofia Khan
    Kalaimathy Singaravelu
    Leni Kauko
    Riikka J. Lund
    Asta Laiho
    Laura L. Elo
    BMC Genomics, 22
  • [42] Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data
    Smolander, Johannes
    Khan, Sofia
    Singaravelu, Kalaimathy
    Kauko, Leni
    Lund, Riikka J.
    Laiho, Asta
    Elo, Laura L.
    BMC GENOMICS, 2021, 22 (01)
  • [43] High Comorbidity of Pediatric Cancers in Patients with Birth Defects: Insights from Whole Genome Sequencing Analysis of Copy Number Variations
    Qu, Hui-Qi
    Glessner, Joseph T.
    Qu, Jingchun
    Liu, Yichuan
    Watson, Deborah
    Chang, Xiao
    Saeidian, Amir Hossein
    Qiu, Haijun
    Mentch, Frank
    Connolly, John J.
    Kakoharson, Hakon
    TRANSLATIONAL RESEARCH, 2024, 266 : 49 - 56
  • [44] Chromosomal Copy Number Variation Analysis in Pregnancy Products from Recurrent and Sporadic Miscarriage Using Next-Generation Sequencing
    Xia Zhang
    Heming Wu
    Zhonghang Gu
    Zhikang Yu
    Liubing Lan
    Qingyan Huang
    Reproductive Sciences, 2022, 29 : 2927 - 2936
  • [45] Chromosomal Copy Number Variation Analysis in Pregnancy Products from Recurrent and Sporadic Miscarriage Using Next-Generation Sequencing
    Zhang, Xia
    Wu, Heming
    Gu, Zhonghang
    Yu, Zhikang
    Lan, Liubing
    Huang, Qingyan
    REPRODUCTIVE SCIENCES, 2022, 29 (10) : 2927 - 2936