Improved detection algorithm for copy number variations based on hidden Markov model

Cited by: 0
Authors
Hai Yang
Daming Zhu
Affiliation
[1] School of Computer Science and Technology, Shandong University
Source
Multimedia Tools and Applications | 2020 / Vol. 79
Keywords
Detection algorithm; Copy number variation; Hidden Markov model; Split read
DOI
Not available
Abstract
To address the problems of parameter optimization and insufficient use of split reads in copy number variation (CNV) detection, this paper proposes a new definition of relative read depth (RRD) and a randomized sampling strategy (RGN). Compared with raw read depth, RRD is only weakly correlated with GC content, mappability, and the width of the analysis windows tiled along the genome. The RGN strategy builds on weighted sampling, which speeds up the analysis of read count data. On this basis, we propose an improved CNV detection algorithm based on a hidden Markov model (CNV-HMM). The HMM detects abnormal signals in the read count data and outputs a set of candidate CNVs. As a final step, the candidate CNVs are filtered using split reads to improve the performance of CNV-HMM. Experimental results show that CNV-HMM achieves higher sensitivity and accuracy in CNV detection than most current detection algorithms and is applicable to both diploid animals and plants.
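The HMM step described in the abstract can be illustrated with a minimal sketch. This is not the authors' CNV-HMM: the abstract does not give the RRD definition, the state set, or any parameters, so everything below is an assumption made for illustration. The sketch assumes three hidden states (deletion, normal, duplication), Gaussian emissions centered on depth ratios of 0.5, 1.0, and 1.5 for a diploid genome, a fixed probability of staying in the same state, and an input of per-window depth ratios already normalized so that a copy-neutral window is near 1.0. The names `viterbi`, `call_segments`, `MEANS`, `SIGMA`, and `STAY` are hypothetical.

```python
import math

# All values below are illustrative assumptions, not parameters from the paper.
STATES = ("deletion", "normal", "duplication")
MEANS = (0.5, 1.0, 1.5)   # assumed expected depth ratio per state (diploid)
SIGMA = 0.15              # assumed emission standard deviation
STAY = 0.98               # assumed probability of remaining in the same state


def log_gauss(x, mu, sigma):
    """Log density of a normal distribution at x."""
    return -0.5 * ((x - mu) / sigma) ** 2 - math.log(sigma * math.sqrt(2 * math.pi))


def viterbi(depth_ratios):
    """Return the most likely state label for each window (Viterbi decoding)."""
    if not depth_ratios:
        return []
    n = len(STATES)
    log_stay = math.log(STAY)
    log_move = math.log((1 - STAY) / (n - 1))
    # Uniform start distribution over the states.
    score = [math.log(1 / n) + log_gauss(depth_ratios[0], MEANS[s], SIGMA)
             for s in range(n)]
    back = []  # back[t][s] = best predecessor of state s at window t + 1
    for x in depth_ratios[1:]:
        new_score, pointers = [], []
        for s in range(n):
            best_prev = max(
                range(n),
                key=lambda p: score[p] + (log_stay if p == s else log_move),
            )
            trans = log_stay if best_prev == s else log_move
            new_score.append(score[best_prev] + trans + log_gauss(x, MEANS[s], SIGMA))
            pointers.append(best_prev)
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    state = max(range(n), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return [STATES[s] for s in path]


def call_segments(labels):
    """Collapse window labels into (start, end, state) runs; end is exclusive."""
    segments = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            if labels[start] != "normal":
                segments.append((start, i, labels[start]))
            start = i
    return segments


if __name__ == "__main__":
    ratios = [1.0] * 10 + [0.5] * 5 + [1.0] * 10 + [1.5] * 5 + [1.0] * 5
    print(call_segments(viterbi(ratios)))
```

In a pipeline like the one the abstract outlines, the segments returned here would correspond to the candidate CNVs, which would then be filtered against split-read evidence; that filtering step is not sketched because the abstract gives no detail about it.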
Pages: 9237-9253
Page count: 16