MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

被引:22
|
作者
Liu, Zongzhi [1 ]
Li, Ao [1 ]
Schulz, Vincent [2 ]
Chen, Min [3 ]
Tuck, David [1 ]
机构
[1] Yale Univ, Sch Med, Dept Pathol, New Haven, CT 06510 USA
[2] Yale Univ, Sch Med, Dept Pediat, New Haven, CT 06510 USA
[3] Yale Univ, Sch Med, Dept Epidemiol & Publ Hlth, New Haven, CT 06510 USA
来源
PLOS ONE | 2010年 / 5卷 / 06期
关键词
COMPARATIVE GENOMIC HYBRIDIZATION; HIDDEN-MARKOV MODEL; CANCER-CELLS; ASSOCIATION; DETECT; TOOL;
D O I
10.1371/journal.pone.0010909
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. Methods: We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. Conclusions: We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM.
引用
收藏
页数:14
相关论文
共 4 条
  • [1] Precise inference of copy number alterations in tumor samples from SNP arrays
    Chen, Gary K.
    Chang, Xiao
    Curtis, Christina
    Wang, Kai
    BIOINFORMATICS, 2013, 29 (23) : 2964 - 2970
  • [2] Copy number variation detection using SNP genotyping arrays in three Chinese pig breeds
    Dong, K.
    Pu, Y.
    Yao, N.
    Shu, G.
    Liu, X.
    He, X.
    Zhao, Q.
    Guan, W.
    Ma, Y.
    ANIMAL GENETICS, 2015, 46 (02) : 101 - 109
  • [3] GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
    Li, Ao
    Liu, Zongzhi
    Lezon-Geyda, Kimberly
    Sarkar, Sudipa
    Lannin, Donald
    Schulz, Vincent
    Krop, Ian
    Winer, Eric
    Harris, Lyndsay
    Tuck, David
    NUCLEIC ACIDS RESEARCH, 2011, 39 (12) : 4928 - 4941
  • [4] A genome-wide detection of copy number variation using SNP genotyping arrays in Beijing-You chickens
    Zhou, Wei
    Liu, Ranran
    Zhang, Jingjing
    Zheng, Maiqing
    Li, Peng
    Chang, Guobin
    Wen, Jie
    Zhao, Guiping
    GENETICA, 2014, 142 (05) : 441 - 450