Rare variant phasing and haplotypic expression from RNA sequencing with phASER

被引:0
作者
Stephane E. Castel
Pejman Mohammadi
Wendy K. Chung
Yufeng Shen
Tuuli Lappalainen
机构
[1] New York Genome Center,Department of Systems Biology
[2] Columbia University,Departments of Pediatrics and Medicine
[3] Columbia University,Department of Biomedical Informatics
[4] Columbia University,undefined
来源
Nature Communications | / 7卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Haplotype phasing of genetic variants is important for clinical interpretation of the genome, population genetic analysis and functional genomic analysis of allelic activity. Here we present phASER, an accurate approach for phasing variants that are overlapped by sequencing reads, including those from RNA sequencing (RNA-seq), which often span multiple exons due to splicing. Using diverse RNA-seq data we demonstrate that this provides more accurate phasing of rare variants compared with population-based phasing and allows phasing of variants in the same gene up to hundreds of kilobases away that cannot be obtained from DNA sequencing (DNA-seq) reads. We show that in the context of medical genetic studies this improves the resolution of compound heterozygotes. Additionally, phASER provides measures of haplotypic expression that increase power and accuracy in studies of allelic expression. In summary, phasing using RNA-seq and phASER is accurate and improves studies where rare variant haplotypes or allelic expression is needed.
引用
收藏
相关论文
共 33 条
  • [1] Roach JC(2011)Chromosomal haplotypes by genetic phasing of human families Am. J. Hum. Genet. 89 382-397
  • [2] Delaneau O(2012)A linear complexity phasing method for thousands of genomes Nat. Methods 9 179-181
  • [3] Marchini J(2011)Haplotype phasing: existing methods and new developments Nat. Rev. Genet. 12 703-714
  • [4] Zagury J-F(2014)Whole-genome haplotyping using long reads and statistical methods Nat. Biotechnol. 32 261-266
  • [5] Browning SR(2015)Assembly and diploid architecture of an individual human genome via single-molecule technologies Nat. Methods 12 780-786
  • [6] Browning BL(2015)Tools and best practices for data processing in allelic expression analysis Genome Biol. 16 195-2252
  • [7] Kuleshov V(2013)Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data Bioinformatics 29 2245-i159
  • [8] Pendleton M(2008)HapCUT: an efficient and accurate algorithm for the haplotype assembly problem Bioinformatics 24 i153-696
  • [9] Castel SE(2013)Haplotype estimation using sequencing reads Am. J. Hum. Genet. 93 687-256
  • [10] Levy-Moonshine A(2014)Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants Am. J. Hum. Genet. 95 245-665