Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

被引:13
|
作者
Wood, David L. A. [1 ]
Nones, Katia [1 ]
Steptoe, Anita [1 ]
Christ, Angelika [1 ]
Harliwong, Ivon [1 ]
Newell, Felicity [1 ]
Bruxner, Timothy J. C. [1 ]
Miller, David [1 ]
Cloonan, Nicole [2 ]
Grimmond, Sean M. [1 ,3 ]
机构
[1] Univ Queensland, Queensland Ctr Med Genom, Brisbane, Qld, Australia
[2] QIMR Berghofer Med Res Inst, Herston, Qld 4006, Australia
[3] Univ Glasgow, Translat Res Ctr, Glasgow, Lanark, Scotland
来源
PLOS ONE | 2015年 / 10卷 / 05期
基金
澳大利亚研究理事会;
关键词
HUMAN GENOME; TRANSCRIPTOME; HUMANS; METHYLATION; IMBALANCE; SEQUENCE; DISEASE; READS; RISK;
D O I
10.1371/journal.pone.0126911
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual's phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean
    Yim, Aldrin Kay-Yuen
    Wong, Johanna Wing-Hang
    Ku, Yee-Shan
    Qin, Hao
    Chan, Ting-Fung
    Lam, Hon-Ming
    PLOS ONE, 2015, 10 (09):
  • [32] SFARI genes and where to find them; modelling Autism Spectrum Disorder specific gene expression dysregulation with RNA-seq data
    Arpi, Magdalena Navarro Torres
    Simpson, T. Ian
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [33] Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data
    Deng, Wenjiang
    Mou, Tian
    Kalari, Krishna R.
    Niu, Nifang
    Wang, Liewei
    Pawitan, Yudi
    Trung Nghia Vu
    BIOINFORMATICS, 2020, 36 (03) : 805 - 812
  • [34] Comparative evaluation of isoform-level gene expression estimation algorithms for RNA-seq and exon-array platforms
    Dapas, Matthew
    Kandpal, Manoj
    Bi, Yingtao
    Davuluri, Ramana V.
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (02) : 260 - 269
  • [35] Cell type-specific gene expression profiling in brain tissue: comparison between TRAP, LCM and RNA-seq
    Kim, TaeHyun
    Lim, Chae-Seok
    Kaang, Bong-Kiun
    BMB REPORTS, 2015, 48 (07) : 388 - 394
  • [36] Haplotype-resolved genome assembly and allele-specific gene expression in cultivated ginger
    Cheng, Shi-Ping
    Jia, Kai-Hua
    Liu, Hui
    Zhang, Ren-Gang
    Li, Zhi-Chao
    Zhou, Shan-Shan
    Shi, Tian-Le
    Ma, Ai-Chu
    Yu, Cong-Wen
    Gao, Chan
    Cao, Guang-Lei
    Zhao, Wei
    Nie, Shuai
    Guo, Jing-Fang
    Jiao, Si-Qian
    Tian, Xue-Chan
    Yan, Xue-Mei
    Bao, Yu-Tao
    Yun, Quan-Zheng
    Wang, Xin-Zhu
    Porth, Ilga
    El-Kassaby, Yousry A.
    Wang, Xiao-Ru
    Li, Zhen
    Van de Peer, Yves
    Mao, Jian-Feng
    HORTICULTURE RESEARCH, 2021, 8 (01)
  • [37] Strand-Specific RNA-Seq Provides Greater Resolution of Transcriptome Profiling
    Mills, James Dominic
    Kawahara, Yoshihiro
    Janitz, Michael
    CURRENT GENOMICS, 2013, 14 (03) : 173 - 181
  • [38] RNAflow: An Effective and Simple RNA-Seq Differential Gene Expression Pipeline Using Nextflow
    Lataretu, Marie
    Hoelzer, Martin
    GENES, 2020, 11 (12) : 1 - 17
  • [39] Methods for Quantifying Gene Expression in Ecoimmunology: From qPCR to RNA-Seq
    Fassbinder-Orth, Carol A.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2014, 54 (03) : 396 - 406
  • [40] Critical Evaluation of Imprinted Gene Expression by RNA-Seq: A New Perspective
    DeVeale, Brian
    van der Kooy, Derek
    Babak, Tomas
    PLOS GENETICS, 2012, 8 (03):