Estimates of allele-specific expression in Drosophila with a single genome sequence and RNA-seq data

被引:9
作者
Quinn, Andrew [1 ]
Juneja, Punita [1 ]
Jiggins, Francis M. [1 ]
机构
[1] Univ Cambridge, Dept Genet, Cambridge CB2 3EH, England
基金
欧洲研究理事会;
关键词
HAPLOTYPE; IMBALANCE; GENE; BIAS; TOOL;
D O I
10.1093/bioinformatics/btu342
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Genetic variation in cis-regulatory elements is an important cause of variation in gene expression. Cis-regulatory variation can be detected by using high-throughput RNA sequencing (RNA-seq) to identify differences in the expression of the two alleles of a gene. This requires that reads from the two alleles are equally likely to map to a reference genome(s), and that single-nucleotide polymorphisms (SNPs) are accurately called, so that reads derived from the different alleles can be identified. Both of these prerequisites can be achieved by sequencing the genomes of the parents of the individual being studied, but this is often prohibitively costly. Results: In Drosophila, we demonstrate that biases during read mapping can be avoided by mapping reads to two alternative genomes that incorporate SNPs called from the RNA-seq data. The SNPs can be reliably called from the RNA-seq data itself, provided any variants not found in high-quality SNP databases are filtered out. Finally, we suggest a way of measuring allele-specific expression (ASE) by crossing the line of interest to a reference line with a high-quality genome sequence. Combined with our bioinformatic methods, this approach minimizes mapping biases, allows poor-quality data to be identified and removed and aides in the biological interpretation of the data as the parent of origin of each allele is known. In conclusion, our results suggest that accurate estimates of ASE do not require the parental genomes of the individual being studied to be sequenced.
引用
收藏
页码:2603 / 2610
页数:8
相关论文
共 25 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[3]   Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation [J].
Babak, Tomas ;
Garrett-Engele, Philip ;
Armour, Christopher D. ;
Raymond, Christopher K. ;
Keller, Mark P. ;
Chen, Ronghua ;
Rohl, Carol A. ;
Johnson, Jason M. ;
Attie, Alan D. ;
Fraser, Hunter B. ;
Schadt, Eric E. .
BMC GENOMICS, 2010, 11
[4]   Accurate identification of A-to-I RNA editing in human by transcriptome sequencing [J].
Bahn, Jae Hoon ;
Lee, Jae-Hyung ;
Li, Gang ;
Greer, Christopher ;
Peng, Guangdun ;
Xiao, Xinshu .
GENOME RESEARCH, 2012, 22 (01) :142-150
[5]   HapCUT: an efficient and accurate algorithm for the haplotype assembly problem [J].
Bansal, Vikas ;
Bafna, Vineet .
BIOINFORMATICS, 2008, 24 (16) :I153-I159
[6]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[7]   Polygenic and directional regulatory evolution across pathways in Saccharomyces [J].
Bullard, James H. ;
Mostovoy, Yulia ;
Dudoit, Sandrine ;
Brem, Rachel B. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (11) :5058-5063
[8]   Genomic Imprinting Absent in Drosophila melanogaster Adult Females [J].
Coolon, Joseph D. ;
Stevenson, Kraig R. ;
McManus, C. Joel ;
Graveley, Brenton R. ;
Wittkopp, Patricia J. .
CELL REPORTS, 2012, 2 (01) :69-75
[9]   Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data [J].
Degner, Jacob F. ;
Marioni, John C. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Nkadori, Everlyne ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
BIOINFORMATICS, 2009, 25 (24) :3207-3212
[10]   Systematic Detection of Polygenic cis-Regulatory Evolution [J].
Fraser, Hunter B. ;
Babak, Tomas ;
Tsang, John ;
Zhou, Yiqi ;
Zhang, Bin ;
Mehrabian, Margarete ;
Schadt, Eric E. .
PLOS GENETICS, 2011, 7 (03)