An automated quality control pipeline for eQTL analysis with RNA-seq data

Cited by: 0
Authors
Wang, Tao [1]
Ruan, Junpeng [2]
Yin, Quanwei [2]
Dong, Xianjun [3]
Wang, Yadong [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[3] Harvard Med Sch, Brigham & Womens Hosp, Boston, MA 02115 USA
Source
2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM) | 2019
Keywords
eQTL; quality control; pipeline; RNA-seq; genotype; GENOME-WIDE ASSOCIATION; GENE-EXPRESSION; METAANALYSIS
DOI
Not available
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Expression quantitative trait loci (eQTL) analysis is critical for understanding the mechanisms underlying trait-associated variants. Evaluating and controlling the quality of the transcript and genotype data that form the basis of eQTL analysis remains challenging for researchers with limited computational backgrounds. There is a strong need for a user-friendly and comprehensive tool that preprocesses these data sets automatically. Here we propose such a solution, eQTLQC, an automated quality control pipeline for preprocessing both RNA-seq and genotype data. The eQTLQC pipeline provides multiple informative quality control measurements and data normalization approaches. It also provides an easy-to-use configuration file that lets users flexibly set parameters and control the pipeline. We demonstrate its utility by performing RNA-seq and genotype preprocessing on real data sets.
Pages: 1780-1786
Page count: 7